Harish Kesava Rao
Principal Data Engineer & Data Architect @ Atlassian
Previously: Data Engineering @ Databricks, Amazon, Indeed, Informatica
Hello, I am a Principal Data Engineer & Data Architect for the Data Engineering organization at Atlassian.
My most recent role prior to Atlassian was Staff Software Engineer and Data Engineering Team Lead for Security at Databricks, where I was involved in overseeing the execution of multi-quarter Data Engineering projects, including the design, automation and scaling of multi-TB/day data pipelines on AWS and Azure.
Earlier, I held Senior Data Engineering (Individual Contributor) roles at Salesforce’s Tableau, Amazon’s Prime Video and Indeed.com. At Amazon, I was hired as the first Data Engineer to help build a Data Lake for Prime Video Search. At Indeed, I worked with the Marketing, Sales and Finance teams to help build their data warehouses.
Prior to Indeed, I held various roles at Informatica, focusing mainly on building ETL pipelines between relational databases and MPP databases.
My interests and key areas of work include:
- Understanding short-term & long-term data engineering requirements and converting them into high-level data architecture roadmaps for execution.
- Designing, building and scaling big data pipelines to ingest large volumes of data using Spark on AWS and Azure. I also have experience deploying big data pipelines on-premise.
- Performance tuning for high-volume ingestion pipelines with Spark Streaming, AWS Kinesis and Azure Event Hubs on Databricks.
- Automating workflows (with Apache Airflow and equivalent cloud tooling) and automating cloud infrastructure management using Terraform (HCL).
- Experience in configuring and deploying Docker containers for specialized use cases.
- Extensive experience in writing Data Engineering code (primarily Python and Scala), adhering to good software engineering practices.
- I also contribute to Open Source projects on a regular basis. I developed the first DAG Sensors for the Databricks Provider in Apache Airflow.
news
Apr 29, 2024 | Joined Atlassian India as Principal Data Engineer & Data Architect. |
---|---|
Apr 30, 2023 | Created the Databricks Partition Sensor (for the Databricks Provider) for Apache Airflow. |
Apr 2, 2023 | First major contribution to Apache Airflow – Databricks SQL Sensor for Airflow. |
May 2, 2022 | Joined Databricks as Staff Software Engineer and Data Engineering Team Lead for Security. |
Oct 22, 2020 | Joined Amazon Prime Video as their first (Senior) Data Engineer for Prime Video Search. |
latest posts
Mar 1, 2023 | Building a data lake on Microsoft Azure. |
---|---|
Jun 1, 2021 | Building a data lake on Amazon Web Services. |
Nov 23, 2019 | Deploying on-premise big data pipelines. |