Harish Kesava Rao

I am a Data Engineer, with a majority of my recent roles involving Data Platforms. My journey in data started in technical presales in the relational world, involving databases such as Oracle, Postgres and SQL Server, ETL tooling such as Ab Initio or Informatica and eventually moving onto on-premise big data with Hadoop, Hive, Presto, Spark, Kubernetes and Docker. Later, I began working with the cloud on AWS, Azure and Databricks, also managing infrastructure with AWS CDK or Terraform. More recently, I have been utilizing my skills to cater to AI and ML use cases to power agents, and ML models for sentiments, themes and features. I have worked in teams of different sizes and objectives, ranging from one member teams to 30 member teams, across a variety of tech stacks. I am also an open-source contributor and I blog about data.

latest posts

Dec 29, 2025	From Experimental Notebooks to Production: A Data Engineer's perspective of Scaling Data Science Applications
Aug 4, 2025	Building a Governed Data Lakehouse
May 10, 2025	A Practitioner's Journey with Databricks AI/BI Genie
Dec 13, 2024	Data Architecture and Modeling — A Primer
Mar 1, 2023	Building a data lake on Microsoft Azure.