Open Source Contributions

Active contributions to Apache Airflow and Delta Lake, focused on data infrastructure, observability, and provider tooling.

I contribute to open-source projects in the data engineering and AI/ML infrastructure space. Contributions span provider tooling, observability improvements, and platform compatibility fixes.

Apache Airflow

Databricks provider, Google provider, Snowflake provider

DatabricksPartitionSensor: authored and merged; enables partition-level sensing on Databricks tables for dependency management in Airflow DAGs
DatabricksSQLSensor: authored and merged; SQL-based sensing for Databricks workflows
Fix malformed URI† (Snowflake Connector)
Google BigQuery & PubSub provider unit test fixes†
Google Sheets & Simple HTTP unit test fixes

Note: contributions linked above and marked with † were made under @harishkrao.

Delta Lake

delta-io/delta

Improved error observability in SnapshotManager.getLogSegmentForVersion: enhanced diagnostics for version resolution failures, so that users hitting log segment errors can debug them faster (PR approved, pending merge)