Open Source Contributions
Active contributions to Apache Airflow and Delta Lake, focused on data infrastructure, observability, and provider tooling.
I contribute to open-source projects in the data engineering and AI/ML infrastructure space. Contributions span provider tooling, observability improvements, and platform compatibility fixes.
Apache Airflow
Databricks provider, Google provider, Snowflake provider
DatabricksPartitionSensor: authored and merged; enables partition-level sensing on Databricks tables for dependency management in Airflow DAGs
DatabricksSQLSensor: authored and merged; SQL-based sensing for Databricks workflows
Fix malformed URI† (Snowflake Connector)
Google BigQuery & PubSub provider unit test fixes†
Google Sheets & Simple HTTP unit test fixes†
Note: the linked contributions above marked with † were made under @harishkrao.
Delta Lake
delta-io/delta
Improved error observability in SnapshotManager.getLogSegmentForVersion: enhanced diagnostics for version resolution failures, so users hitting log segment errors can debug them faster (PR approved, pending merge)