Home
Welcome!
I'm Pradeep Kalluri, a Data Engineer specializing in building scalable cloud data platforms and production-grade data pipelines.
I currently work at NatWest Bank in London, building reliable, scalable data flows across modern data platforms to support analytics and reporting across the business.
What I Do
- Data Ingestion & Processing - Kafka, PySpark, Amazon S3, Snowflake
- Pipeline Orchestration - Airflow DAGs, workflow automation
- Cloud Data Platforms - Azure (Databricks, ADF, Data Lake), AWS (S3, Glue)
- Data Transformation - ETL/ELT pipelines, dbt, SQL, Python
- Analytics & Visualization - Tableau and Power BI dashboards, business intelligence
Technical Stack
Cloud Platforms: AWS, Azure, Microsoft Fabric
Data Engineering: PySpark, Kafka, Snowflake, Azure Databricks, ADF
Orchestration: Apache Airflow, Azure Data Factory
ETL/ELT: SSIS, Python, SQL, dbt
Visualization: Tableau, Power BI
Tools: Confluence, Jira, Git
Recent Work
Technical Writing:
- From Raw to Refined: Data Pipeline Architecture at Scale - Medium, Nov 2024
- Cross-posted on Dev.to
Open Source Contributions:
- dbt-core - Improved user experience after project initialization (PR #12190) - Nov 2024
- Apache Airflow - Enhanced documentation for data masking features (PR #58587) - Nov 2024
Speaking:
- Submitted talks to 5 data engineering conferences and meetups across the UK
Currently
- Building production data pipelines at NatWest Bank
- Writing about data engineering on Medium and Dev.to
- Contributing to open-source projects: dbt-core, Apache Airflow
- Pursuing speaking opportunities at data engineering meetups
Experience Highlights
- NatWest Bank - Building scalable data flows with Kafka, PySpark, Snowflake, and Airflow
- Accenture - Delivered enterprise cloud data solutions across Azure and AWS
- Capgemini - Developed data engineering solutions for enterprise clients
Connect With Me
| GitHub | Medium | Dev.to |
Based in London, United Kingdom