Home
Welcome!
I'm Pradeep Kalluri, a Data Engineer specializing in building scalable cloud data platforms and production-grade data pipelines.
Currently at NatWest Bank in London, designing and delivering reliable data engineering solutions that power analytics and business intelligence across the organization.
What I Do
I build end-to-end data platforms that transform raw data into actionable insights:
- Data Ingestion - Real-time streaming (Kafka) and batch processing from cloud storage (S3, Azure Data Lake)
- Distributed Processing - Large-scale data transformation using PySpark and Databricks
- Data Warehousing - Building curated datasets in Snowflake with optimized data models
- Pipeline Orchestration - Workflow automation and monitoring with Apache Airflow
- Analytics Engineering - dbt transformations, data quality frameworks, BI integration
Technical Stack
Languages: Python (PySpark, Pandas), SQL, Shell Scripting
Cloud Platforms: AWS (S3, Glue, Lambda), Azure (Databricks, ADF, Data Lake), Microsoft Fabric
Data Engineering: Apache Kafka, Apache Airflow, Snowflake, dbt, ETL/ELT pipelines
Databases: Snowflake, Azure SQL, Redshift, PostgreSQL, MySQL
DevOps: Docker, Terraform, CI/CD (GitHub Actions, Azure DevOps), Git
BI Tools: Tableau, Power BI
Recent Achievements
Technical Writing
Published on The New Stack, with 71,000+ views across platforms
- The Weekend Our Pipeline Processed the Same Data 47 Times - Published on The New Stack (Jan 2026)
- A Beginner's Guide to Contributing to Apache Airflow - Published on Apache Airflow Official Medium (Feb 2026)
- Why 71,000 Data Engineers Read My Article - Lessons on technical writing (Dec 2025)
- 5 Data Pipeline Mistakes That Cost Me Weeks - Production debugging stories (Dec 2025)
- Data Quality at Scale - 71,000 views! (Nov 2025)
- From Raw to Refined: Data Pipeline Architecture - Scalable pipeline design (Nov 2025)
Published on The New Stack and Apache Airflow Official Medium, cross-posted on Dev.to, and discussed on Reddit's r/dataengineering
Speaking
Oxford Microsoft Data Platform Group - January 21, 2026 - COMPLETED
Topic: "From Raw to Refined: Building Production Data Pipelines That Scale"
Audience: 50+ registrations from industry leaders
Presented on production data pipeline architecture to an audience of data engineers. Received positive feedback from a Microsoft Senior Cloud Solution Architect and was invited back for a dedicated Apache Airflow session.
13 conference proposals submitted to data engineering conferences and meetups across Europe
View all speaking engagements
Open Source Contributions
Apache Airflow - 3 merged PRs and 5+ completed PR reviews:
- Data masking documentation (PR #58587) - MERGED
- Pool name validation fix (PR #59938) - MERGED
- Bug fix contribution (PR #61005) - MERGED
dbt-core - 1 merged PR and 2 active contributions:
- Fixed @requires.catalogs decorator for compile command (PR #12388) - MERGED (Feb 2026)
- dbt init UX fix (PR #12232) - Under review
- Debug compilation error fix (PR #12502) - Under review
Certifications
Microsoft Fabric Data Engineer Associate - January 4, 2026 (View credential)
SnowPro Core (COF-C03) - Score: 923/1000 - February 16, 2026 (View credential)
Currently
- Building production data pipelines at NatWest Bank processing millions of transactions daily
- Writing about data engineering on Medium (71K+ views) and Dev.to
- Contributing to Apache Airflow (3 merged PRs) and dbt-core open source projects
- Speaking at data engineering meetups and conferences (Oxford Microsoft Data Platform Group - Jan 2026)
- Mentoring data engineers on Topmate through "The Career Launcher" service
- Pursuing UK Global Talent Visa in Digital Technology
Professional Experience
NatWest Bank (Sep 2025 - Present) - Data Engineer
Building scalable data platforms with Kafka, PySpark, Snowflake, and Airflow
Accenture (Jul 2023 - Aug 2025) - Data Engineer
Delivered enterprise cloud data solutions across Azure and AWS for major clients
Dpoint Group (May 2022 - Jun 2023) - Data Engineer
Developed BI solutions and ETL pipelines supporting operational analytics
Connect With Me
Email • LinkedIn • GitHub • Medium • Dev.to
Based in London, United Kingdom
Passionate about building reliable, scalable data platforms that empower data-driven decision making.