Results-oriented Data Engineer with 5+ years of experience delivering scalable, cloud-native data solutions across AWS, Microsoft Azure, and Google Cloud Platform (GCP). Certified in all three platforms, I specialize in PySpark, DBT, Snowflake, Python, and SQL to design, build, and optimize data pipelines that drive real-time analytics and data-driven decisions.
I excel in end-to-end pipeline development, from data ingestion and transformation to validation and optimization, ensuring data quality, performance, and scalability. With strong data modeling expertise and a focus on business impact, I am committed to building efficient, high-performance data architectures.
With a proven track record of managing complex workflows and enabling actionable insights, I help organizations leverage data for smarter, faster decision-making.
Years Experience
Projects Completed
Certifications
Developed an end-to-end data pipeline using AWS services and Airflow to power reporting on data pulled from the Google Sheets API.
Implemented a real-time data ingestion system with Spark Streaming for processing high-velocity data.
Developed a machine learning model for predicting flight delays with 85% accuracy using historical data.
Utilized R and clustering techniques to segment customers for targeted marketing campaigns.
+91 9502192674
Pathapatnam, Andhra Pradesh, India