prateek_715

Prateek T

@prateek_715

Data Engineer

India
English, Hindi
About me
I am a Data Engineer with hands-on experience in PySpark, Kafka, Python, SQL, and the Hadoop ecosystem. Currently, I build large-scale data pipelines and ETL workflows at Infosys, focusing on medallion architecture and Spark optimization. I have a strong foundation in ML-powered data products and experience taking projects from EDA to deployed APIs.

Skills

Average response time: 1 hour


Formulas & Macros
I will solve your Excel problems

Work experience

Infosys

Data Engineer

Infosys • Full-time

Sep 2025 - Present • 10 mos

Deployed on the Databricks platform; helped build production pipelines processing daily datasets of 2–9 GB (7–12 million rows). Designed schema transformations for medallion architecture, engineered PySpark optimizations (partition pruning, shuffle hash joins, broadcast joins), and implemented data serialization tuning; these optimizations reduced job execution time by up to 20% in some pipelines. Led data quality validation, schema design improvements, and schema evolution to accommodate upstream data changes. Worked cross-functionally with the team lead and senior engineers on parallelism optimization strategies.