
Talha H
Data Platform Engineer
Skills

See my services


Portfolio
Work experience
Data Platform Engineer
ACE Money Transfer • Full-time
Oct 2024 - Present • 1 yr 7 mos
Designed and built a modern data platform from scratch, ensuring scalability and performance. Developed a custom real-time CDC replication solution for both SQL and NoSQL databases. Optimized cloud cost and data architecture, improving efficiency and reducing expenses. Translated business requirements into meaningful schemas and data models. Delivered 10+ end-to-end data projects integrating data from diverse sources and platforms. AWS Glue PySpark Job to Transforms and curates raw data into structured format. Using DBT and Pyspark for modelling and storing processed data in clickhouse for analysis. Using Airflow and Glue Workflows for the orchestration and workflows. Optimize data integrity, consistency, scalability, and performance in database designs. Develop ER diagrams, data dictionaries, and metadata documentation. Implement data security, data governance & compliance measures (GDPR, PII ).
Data Engineer
BI Cube
May 2023 - Sep 2024 • 1 yr 4 mos
Responsibilities: > Designed and implemented robust data extraction processes. > Fivetran to enhance data extraction efficiency and reliability. > Managed data loading into the Snowflake data warehouse, ensuring data was structured and ready for analysis. > Maintained data integrity and security throughout the loading process. > Utilized DBT (Data Build Tool) for data transformation. > Conducted thorough evaluations of tools to optimize workflows. > Recognized the advantages of Airflow over DBT Cloud. > Advocated for transitioning to Airflow due to its open-source nature and extensive functionality, offering capabilities beyond DBT Cloud. > Created structured and readily usable datasets for analytics and reporting. > Leveraged the Sigma reporting tool for advanced analytics. > Generated critical reports, including technician performance, monthly revenue, and job costing reports. > Analyzed critical reports, including technician performance, monthly revenue, and job costing reports, to extract actionable insights. > Leveraged Sigma reporting tool for advanced analytics and visualization of key metrics and KPIs > Continuously monitored data integrity and identified areas for improvement in data quality and consistency. > Generated Reports and Dashboards as needed to address specific business questions and requirements.
Data Engineer
DatatalksClub
Jan 2023 - Jun 2023 • 5 mos
Created a complete ETL pipeline using Apache Airflow, automating ingestion and transformation of NewYork Taxi data with 99% success rate. Extracted and staged data from public APIs into PostgreSQL, improving pipeline stability and reducing data loss to less than 0.1%. Established ETL pipelines on Google Cloud Platform (BigQuery, Dataform, Cloud Storage, Cloud Functions, Cloud Composer ) improving understanding of scalable data processing by 80%.