Data Engineer & Architect

Sohaib Zafar Ansari

I design and build data infrastructure that scales — from pipelines to analytics platforms.

SnowflakedbtAirflowAzurePython

ABOUT

Building data solutions that matter

I’m a data engineer with a passion for turning raw, messy data into reliable, scalable systems. My background spans data architecture, pipeline engineering, and analytics — with a sharp focus on performance and maintainability.

I thrive on fast learning and adapt quickly to new technologies. Currently sharpening my expertise in data science and machine learning to bridge the gap between engineering and insight.

EXPERTISE

What I work with

🗄️
Data Engineering
Apache Spark · Apache Airflow · dbt · PySpark
☁️
Cloud & Warehousing
Snowflake · BigQuery · AWS (S3, Glue, Redshift) · Azure
📊
Analytics & BI
Tableau · Power BI · Looker · SQL · Excel
🐍
Programming
Python · SQL · Bash · R
🔁
Data Architecture
Data Modelling · ETL/ELT · Data Lakes · Lakehouse · CDC
🤖
ML & Data Science
Scikit-learn · Pandas · NumPy · Feature Engineering

WHAT I BRING

“I bring high enthusiasm, fresh perspective, and a genuine hunger to solve real data challenges — not just maintain the status quo.”

8+
Years in data engineering
10+
Data projects delivered
5+
Tools & platforms