πŸ‘‹ Hi, I’m Jing You

Data Engineering & Analytics Engineering | Microsoft Fabric | Power BI | Python | SQL I build scalable data systems that turn raw information into actionable insights.
My background blends business operations, customer understanding, and technical engineering β€” giving me a unique edge in solving real-world data problems.


⚑ Core Skills

Data Engineering

Technologies

Currently Learning


πŸ’Ό Featured Projects


πŸ“Œ NPPES Medical Provider Data Pipeline

Tech Stack: Python, AWS S3, PostgreSQL, Docker
A full batch ETL pipeline processing 8.7 million medical provider records.

Highlights

πŸ‘‰ View Project β†’


πŸ“Œ Weather Data Integration Pipeline

Tech Stack: Python, REST API, Pandas, PostgreSQL, DuckDB, Docker
A multi-source data pipeline combining real-time weather data with station metadata.

Core Features

merged = weather_df.merge(stations_df, on='city', how='inner')
merged.to_parquet('weather_clean.parquet')
# Data Validation with DuckDB
con.execute("""
    SELECT COUNT(*) as null_count
    FROM weather WHERE temperature_f IS NULL
""")

Project Highlights

Business Impact

Enables analysis of weather-driven business patterns:


🧠 How I Think About Data Engineering

Systems Over Scripts

I design pipelines that are:

Data Quality First

Bad data = bad decisions.
I prioritize:

Business-Aware Engineering

My hospitality and operations background helps me:


πŸ“¬ Contact

Open to Data Engineer / Analytics Engineer / BI roles (Nashville or Remote)