ShaanSatsangi.
CS undergrad → Aspiring Data Unicorn
Building the future with Data Engineering, ML, and Analytics.

Dedicated.
Innovative.
I'm a Computer Science undergrad at JECRC, Jaipur, and honestly? I'm not just aiming to be a Data Engineer. I'm building myself into a Data Unicorn — that rare breed that sits at the intersection of Data Engineering, Data Science, and Data Analysis.
My philosophy is simple: clean data over clever models, and bulletproof pipelines over flashy demos. Whether it's architecting a medallion lakehouse or digging into sentiment patterns, I'm here to bridge the gap between raw bytes and business gold.
Technical Skills




Featured Projects
End-to-end projects spanning AI, NLP, mobile, and automation — all real, all shipped.

- Deterministic 100-point engineering score across repo quality, maturity, OSS, consistency & recruiter signal — AI only narrates
- FastAPI + Next.js with Neon Postgres persistence, GitHub OAuth, and an Upstash Redis warm cache (repeat analysis p95 ≤ 200 ms)
- Groq llama-3.3-70b Roast + Mentor narration and shareable 1200×630 'GitHub Receipts' OG scorecards

- Architected an automated ETL pipeline with Airflow to ingest and process 10k+ sales records
- Designed a star-schema warehouse in PostgreSQL for optimized analytical querying
- Surfaced actionable business insights via a dynamic Power BI dashboard focusing on sales performance

- Medallion lakehouse pipeline (Bronze → Silver → Gold) in Databricks with Delta Lake tables
- FastAPI backend on Render serving analytics from Neon Postgres fact tables
- Next.js dashboard with animated cards, genre splits, binge sessions & listening rhythm charts

- Deployed a locally trained ML model for intent classification without external AI APIs
- Implemented speech recognition, intent classification & system automation in one integrated pipeline
- Built on Python + TensorFlow + NLP for real-time command processing

- YuNet face detection + SFace 128-dimensional embeddings (ONNX) with cosine-similarity matching at a configurable threshold
- Caches pre-computed embeddings keyed on file hash to eliminate redundant inference
- Real-time progress streaming via Server-Sent Events; three-table SQLite schema for resumable runs

- Gesture-based SOS trigger with real-time location tracking via MapLibre GL
- Supabase (Postgres + auth) backend with row-level security and PLpgSQL functions
- Twilio-powered SMS alerts dispatched to saved trusted contacts on emergency

- Pipeline handling 1,000+ reviews at 85% accuracy using Scikit-learn & Pandas
- Classifies sentiments in under 0.5 seconds per review via Jupyter Notebook
- TF-IDF vectorization, tokenization & stop-word removal boosted precision by 20%
Internship Journey
- Built ML classification and prediction models using Python with real-world datasets
- Cleaned, analyzed & engineered features to extract key business insights from raw data
- Worked on practical projects leveraging Scikit-learn, Pandas, and data visualization libraries
- Designed fully responsive web pages using HTML, CSS, and Bootstrap framework
- Improved UI/UX through clean, accessible layouts and user-centered design principles
- Delivered polished frontend interfaces for client-facing products
Education
and Research Center
Certs & Achievements







Get in Touch
Have a project, an opportunity, or just want to say hello? I'd love to hear from you.

