SS
0%loading experience

ShaanSatsangi.

CS undergrad → Aspiring Data Unicorn Building the future with Data Engineering, ML, and Analytics.

scroll
Shaan Satsangi
Shaan Satsangi
CS undergrad · Data Engineer & AI/ML developer
Jaipur, Rajasthan, India
7+
Projects Shipped
2
Internships
5+
Certifications
About me

Dedicated.
Innovative.

I'm a Computer Science undergrad at JECRC, Jaipur, and honestly? I'm not just aiming to be a Data Engineer. I'm building myself into a Data Unicorn — that rare breed that sits at the intersection of Data Engineering, Data Science, and Data Analysis.

My philosophy is simple: clean data over clever models, and bulletproof pipelines over flashy demos. Whether it's architecting a medallion lakehouse or digging into sentiment patterns, I'm here to bridge the gap between raw bytes and business gold.

PythonSQLApache AirflowApache SparkdbtTensorFlowScikit-learnOpenCVNext.jsFastAPIFlaskDockerPostgreSQLRedisFirebaseGoogle CloudC / C++
What I work with

Technical Skills

Data Engineering
Data Engineering
Apache Airflow / Spark82%
dbt / SQL (PostgreSQL)88%
Databricks / Delta Lake78%
ETL / ELT Pipelines85%
Data Science & AI
Data Science & AI
TensorFlow / Keras86%
Scikit-learn / Pandas92%
OpenCV / NLP84%
Local LLM Inference80%
Data Analysis & Analytics
Data Analysis & Analytics
Power BI / Tableau82%
SQL (Complex Queries)90%
Matplotlib / Seaborn85%
Actionable Insights80%
Software Engineering
Software Engineering
C / C++88%
System Design75%
OOP / DSA84%
Linux / Bash / Docker80%
Developer tools
PythonPython
FastAPIFastAPI
Next.jsNext.js
ReactReact
TypeScriptTypeScript
PostgreSQLPostgreSQL
RedisRedis
DockerDocker
TensorFlowTensorFlow
Scikit-learnScikit-learn
OpenCVOpenCV
PandasPandas
Apache SparkApache Spark
AirflowAirflow
JupyterJupyter
GitGit
GitHubGitHub
VS CodeVS Code
FlaskFlask
FirebaseFirebase
Google CloudGoogle Cloud
SQLiteSQLite
LinuxLinux
VercelVercel
What I've built

Featured Projects

End-to-end projects spanning AI, NLP, mobile, and automation — all real, all shipped.

Skill Issue — GitHub Intelligence
May 2026
Next.jsNext.jsFastAPIFastAPIPythonPythonNeon PostgresNeon PostgresUpstash RedisUpstash RedisGroq LLM
Skill Issue — GitHub Intelligence
An AI-powered GitHub intelligence platform. Drop in a username and it turns repos, OSS contributions, and coding discipline into a deterministic engineering score and shareable receipts.
  • Deterministic 100-point engineering score across repo quality, maturity, OSS, consistency & recruiter signal — AI only narrates
  • FastAPI + Next.js with Neon Postgres persistence, GitHub OAuth, and an Upstash Redis warm cache (repeat analysis p95 ≤ 200 ms)
  • Groq llama-3.3-70b Roast + Mentor narration and shareable 1200×630 'GitHub Receipts' OG scorecards
CRM + Sales — Data Warehouse
April 2026
PythonPythonApache AirflowApache AirflowPostgreSQLPostgreSQLPower BIETL
CRM + Sales — Data Warehouse
An end-to-end data warehousing project using the Maven Analytics dataset. Built a robust ETL pipeline that cleans and transforms raw CRM data into a structured PostgreSQL warehouse.
  • Architected an automated ETL pipeline with Airflow to ingest and process 10k+ sales records
  • Designed a star-schema warehouse in PostgreSQL for optimized analytical querying
  • Surfaced actionable business insights via a dynamic Power BI dashboard focusing on sales performance
YouTube Wrapped — Data Pipeline
May 2025
DatabricksDelta LakeFastAPIFastAPINext.jsNext.jsNeon PostgresNeon Postgres
YouTube Wrapped — Data Pipeline
A personal 'Spotify Wrapped' for YouTube. End-to-end data pipeline that transforms Google Takeout exports into a polished year-in-review analytics dashboard.
  • Medallion lakehouse pipeline (Bronze → Silver → Gold) in Databricks with Delta Lake tables
  • FastAPI backend on Render serving analytics from Neon Postgres fact tables
  • Next.js dashboard with animated cards, genre splits, binge sessions & listening rhythm charts
JARVIS — Offline AI Assistant
Oct 2025
PythonPythonLocal LLMsSemantic MemoryWake-word DetectionTTS
JARVIS — Offline AI Assistant
A modular, privacy-first AI voice bot built with local LLM inference and semantic memory. Operates entirely offline without external APIs.
  • Deployed a locally trained ML model for intent classification without external AI APIs
  • Implemented speech recognition, intent classification & system automation in one integrated pipeline
  • Built on Python + TensorFlow + NLP for real-time command processing
FaceFilter AI — Facial Recognition
2025
PythonPythonFlaskFlaskOpenCV DNNSQLiteSQLiteSSE
FaceFilter AI — Facial Recognition
A locally-run face-recognition platform that detects, matches, and organizes photos by face — no cloud uploads, no API keys.
  • YuNet face detection + SFace 128-dimensional embeddings (ONNX) with cosine-similarity matching at a configurable threshold
  • Caches pre-computed embeddings keyed on file hash to eliminate redundant inference
  • Real-time progress streaming via Server-Sent Events; three-table SQLite schema for resumable runs
Sahara — Women Safety App
Feb 2025
Next.jsNext.jsTypeScriptTypeScriptSupabaseTailwind CSSTwilioMapLibre
Sahara — Women Safety App
A mobile-first safety web app with gesture-based SOS, real-time location tracking, and Twilio-powered alerts to trusted contacts — all backed by Supabase.
  • Gesture-based SOS trigger with real-time location tracking via MapLibre GL
  • Supabase (Postgres + auth) backend with row-level security and PLpgSQL functions
  • Twilio-powered SMS alerts dispatched to saved trusted contacts on emergency
Review Reader — Sentiment Analysis
Aug 2025
PythonPythonPandasPandasScikit-learnScikit-learnTF-IDFJupyterJupyter
Review Reader — Sentiment Analysis
A high-accuracy NLP pipeline that classifies sentiments across 1,000+ reviews in milliseconds using classical ML with advanced text preprocessing.
  • Pipeline handling 1,000+ reviews at 85% accuracy using Scikit-learn & Pandas
  • Classifies sentiments in under 0.5 seconds per review via Jupyter Notebook
  • TF-IDF vectorization, tokenization & stop-word removal boosted precision by 20%
Work experience

Internship Journey

July 2024 – August 2024
Upflairs Pvt. Ltd.
Data Science with AI/ML in Python
Jaipur, Rajasthan
  • Built ML classification and prediction models using Python with real-world datasets
  • Cleaned, analyzed & engineered features to extract key business insights from raw data
  • Worked on practical projects leveraging Scikit-learn, Pandas, and data visualization libraries
August 2023 – September 2023
Upflairs Pvt. Ltd.
Frontend Web Development
Jaipur, Rajasthan
  • Designed fully responsive web pages using HTML, CSS, and Bootstrap framework
  • Improved UI/UX through clean, accessible layouts and user-centered design principles
  • Delivered polished frontend interfaces for client-facing products
Where I study

Education

Jaipur Engineering College
and Research Center
Bachelor of Technology in Computer Science & Engineering
Nov 2022 – Present
7.39
CGPA
Credentials & wins

Certs & Achievements

Microsoft
Microsoft
Fundamentals of Machine Learning
December 26, 2023
Verified
Microsoft
Microsoft
Fundamental AI Concepts
December 26, 2023
Verified
CISCO Networking Academy
CISCO Networking Academy
Introduction to Cybersecurity
Networking Academy Program
Completed
CISCO Networking Academy
CISCO Networking Academy
Cybersecurity Essentials
Networking Academy Program
Completed
IEEE
IEEE
2nd Position — Debate Competition
IEEE Student Chapter
🥈 Runner-up
micro1
micro1
Data Science, AI/ML Engineer & Data Engineer
May 3, 2026
AI Certified
Hobbies & Interests
Hobbies & Interests
Movies · Music · Exploring History
Always curious, always learning
Personal
Let's connect

Get in Touch

Have a project, an opportunity, or just want to say hello? I'd love to hear from you.