Data Scientist | 3+ years in FinTech, Product Analytics | UH '25 Alum
I'm a Data Scientist and Data Engineer who builds end-to-end systems — from ingestion to inference. With an MS in Data Science from the University of Houston, I've shipped production pipelines at Accenture's fintech platform, genomics products that capture DNA from air, and ML models that turn raw data into decisions worth millions.
I work across the full stack: Kafka to Snowflake, dbt to SageMaker, XGBoost to RAG. Whether it's cutting feature prep from 6 hours to 23 seconds or reviewing 100+ ads/min with zero hallucinations — I build things that scale and insights that land.
I tend to overfit on Fridays, fine-tune on Saturdays, and regularize on Sundays. Currently seeking Data Engineer, Data Scientist, and AI/ML Engineer roles where the data is messy and the impact is real.
Wild Genomics, CA
University of Houston, TX
Accenture, India
Capgemini, India
End-to-end analytics pipeline for NYC taxi operations (8.6M+ trips, $180M+ quarterly revenue), combining batch and streaming architectures to enable real-time demand forecasting and dynamic pricing optimization.
AI-powered RAG system for digital advertising compliance, automating Google Ads policy review across 341 regulatory chunks from 25 documents to enable scalable, real-time ad moderation.
End-to-end ML platform processing 10.6M orders to deliver personalized recommendations, churn prevention, and customer lifetime value prediction — identifying $4.65M in annual business opportunity across 7 customer segments.
End-to-end AWS data pipeline processing 12.3M IMDb records with XGBoost movie rating prediction (R²=0.664) — combining batch ETL, feature engineering, ML training, real-time inference, and automated orchestration into a single production-grade system.
University of Houston
Houston, TX, USA
GPA: 3.7/4.0
Jawaharlal Nehru Technological University
Hyderabad, INDIA
GPA: 3.68/4.0
I had the pleasure of working with Varun during his internship at Wild Genomics as a Data Science Intern. From day one, Varun impressed us with his technical acumen, professionalism, and eagerness to contribute meaningfully to our mission. He developed and optimized bioinformatics pipelines for complex environmental DNA datasets, demonstrating strong skills in Python, R, and machine learning. What stood out most was Varun's ability to learn quickly, adapt to new challenges, and consistently deliver results. He brought positive energy to the team and demonstrated strong initiative while remaining collaborative. I wholeheartedly recommend him to any organization looking for a talented and driven data scientist
It's been an absolute pleasure working with Varun during his internship at Wild Genomics. Varun approached his role with professionalism, curiosity, and a strong drive to learn. He led the design and implementation of an end-to-end bioinformatics pipeline for processing complex, multi-marker eDNA sequencing datasets, delivering a solution that is both robust and scalable. Varun stood out for his teamwork, clear communication, and creative problem-solving. The pipeline he developed is already revealing species-level insights from airborne eDNA that traditional methods would struggle to capture.