Project Case Studies
Deep dives into production systems I've built. Each project includes the problem, solution, measurable impact, and what I learned.
RAG Exercise Plan Generator
Production AI System for Healthcare
AI-powered exercise plan generator using Retrieval-Augmented Generation. Healthcare professionals input natural language queries like "shoulder exercises for post-surgery recovery, week 2" and receive personalized, structured rehabilitation plans.
# The Problem
Tenzr had 1,578 static exercises in their database but no intelligent way to surface them. Creating personalized rehabilitation plans required manual exercise selection, which was time-consuming and inconsistent.
# The Solution
Built a full RAG pipeline with hybrid search (70% semantic + 30% keyword) that transforms natural language queries into structured exercise plans. The system detects constraints like injuries and restrictions, ensuring safety-focused recommendations.
# Architecture
User Query → Constraint Detection (injuries, goals) → Hybrid Search (70% semantic + 30% keyword) → Context Preparation (top 15 exercises) → LLM Generation (structured prompts) → Structured Output (exercise plan)
# Impact
- ✓First production RAG system at Tenzr
- ✓Transformed 1,578 exercises into intelligent, queryable system
- ✓Created 40+ KB of technical documentation across 6 focused docs
- ✓Established foundation for future AI-powered healthcare features
# What I Learned
- →Hybrid search significantly outperforms pure semantic for domain-specific retrieval
- →Context window size matters - too much context causes LLM parse errors
- →LlamaIndex provides better RAG primitives than LangChain for this use case
PgBouncer Database Optimization
99.4% Latency Reduction
Connection pooling optimization that dramatically improved database performance across the entire platform. Built with comprehensive benchmarking tools and documentation for reproducibility.
# The Problem
Direct PostgreSQL connections were creating significant latency overhead. Each new connection took ~329ms to establish, causing noticeable delays in healthcare workflows where responsiveness matters.
# The Solution
Implemented PgBouncer connection pooling with optimized configuration. Created custom benchmarking tooling to measure before/after performance with statistical rigor.
# Impact
- ✓Connection time: 329ms → 2ms (99.4% reduction)
- ✓Concurrent handling: +458% capacity (5.6×)
- ✓Throughput: +6.3% queries per second
- ✓Complex queries: 13% faster execution
# What I Learned
- →Performance improvements without metrics are just opinions
- →Benchmarking tooling is as important as the optimization itself
- →Documentation enables future engineers to build on your work
Energy Marketplace Optimization
90% Page Load Improvement
Database and application performance optimization for an energy marketplace platform serving industrial facilities in India.
# The Problem
Page loads were taking 20+ seconds, making the platform unusable for factory workers with limited time and varying technical literacy. Users were abandoning the platform.
# The Solution
Identified inefficient database queries using N+1 patterns. Implemented eager loading, query optimization, and strategic caching to reduce round trips.
# Impact
- ✓Page load time: 20s → 2s (90% reduction)
- ✓Significantly improved user adoption
- ✓Enabled usage by workers with limited technical background
# What I Learned
- →On-site user research reveals problems that logs never show
- →Performance is a feature, especially for users with limited connectivity
- →N+1 queries are the most common performance killer
Industrial Shift Logging System
Full-Stack for Manufacturing
Role-based access control system for industrial shift logging, built after conducting on-site user research at manufacturing facilities.
# The Problem
Factory workers needed to log shifts but existing systems assumed high technical literacy. The platform needed to work for users ranging from floor workers to plant managers, with Hindi language support.
# The Solution
Built a full-stack application with role-based access control, bilingual interface (English/Hindi), and simplified UI patterns informed by direct observation of users.
# Impact
- ✓Enabled shift logging across all technical literacy levels
- ✓Bilingual support for Hindi-speaking users
- ✓Role-based access control for workers, supervisors, and managers
# What I Learned
- →User research at actual work sites is invaluable
- →Accessibility means different things in different contexts
- →Simple UX often requires more complex engineering
PickMyElective
AI-Powered Course Discovery for University Students
RAG-powered course recommendation system that helps university students discover elective courses through natural language search. Built in 12 hours at JourneyHacks 2026 with a team of 3.
# The Problem
University students struggle to find elective courses that match their interests. Traditional course catalogs require keyword searches and manual filtering through hundreds of options, with no way to express preferences naturally.
# The Solution
Built a 3-tier microservices system with RAG-powered semantic search. Users describe courses in natural language like "easy science course with no prerequisites" and receive semantically relevant matches with personalized explanations for why each course fits.
# Architecture
User Query → Query Interpretation (Gemini LLM) → Embedding Generation (OpenAI) → Vector Search (ChromaDB) → Post-filtering (campus, level, WQB) → Ranking (semantic + elective score) → Match Reason Generation (Gemini LLM) → Personalized Results
# Impact
- ✓Semantic search over 1,200+ courses using ChromaDB vector database
- ✓5-step RAG pipeline: interpret → embed → search → rank → explain
- ✓Hybrid ranking: 80% semantic relevance + 20% elective quality score
- ✓Sub-second response times with rate limiting and query history
# What I Learned
- →RAG pipeline design with hybrid ranking outperforms pure semantic search
- →Post-filtering after vector search handles complex filter combinations efficiently
- →Microservices architecture enables independent scaling of AI workloads
This Portfolio
Terminal-aesthetic portfolio built with Next.js 15, React 19, and Tailwind CSS 4. Features three theme modes, Framer Motion animations, and mobile-first responsive design.
Want to Collaborate?
I'm always interested in discussing infrastructure challenges, performance optimization, or interesting technical problems.