MANVIR
HEER

Project Case Studies

Deep dives into production systems I've built. Each project includes the problem, solution, measurable impact, and what I learned.

~/projects$ls -la featured/
rag-exercise-generator.md

RAG Exercise Plan Generator

Production AI System for Healthcare

Tenzr Health
Oct 2025
Production

AI-powered exercise plan generator using Retrieval-Augmented Generation. Healthcare professionals input natural language queries like "shoulder exercises for post-surgery recovery, week 2" and receive personalized, structured rehabilitation plans.

# The Problem

Tenzr had 1,578 static exercises in their database but no intelligent way to surface them. Creating personalized rehabilitation plans required manual exercise selection, which was time-consuming and inconsistent.

# The Solution

Built a full RAG pipeline with hybrid search (70% semantic + 30% keyword) that transforms natural language queries into structured exercise plans. The system detects constraints like injuries and restrictions, ensuring safety-focused recommendations.

# Architecture

User Query
  → Constraint Detection (injuries, goals)
  → Hybrid Search (70% semantic + 30% keyword)
  → Context Preparation (top 15 exercises)
  → LLM Generation (structured prompts)
  → Structured Output (exercise plan)

# Impact

  • First production RAG system at Tenzr
  • Transformed 1,578 exercises into intelligent, queryable system
  • Created 40+ KB of technical documentation across 6 focused docs
  • Established foundation for future AI-powered healthcare features

# What I Learned

  • Hybrid search significantly outperforms pure semantic for domain-specific retrieval
  • Context window size matters - too much context causes LLM parse errors
  • LlamaIndex provides better RAG primitives than LangChain for this use case
ownership: 100% backend/architecture, 95% frontend, 100% documentation
PythonFastAPILlamaIndexQdrantOpenAI GPT-4o-miniNext.jsDocker
pgbouncer-optimization.md

PgBouncer Database Optimization

99.4% Latency Reduction

Tenzr Health
2025
Production

Connection pooling optimization that dramatically improved database performance across the entire platform. Built with comprehensive benchmarking tools and documentation for reproducibility.

# The Problem

Direct PostgreSQL connections were creating significant latency overhead. Each new connection took ~329ms to establish, causing noticeable delays in healthcare workflows where responsiveness matters.

# The Solution

Implemented PgBouncer connection pooling with optimized configuration. Created custom benchmarking tooling to measure before/after performance with statistical rigor.

# Impact

  • Connection time: 329ms → 2ms (99.4% reduction)
  • Concurrent handling: +458% capacity (5.6×)
  • Throughput: +6.3% queries per second
  • Complex queries: 13% faster execution
Before
329ms connection time
After
2ms connection time
=
Improvement
99.4%

# What I Learned

  • Performance improvements without metrics are just opinions
  • Benchmarking tooling is as important as the optimization itself
  • Documentation enables future engineers to build on your work
ownership: Full implementation + benchmarking tool + documentation
PostgreSQLPgBouncerBenchmarkingDocumentation
marketplace-performance.md

Energy Marketplace Optimization

90% Page Load Improvement

A2P Energy
Sep - Dec 2024
Production

Database and application performance optimization for an energy marketplace platform serving industrial facilities in India.

# The Problem

Page loads were taking 20+ seconds, making the platform unusable for factory workers with limited time and varying technical literacy. Users were abandoning the platform.

# The Solution

Identified inefficient database queries using N+1 patterns. Implemented eager loading, query optimization, and strategic caching to reduce round trips.

# Impact

  • Page load time: 20s → 2s (90% reduction)
  • Significantly improved user adoption
  • Enabled usage by workers with limited technical background

# What I Learned

  • On-site user research reveals problems that logs never show
  • Performance is a feature, especially for users with limited connectivity
  • N+1 queries are the most common performance killer
ownership: Database optimization + performance profiling
NestJSPostgreSQLQuery OptimizationEager Loading
shift-logging-system.md

Industrial Shift Logging System

Full-Stack for Manufacturing

A2P Energy
Sep - Dec 2024
Production

Role-based access control system for industrial shift logging, built after conducting on-site user research at manufacturing facilities.

# The Problem

Factory workers needed to log shifts but existing systems assumed high technical literacy. The platform needed to work for users ranging from floor workers to plant managers, with Hindi language support.

# The Solution

Built a full-stack application with role-based access control, bilingual interface (English/Hindi), and simplified UI patterns informed by direct observation of users.

# Impact

  • Enabled shift logging across all technical literacy levels
  • Bilingual support for Hindi-speaking users
  • Role-based access control for workers, supervisors, and managers

# What I Learned

  • User research at actual work sites is invaluable
  • Accessibility means different things in different contexts
  • Simple UX often requires more complex engineering
ownership: Full-stack development + user research
NestJSNext.jsTypeScriptRBACi18n
pickmyelective.md

PickMyElective

AI-Powered Course Discovery for University Students

JourneyHacks 2026
Jan 2026
Hackathon

RAG-powered course recommendation system that helps university students discover elective courses through natural language search. Built in 12 hours at JourneyHacks 2026 with a team of 3.

# The Problem

University students struggle to find elective courses that match their interests. Traditional course catalogs require keyword searches and manual filtering through hundreds of options, with no way to express preferences naturally.

# The Solution

Built a 3-tier microservices system with RAG-powered semantic search. Users describe courses in natural language like "easy science course with no prerequisites" and receive semantically relevant matches with personalized explanations for why each course fits.

# Architecture

User Query
  → Query Interpretation (Gemini LLM)
  → Embedding Generation (OpenAI)
  → Vector Search (ChromaDB)
  → Post-filtering (campus, level, WQB)
  → Ranking (semantic + elective score)
  → Match Reason Generation (Gemini LLM)
  → Personalized Results

# Impact

  • Semantic search over 1,200+ courses using ChromaDB vector database
  • 5-step RAG pipeline: interpret → embed → search → rank → explain
  • Hybrid ranking: 80% semantic relevance + 20% elective quality score
  • Sub-second response times with rate limiting and query history

# What I Learned

  • RAG pipeline design with hybrid ranking outperforms pure semantic search
  • Post-filtering after vector search handles complex filter combinations efficiently
  • Microservices architecture enables independent scaling of AI workloads
ownership: Led full-stack development: RAG pipeline, backend API, and frontend UI
React 19TypeScriptSpring Boot 3.4Java 21FastAPIChromaDBOpenAI EmbeddingsGoogle GeminiPostgreSQLJWT + OTP Auth
~/projects$ls personal/

This Portfolio

Terminal-aesthetic portfolio built with Next.js 15, React 19, and Tailwind CSS 4. Features three theme modes, Framer Motion animations, and mobile-first responsive design.

Next.js 15React 19Tailwind CSS 4Framer Motion
[ View Source ]

Want to Collaborate?

I'm always interested in discussing infrastructure challenges, performance optimization, or interesting technical problems.