Hello, I'm

Mithil

Data Science undergraduate building RAG systems, NLP pipelines, and scalable data applications.

Mithil

I am a Data Science undergraduate specializing in NLP, Retrieval-Augmented Generation (RAG) systems, and data pipelines. I design and build RAG-based systems with a focus on retrieval quality, evaluation, and real-world deployment performance.

Python, SQL, FastAPI, PyTorch, RAG, Docker, PostgreSQL

Work History

Analytics Intern — RAG & Data Analytics

Star Health and Allied Insurance

  • Contributed to building analytics pipelines over FY24–FY25 insurance data (34 insurers, 9 segments, 2.3K+ records), enabling scalable market and risk analysis
  • Built hybrid RAG system (LangChain, ChromaDB) integrating structured data with 52 analytical documents for natural-language querying
  • Improved retrieval quality via hybrid search (BM25 + vector), reranking, and metadata filtering (+28% Recall@5, +22% context precision on benchmark queries)
  • Reduced latency from 2.4s to 650ms (−73%) and cut token usage by 42% via routing, context compression, and prompt tuning
  • Evaluated system on 60-query benchmark (RAGAS/manual): ~93% faithfulness, ~91% relevance with improved citation grounding
Python, LangChain, RAG, ChromaDB, Sentence-Transformers, Hybrid Search, Streamlit
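
To make the hybrid search and Recall@5 figures above more concrete, here is a minimal sketch of one common way to combine a BM25 ranking with a vector ranking and then score the result. It is an illustration only, not the internship code: reciprocal rank fusion is an assumed fusion method (the bullets only say "BM25 + vector"), and the function names and document IDs are hypothetical.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into one ranked list.

    Assumption: reciprocal rank fusion (RRF) stands in for whatever
    fusion the real system used. Each list adds 1 / (k + rank) per doc.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


def recall_at_k(retrieved, relevant, k=5):
    """Fraction of the relevant documents that appear in the top k results."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    hits = len(set(retrieved[:k]) & relevant)
    return hits / len(relevant)


# Hypothetical rankings from a keyword (BM25) retriever and a vector retriever.
bm25_ranking = ["d3", "d1", "d7", "d2"]
vector_ranking = ["d1", "d4", "d3", "d9"]

fused = reciprocal_rank_fusion([bm25_ranking, vector_ranking])
print(fused[:5])
print(recall_at_k(fused, relevant={"d1", "d3", "d4"}, k=5))
```

Documents ranked well by both retrievers rise to the top of the fused list, which is the usual motivation for pairing keyword and vector search before a reranking step.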

Featured Work

Aurora RAG Chatbot

RAG system for real-time event queries, serving 400+ attendees. Optimized latency from 4.2s to 18ms (for cached responses) via multi-tier caching and cross-encoder reranking. Managed a development team as Technical Lead.

Python, FastAPI, Redis, ChromaDB, Groq, Docker
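
The multi-tier caching idea mentioned above can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not Aurora's actual code: the class name `TwoTierCache`, the query normalization, and the plain dict standing in for a shared store such as Redis are all hypothetical.

```python
from collections import OrderedDict


class TwoTierCache:
    """Small in-process LRU tier in front of a larger shared tier.

    Assumption: `shared_store` is a plain dict here; in a deployment it
    could be a Redis-backed mapping. `compute` is the slow RAG pipeline.
    """

    def __init__(self, compute, shared_store=None, local_size=128):
        self.compute = compute
        self.shared = shared_store if shared_store is not None else {}
        self.local = OrderedDict()
        self.local_size = local_size

    def _remember_locally(self, key, value):
        self.local[key] = value
        self.local.move_to_end(key)
        if len(self.local) > self.local_size:
            self.local.popitem(last=False)  # evict least recently used

    def get(self, query):
        # Normalize case and whitespace so trivially different queries share a key.
        key = " ".join(query.lower().split())
        if key in self.local:
            self.local.move_to_end(key)
            return self.local[key], "local"
        if key in self.shared:
            value = self.shared[key]
            self._remember_locally(key, value)
            return value, "shared"
        value = self.compute(query)  # slow path: retrieval plus generation
        self.shared[key] = value
        self._remember_locally(key, value)
        return value, "computed"


def slow_answer(query):
    # Hypothetical stand-in for the full retrieval and generation pipeline.
    return f"answer to: {query}"


cache = TwoTierCache(slow_answer, local_size=2)
print(cache.get("When does the keynote start?")[1])
print(cache.get("when does  the keynote start?")[1])
```

Only the first call pays for the full pipeline; repeated or near-identical questions are answered from a cache tier, which is where millisecond-level latencies for cached responses typically come from.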

AI Cloud Drive

Built a self-hosted cloud storage system with an integrated RAG pipeline for querying technical PDFs. Implemented a retrieval strategy using hybrid search, reranking, and context sufficiency checks to reduce hallucinations. Features asynchronous document ingestion and citation tracking, with the system optimized for faithfulness and measurable retrieval accuracy.

Python, FastAPI, Docker, Groq, ChromaDB

Open Source Contributions

@mithil27360

My neurons use RAG too – Retrieve And Guess 🧠

Open Source Highlight

Contributor to Keras — merged pull request in core ML ecosystem

Merged by François Chollet
200M+ 60K+

2024 - 2028

Bachelor of Technology

Data Science & Engineering

Manipal Institute of Technology

Data Analytics & Visualization, Object-Oriented Programming, Database Systems, Data Structures

Clubs & Organizations

The Data Alchemists

Management Committee Member Aug 2025 - Present

ISTE Manipal

Working Committee Oct 2024 - Jun 2025
Management Committee Jul 2025 - Present

Manipal Open Source Society

Management Committee Member Aug 2025 - Present

AWS Cloud Club

Management Committee Member Dec 2025 - Present

Finova, MIT Manipal

Working Committee Nov 2024 - Jun 2025
Management Committee Jul 2025 - Present

Chords & Co.

HR Manager Aug 2025 - Present

Open to Collaboration

Around applied projects, research, and systems.
