Jay Shah
Lead Data Scientist building LLM/RAG systems, agent platforms, and MLOps at enterprise scale.
About
Lead Data Scientist shipping LLM/RAG, agentic systems, and MLOps products in enterprise SaaS. Expert in Python, AWS, and GCP with crisp communication and cross-team leadership from roadmap to production.
Work Experience
AvathonPleasanton, California
Data Scientist III
AvathonSunnyvale, California
Data Scientist II
Avathon (Acquired Ensemble Energy)Palo Alto, California
Data Scientist
Avathon (Acquired Ensemble Energy)Palo Alto, California
Data Science Intern
Texas A&M UniversityCollege Station, Texas
Graduate Research Assistant
Utilities and Energy ServicesCollege Station, Texas
Student Analyst
DataKindSan Francisco, California
Data Ambassador
Education
Texas A&M University
Gujarat Technological University
Skills
Projects
Pravāha - Your Local Perplexity-Inspired Search Engine
Pravāha is an AI search assistant that combines local search engine capabilities with advanced Large Language Models (LLMs), inspired by Perplexity.ai.
NeuroBuddy: A Personalized Chatbot
A personalized chatbot that provides mental health support and resources to users, leveraging advanced NLP models and AI-driven analytics. The project was developed with Mistral AI and Whisper Models.
AgentEval Suite — Specialized Evals for Production Agents
Scenario-driven evaluation harness for domain agents: synthetic task generation, tool-usage tracing, success/latency metrics, and judge models; CI-friendly to prevent regressions.
Policy Compliance Tracker (LLM + docETL)
Tracks regulatory compliance across policy docs, SOPs, and audit logs; ingests with docETL, normalizes, and runs retrieval + rule checking with explainable outputs and audit-ready reports.
StreamLens: Revolutionizing Video Content Interaction with AI
An AI-driven project aimed at transforming video content interaction, leveraging advanced analytics and machine learning. Participated in the RAG-A-THON challenge organized by Llama Index.
Gujarati Llama - Fine-tuned Version of LLaMA on Indic Language
Developed a fine-tuned version of the LLaMA model specifically for Gujarati and other Indic languages, enhancing language understanding and generation capabilities for low-resource languages.
PowerCurve Estimation for Wind Energy Farms
Collaborated with Texas A&M University to develop models for estimating power curves of wind energy farms, enhancing efficiency and predictive maintenance.
Exploratory Data Analysis of Mercedes Green Manufacturing Challenge
A project associated with Texas A&M University focusing on analyzing the green manufacturing processes of Mercedes, aiming at improving safety and efficiency.
Portfolio Analysis on New York Cab Data
Performed comprehensive data analysis on New York cab data to uncover insights and patterns, associated with Texas A&M University.
Predicting Drowsiness Related Lane Departures
A project aimed at predicting lane departures caused by drowsiness using novel feature generation techniques and convolutional neural networks, in collaboration with Texas A&M University. Achieved robust results with a confidence interval of 0.75-0.86 using the Bootstrap significance test.
Customer Relationship Prediction for a Mobile Network Operator
Worked on predicting customer behavior (churn, appetency, up-selling) for Orange, using a wide range of classification techniques to identify the highest AUC for individual problems. The project focused on true positives and direct customer communication strategies.
Phase 1 Analysis of Multivariate Quality Control Data for an Industrial Forging Process
Conducted principal component analysis and applied T2 and M-Cusum charts on multivariate data from an industrial forging process, achieving significant data reduction and cleansing. This work was associated with Texas A&M University, focusing on quality control data categorization.
Press ⌘J to open the command menu