Jay Shah

Lead Data Scientist building LLM/RAG systems, agent platforms, and MLOps at enterprise scale.

Sunnyvale, CA

JS

About

Lead Data Scientist shipping LLM/RAG, agentic systems, and MLOps products in enterprise SaaS. Expert in Python, AWS, and GCP with crisp communication and cross-team leadership from roadmap to production.

Work Experience

Avathon
Pleasanton, California

2022 - Present

Data Scientist III

Led foundation-model MLOps platform; cut release time 45% and increased model adoption. Architected agent platform with MCP tools to spin up domain agents in under 5 minutes; now powering 20+ workflows. Built LLM interface integrating asset data, enabling reporting and task automation. Launched cross-site solar-storage autoencoder anomaly detection to predict failures and performance issues, saving >$500k. Shipped RAG compliance agent, reducing violations 10% and automating audit preparation for energy domains. Reduced false positives 25% by ranking predictive alerts and next-best actions with Bayesian analysis.

Avathon
Sunnyvale, California

2021 - 2021

Data Scientist II

Deployed XGBoost-based solar/wind forecasting service; lowered MAPE 10% over baseline. Mentored offshore ML team on systems design and best practices; boosted delivery speed 25%. Defined ML architecture standards adopted by 3 squads to streamline handoffs and deployments.

Avathon (Acquired Ensemble Energy)
Palo Alto, California

2019 - 2021

Data Scientist

Built wind-turbine remaining useful life (RUL) model (92% accuracy) using industrial sensor and fault data. Automated model deployment and tracking using Airflow, Docker, Serverless, and Terraform on AWS/GCP. Designed Airflow + AWS Lambda ETL reducing latency 60% across five clients. Prototyped statistical tools to quantify power production inefficiency and energy loss.

Avathon (Acquired Ensemble Energy)
Palo Alto, California

2018 - 2018

Data Science Intern

Implemented anomaly detection to predict component failures using GBM for 8 wind-turbine components. Estimated bearing types and segmented bearing failures from 10-minute signatures using K-means clustering; delivered actionable insights.

Texas A&M University
College Station, Texas

2019 - 2019

Graduate Research Assistant

Researched with Dr. Yu Ding on applying advanced machine learning methods to solve and predict wind energy system failure. Implemented deep learning methods to predict possible power production and downtimes associated with the failures of wind turbine.

Utilities and Energy Services
College Station, Texas

2017 - 2018

Student Analyst

Created weather-controlled building baseline regression models for all digitally metered utilities using enterprise energy module. These models are used to monitor consumption across the campus to prevent sensor issues and energy loss. Manipulated Data in SQL to compare baseline modelled consumption with real-time consumption using statistical control limit chart in excel to analyse the average variation related to prediction.

DataKind
San Francisco, California

2022 - 2022

Data Ambassador

As a Data Ambassador for DataKind, I played a pivotal role in a collaborative project with John Jay College, developing advanced machine learning models to predict student dropout and delayed graduation. Leveraging Random Forest classifiers, our team crafted and tested over 20 models, ultimately recommending a tailored suite of six models to enable early identification and support for students at risk, significantly contributing to improving college completion rates.

Education

Texas A&M University

2017 - 2019
M.S. in Industrial and Systems Engineering (minor Applied Statistics)

Gujarat Technological University

2013 - 2017
B.E. in Mechanical Engineering

Skills

Data Science & Machine Learning
Predictive Modeling & Analytics
Natural Language Processing (NLP)
Deep Learning (CNN, RNN, LSTM, Transformers, TensorFlow, PyTorch)
Time Series Forecasting
Large Language Models (LLM): Fine-tuning, Retrieval-Augmented Generation (RAG), Unsupervised Learning
Model Optimization & Fine-tuning
AI-driven Solution Development
PyTorch
TensorFlow
Scikit-learn
Transformers
LangChain
XGBoost
Statsmodels
Darts
Cloud Computing & Serverless Architectures (AWS, GCP, Azure)
Real-time Data Processing & ETL (Apache Airflow, Dask, Pandas)
Big Data Technologies (Spark, Hadoop)
Software Development (Python, SQL, Bash, R, JavaScript)
Version Control (Git, GitHub)
API Development (FastAPI, Flask)
Statistical Analysis & Data Visualization (Matplotlib, Seaborn, Plotly, Dash, PowerBI)
Exploratory Data Analysis (EDA)
Docker
Linux
High-Performance Computing (GPU/CPU)
Project Management & Team Leadership
Agile Methodologies (Scrum, Kanban)
Research & Development in Renewable Energy Systems
Machine Learning Model Deployment & Monitoring (MLflow, BentoML, Modal)

Press J to open the command menu