Jay Shah

Data Scientist with applied AI skills, dedicated to delivering impactful and scalable solutions.

SanFranscio

JS

About

With over 6 years of experience, I specialize in AI for renewable energy and Large Language Models (LLMs). I excel in predictive maintenance for energy systems and full-stack data science. I also contribute to DataKind's social impact projects, combining AI with societal benefits.

Work Experience

Avathon
Pleasanton, California

2022 - Present

Data Scientist III

Leading data science endeavor of the renewable product research team to develop and productize analytics capabilities. Spearheaded the integration of advanced machine learning models to enhance predictive accuracy. Collaborated with cross-functional teams to ensure seamless deployment and scalability of data solutions. Mentored junior data scientists and provided strategic insights to drive project success.

Avathon
Sunnyvale, California

2021 - 2021

Data Scientist II

Designed and deployed core wind and solar power forecasting capabilities using techniques of XGBoost Informers. Achieved 10% MAPE improvement over SOTA with faster lead time to deployment. Mentored and guided an offshore team of 2 by providing ML systems principles and design-related strategies.

Avathon (Acquired Ensemble Energy)
Palo Alto, California

2019 - 2021

Data Scientist

Built a predictive maintenance system for wind assets to estimate wind turbine’s remaining useful life (RUL). Developed processes and systems for automated deployment of the models in production with performance tracking using Airflow. Architected real-time data processing (ETL) pipeline for 5 customers and achieved 60% reduction in processing time utilizing multiprocessing pandas dask pipeline in Python on AWS serverless cloud server. Prototyped and productionized statistical tools to provide insight into power production inefficiency and quantify the energy loss.

Avathon (Acquired Ensemble Energy)
Palo Alto, California

2018 - 2018

Data Science Intern

Implemented a robust anomaly detection system to predict component failure using GBM for 8 components of the wind turbine. Estimated bearing type and segmented bearing failures based on 10-min signature profile using K-means clustering to perform RCA. Delivered executable insights to customers by performing physics-based statistical data analysis and advanced data visualization utilizing ggplot & matplotlib library in Python that helped to increase 250K $/year in revenue.

Texas A&M University
College Station, Texas

2019 - 2019

Graduate Research Assistant

Researched with Dr. Yu Ding on applying advanced machine learning methods to solve and predict wind energy system failure. Implemented deep learning methods to predict possible power production and downtimes associated with the failures of wind turbine.

Utilities and Energy Services
College Station, Texas

2017 - 2018

Student Analyst

Created weather-controlled building baseline regression models for all digitally metered utilities using enterprise energy module. These models are used to monitor consumption across the campus to prevent sensor issues and energy loss. Manipulated Data in SQL to compare baseline modelled consumption with real-time consumption using statistical control limit chart in excel to analyse the average variation related to prediction.

DataKind
San Francisco, California

2022 - 2022

Data Ambassador

As a Data Ambassador for DataKind, I played a pivotal role in a collaborative project with John Jay College, developing advanced machine learning models to predict student dropout and delayed graduation. Leveraging Random Forest classifiers, our team crafted and tested over 20 models, ultimately recommending a tailored suite of six models to enable early identification and support for students at risk, significantly contributing to improving college completion rates.

Education

Texas A&M University

2017 - 2019
Master's Degree in Industrial and Systems Engineering

Gujarat State University

2013 - 2017
Bachelor's Degree in Mechanical Engineering

Skills

Data Science & Machine Learning
Predictive Modeling & Analytics
Natural Language Processing (NLP)
Deep Learning (CNN, RNN, LSTM, Transformers, TensorFlow, PyTorch)
Time Series Forecasting
Large Language Models (LLM): Fine-tuning, Retrieval-Augmented Generation (RAG), Unsupervised Learning
Model Optimization & Fine-tuning
AI-driven Solution Development
PyTorch
TensorFlow
Scikit-learn
Transformers
LangChain
XGBoost
Statsmodels
Darts
Cloud Computing & Serverless Architectures (AWS, GCP, Azure)
Real-time Data Processing & ETL (Apache Airflow, Dask, Pandas)
Big Data Technologies (Spark, Hadoop)
Software Development (Python, SQL, Bash, R, JavaScript)
Version Control (Git, GitHub)
API Development (FastAPI, Flask)
Statistical Analysis & Data Visualization (Matplotlib, Seaborn, Plotly, Dash, PowerBI)
Exploratory Data Analysis (EDA)
Docker
Linux
High-Performance Computing (GPU/CPU)
Project Management & Team Leadership
Agile Methodologies (Scrum, Kanban)
Research & Development in Renewable Energy Systems
Machine Learning Model Deployment & Monitoring (MLflow, BentoML, Modal)

Press J to open the command menu