Jay Shah

Data Scientist with applied AI skills, dedicated to delivering impactful and scalable solutions.

About

With over 6 years of experience, I specialize in AI for renewable energy and Large Language Models (LLMs). I excel in predictive maintenance for energy systems and full-stack data science. I also contribute to DataKind's social impact projects, combining AI with societal benefits.

Work Experience

Avathon
Pleasanton, California

2022 - Present

Data Scientist III

Lead the data science effort for the renewable product research team to develop and productize analytics capabilities. Spearheaded the integration of advanced machine learning models to improve predictive accuracy. Collaborated with cross-functional teams to ensure smooth deployment and scalability of data solutions. Mentored junior data scientists and provided strategic insights to drive project success.

Avathon
Sunnyvale, California

2021 - 2021

Data Scientist II

Designed and deployed core wind and solar power forecasting capabilities using XGBoost and Informer models. Achieved a 10% MAPE improvement over the state of the art (SOTA) with a shorter lead time to deployment. Mentored and guided a two-person offshore team on ML systems principles and design strategies.

Avathon (Acquired Ensemble Energy)
Palo Alto, California

2019 - 2021

Data Scientist

Built a predictive maintenance system for wind assets to estimate the remaining useful life (RUL) of wind turbines. Developed processes and systems for automated deployment of models to production, with performance tracking in Airflow. Architected a real-time data processing (ETL) pipeline for 5 customers and achieved a 60% reduction in processing time using a multiprocessing pandas and Dask pipeline in Python on AWS serverless infrastructure. Prototyped and productionized statistical tools that provide insight into power production inefficiency and quantify the energy loss.

Avathon (Acquired Ensemble Energy)
Palo Alto, California

2018 - 2018

Data Science Intern

Implemented a robust anomaly detection system using GBM to predict component failure for 8 wind turbine components. Estimated bearing type and segmented bearing failures based on 10-minute signature profiles using K-means clustering to support root cause analysis (RCA). Delivered actionable insights to customers through physics-based statistical data analysis and advanced data visualization with the ggplot and matplotlib libraries in Python, which helped increase revenue by $250K per year.

Texas A&M University
College Station, Texas

2019 - 2019

Graduate Research Assistant

Conducted research with Dr. Yu Ding on applying advanced machine learning methods to predict wind energy system failures. Implemented deep learning methods to predict possible power production and the downtime associated with wind turbine failures.

Utilities and Energy Services
College Station, Texas

2017 - 2018

Student Analyst

Created weather-controlled building baseline regression models for all digitally metered utilities using an enterprise energy module. These models are used to monitor consumption across the campus to prevent sensor issues and energy loss. Manipulated data in SQL to compare baseline modeled consumption with real-time consumption, using a statistical control limit chart in Excel to analyze the average variation related to the predictions.

DataKind
San Francisco, California

2022 - 2022

Data Ambassador

As a Data Ambassador for DataKind, I played a pivotal role in a collaborative project with John Jay College, developing advanced machine learning models to predict student dropout and delayed graduation. Leveraging Random Forest classifiers, our team crafted and tested over 20 models, ultimately recommending a tailored suite of six models to enable early identification and support for students at risk, significantly contributing to improving college completion rates.

Education

Texas A&M University

2017 - 2019

Master's Degree in Industrial and Systems Engineering

Gujarat State University

2013 - 2017

Bachelor's Degree in Mechanical Engineering

Skills

Data Science & Machine Learning

Predictive Modeling & Analytics

Natural Language Processing (NLP)

Deep Learning (CNN, RNN, LSTM, Transformers, TensorFlow, PyTorch)

Time Series Forecasting

Large Language Models (LLM): Fine-tuning, Retrieval-Augmented Generation (RAG), Unsupervised Learning

Model Optimization & Fine-tuning

AI-driven Solution Development

PyTorch

TensorFlow

Scikit-learn

Transformers

LangChain

XGBoost

Statsmodels

Darts

Cloud Computing & Serverless Architectures (AWS, GCP, Azure)

Real-time Data Processing & ETL (Apache Airflow, Dask, Pandas)

Big Data Technologies (Spark, Hadoop)

Software Development (Python, SQL, Bash, R, JavaScript)

Version Control (Git, GitHub)

API Development (FastAPI, Flask)

Statistical Analysis & Data Visualization (Matplotlib, Seaborn, Plotly, Dash, PowerBI)

Exploratory Data Analysis (EDA)

Docker

Linux

High-Performance Computing (GPU/CPU)

Project Management & Team Leadership

Agile Methodologies (Scrum, Kanban)

Research & Development in Renewable Energy Systems

Machine Learning Model Deployment & Monitoring (MLflow, BentoML, Modal)

Projects

Pravāha - Your Local Perplexity-Inspired Search Engine

Pravāha is an AI search assistant that combines local search engine capabilities with advanced Large Language Models (LLMs), inspired by Perplexity.ai.

BM25

openai-api

tavily-search

NueroBuddy: A Personalized Chatbot

A personalized chatbot that provides mental health support and resources to users, leveraging advanced NLP models and AI-driven analytics. The project was developed with Mistral AI and Whisper Models.

streamlit

mistral-api

whisper-models

StreamLens: Revolutionizing Video Content Interaction with AI

An AI-driven project aimed at transforming video content interaction, leveraging advanced analytics and machine learning. Participated in the RAG-A-THON challenge organized by LlamaIndex.

hackathon

llamaindex

MLX

BentoML

NVIDIA-AI-Endpoints

Gujarati Llama - Fine-tuned Version of LLaMA on Indic Language

Developed a fine-tuned version of the LLaMA model specifically for Gujarati and other Indic languages, enhancing language understanding and generation capabilities for low-resource languages.

Transformers

Python

Hugging Face

LLaMA

Fine-tuning

PowerCurve Estimation for Wind Energy Farms

Collaborated with Texas A&M University to develop models for estimating power curves of wind energy farms, enhancing efficiency and predictive maintenance.

Data Analytics

Python

Machine Learning

MLP

Exploratory Data Analysis of Mercedes Green Manufacturing Challenge

A project associated with Texas A&M University focusing on analyzing the green manufacturing processes of Mercedes, aiming at improving safety and efficiency.

Data Science

Python

Exploratory Data Analysis

Portfolio Analysis on New York Cab Data

Performed comprehensive data analysis on New York cab data to uncover insights and patterns. This project was associated with Texas A&M University.

Data Analysis

Python

Statistical Modeling

Predicting Drowsiness Related Lane Departures

A project aimed at predicting lane departures caused by drowsiness using novel feature generation techniques and convolutional neural networks, in collaboration with Texas A&M University. Achieved robust results with a confidence interval of 0.75-0.86 using the bootstrap significance test.

Data Transformation

Keras

Customer Relationship Prediction for a Mobile Network Operator

Worked on predicting customer behavior (churn, appetency, up-selling) for Orange, comparing a wide range of classification techniques to find the highest AUC for each problem. The project focused on true positives and direct customer communication strategies.

CRM

Data Analytics

Machine Learning

Logistic Regression

Phase 1 Analysis of Multivariate Quality Control Data for an Industrial Forging Process

Conducted principal component analysis and applied T² and M-CUSUM charts to multivariate data from an industrial forging process, achieving significant data reduction and cleansing. This work was associated with Texas A&M University and focused on quality control data categorization.

Data Analysis

PCA

T2 Charts

M-Cusum Charts

Minitab
