Hi, I'm
Harshit Kumar Gupta

Machine Learning Engineer at GenAI Innovation Center AWS

GenAI, NLP, CV Responsible AI, AWS AI Practitioner
San Diego, California, United States

A versatile Machine Learning professional building scalable and responsible AI Applications using ML models and evaluating them on various KPIs, then deploying as microservices. Passionate about learning new technologies and mastering them in a very short time span.

Career Highlight

Currently leading ML-powered anomaly detection systems at Amazon, processing High volume of Selling Partner Payouts.

βœ“ Self-motivated team player
βœ“ Strong analytical & problem solving
βœ“ Creativity & innovation
βœ“ Leadership qualities

Machine Learning Expert with specialization in Computer Vision, NLP, Deep Learning, and Large Language Model.

Current Tech Stack: Python, Java, AWS Lambda, DynamoDB, Apache Spark, Docker
10+ years across Amazon, Loyalty Platforms, CognitiveScale, Intel Security, and Cadence Design Systems.

Harshit Kumar Gupta - AI Engineer
πŸš€
Machine Learning
Deep Learning
Computer Vision
Natural Language Processing
Anomaly Detection
Feature Engineering
Model Interpretability
Fine Tuning

Featured Works

Here are some of my most impactful projects across Amazon, Loyalty platforms, and AI systems, showcasing expertise in distributed systems, machine learning, and large-scale data processing.

πŸ€–
Large Language Models (LLM)MCPFine Tuning +5

BSOR GenAI Agent

Entry point and orchestrator for all GenAI use cases for Selling Partner Event Aggregation- Simplified Onboarding Migration for already onboarded events, E2E testing across multiple Services, Customer Asks and Operations.

Key Achievements
  • β€’ Building unified knowledge base and Retrieval system using RAG
  • β€’ Creating MCP Gateway service for existing Coral based Services(4+)
  • β€’ Creating MCP server for creating prompts and tools for SO Migration and E2E testing etc.
Large Language Models (LLM)MCPFine TuningRetrieval-Augmented Generation (RAG)Amazon BedrockAgent CoreLangFuseStrands
πŸ’»
AutoMLAutoGluonDataLake +2

Transactional Deferral Anomaly Detection

Real-time anomaly detection system to detect incorrect deferral type (computed by SPYDER withholding engine) for Selling partner financial transactions.

Key Achievements
  • β€’ Designing guardrail to check incorrectly released or deferred transactions
  • β€’ Utilized event features from Seller Event Aggregator and Deferral Policies
  • β€’ Built end-to-end ML pipeline with SageMaker training, inference endpoints
AutoMLAutoGluonDataLakeApache IcebergAWS SageMaker
πŸ’»
JavaLambdaGlue +12

Intelligent Payment Circuit Breaker System for Real-Time Payment Anomaly Detection and Mitigation

Intelligent Request Gating and Anomaly Detection of Seller Disbursement handling High volumne of Monthly Disbursements using ML models for failure prediction, pending anomaly detection, bank grouping, and result code clustering

Key Achievements
  • β€’ Designed scalable system predicting anomaly detection for Selling Partner Disbursements
  • β€’ Training machine learning models for predicting anomalous scores based on past patterns
  • β€’ Built Bookkeeper to keep track of failures at various indexes
JavaLambdaGlueStep FunctionHerdApolloRedisDynamoDBHubbleRedshiftSageMakerLogistic RegressionNeuralProphetTop2VecFuzzyWuzzy
πŸ’»
PythonSageMakerLambda +6

OptimalCharge: Spatio-Temporal ML for Predictive Selling Partner Collections

ML-powered charge timing optimization system for Amazon Seller Debt Manager (SDM) to improve debt recovery rates and charge success rates through intelligent retry scheduling

Key Achievements
  • β€’ Designed statistical voting classifier ensemble model to predict optimal charge retry times for failed transactions
  • β€’ Improved overall charge success rate from 40% baseline using temporal and regional features (Region, Card Type, Hour, Day)
  • β€’ Built end-to-end ML pipeline with SageMaker training, inference endpoints, and Lambda integration
PythonSageMakerLambdaStatistical Voting ClassifierEnsemble MethodsHerd WorkflowsStep FunctionsGlueAppConfig
🎯
JavaSpring BootDynamoDB +7

Gravty (SaaS for Customer Engagement and Loyalty Management)

Designing offers and campaigns for customer attraction with cashback and points management. Customer targeting using on-fly offer creation and dynamic rules. Building Predictive and analytics model for customer, location retain and product, offer recommendation.

Key Achievements
  • β€’ Designed scalable transaction layer for realtime and batch processing
  • β€’ Designed Drools based execution engine to execute custom rules created from blockly
  • β€’ Designed Incremental Data Processing Pipeline in Data Lake
JavaSpring BootDynamoDBLambdaFargateDroolsBlocklyApache HudiPySparkAWS
πŸ“ˆ
Pythonscikit-learnApache Livy +3

Debt Risk Hotspots ML Model

Discover patterns of features of accounts that are conducive to bad debt.

Key Achievements
  • β€’ Interpreting model to understand similar bad debt accounts
  • β€’ Defining similarities of accounts quantitatively by using feature importance
  • β€’ Stratification to understand characteristics of clusters
Pythonscikit-learnApache LivySparkCatBoostSHAP
πŸ‘₯
PythonPandasOR-Tools +1

Patient Scheduling Constraint Optimizer

Recommend ideal patient schedules based on patient preferences and optimal use of resources.

Key Achievements
  • β€’ Building a Patient Scheduling model to efficiently schedule medical appointments
  • β€’ Using patient preferences, medical rules, time slots and facility availability as constraints
  • β€’ Solving complex combinatorial optimization using OR-Tools
PythonPandasOR-ToolsDocker
πŸ“ˆ
Pythonpandasscikit-learn +3

Bad Debt Risk Advisor ML Model

Building predictive model to determine bad debt for Hospital Billing and Professional Billing

Key Achievements
  • β€’ Building a model for predicting bad debt accounts
  • β€’ Developing KPI for measuring model performance
  • β€’ Building jobs for distributed data processing in Hive Using Spark
Pythonpandasscikit-learnApache LivySparkCatBoost
🧠
JavaPythonXgBoost +3

Predictive User Engagement on Tax Filing System

Improve user engagement and increase Tax Filing by using the ML model.

Key Achievements
  • β€’ Building model to handle historical data and click through data both
  • β€’ Building a predictive model for user intervention
  • β€’ Improving model response time in Production to handle peak season load
JavaPythonXgBoostMongoDBDockerAWS

Technologies & Tools I Work With

Amazon Bedrock
Python
Amazon SageMaker
PyTorch
AWS Lambda
DynamoDB
Apache Spark
Docker
Scikit-learn
Redis
CloudWatch
Step Functions

Professional Experience

My journey through the tech industry in field of machine learning, applied science and AI engineering, across leading companies in AI and enterprise software.

Machine Learning Engineer II

Amazon.com Services LLC
July 2022 - Present
San Diego, California, USA

Leading development of intelligent disbursement systems and anomaly detection for seller payments failures.

Key Projects
BSOR GenAI Agent

Entry point and orchestrator for all GenAI use cases for Selling Partner Event Aggregation- Simplified Onboarding Migration for already onboarded events, E2E testing across multiple Services, Customer Asks and Operations.

Large Language Models (LLM)MCPFine TuningRetrieval-Augmented Generation (RAG) +4 more
  • ● Building unified knowledge base and Retrieval system using RAG
  • ● Creating MCP Gateway service for existing Coral based Services(4+)
Transactional Deferral Anomaly Detection

Real-time anomaly detection system to detect incorrect deferral type (computed by SPYDER withholding engine) for Selling partner financial transactions.

AutoMLAutoGluonDataLakeApache Iceberg +1 more
  • ● Designing guardrail to check incorrectly released or deferred transactions
  • ● Utilized event features from Seller Event Aggregator and Deferral Policies
Intelligent Payment Circuit Breaker System for Real-Time Payment Anomaly Detection and Mitigation

Intelligent Request Gating and Anomaly Detection of Seller Disbursement handling High volumne of Monthly Disbursements using ML models for failure prediction, pending anomaly detection, bank grouping, and result code clustering

JavaLambdaGlueStep Function +11 more
  • ● Designed scalable system predicting anomaly detection for Selling Partner Disbursements
  • ● Training machine learning models for predicting anomalous scores based on past patterns
OptimalCharge: Spatio-Temporal ML for Predictive Selling Partner Collections

ML-powered charge timing optimization system for Amazon Seller Debt Manager (SDM) to improve debt recovery rates and charge success rates through intelligent retry scheduling

PythonSageMakerLambdaStatistical Voting Classifier +5 more
  • ● Designed statistical voting classifier ensemble model to predict optimal charge retry times for failed transactions
  • ● Improved overall charge success rate from 40% baseline using temporal and regional features (Region, Card Type, Hour, Day)

Software Engineer II

Amazon Development Centre (India)
September 2020 - July 2022
Hyderabad, Telangana, India

Designed scalable systems for journal processing and AWS native service migration with focus on distributed data processing.

Key Projects
SPURSH

Aggregates transaction details into journal entries that are posted to the General Ledger

JavaFargateBatchOFA +3 more
  • ● Designed scalable system for JournalPostmanService and JournalPostmanPreprocessor
  • ● Removed Journal status publishing and ticketing dependency from OFA

Product Engineering Architect

Loyalty Juggernaut
June 2019 - September 2020
Hyderabad, Telangana, India

Architected cloud platform for customer engagement and loyalty management with predictive analytics, building ML models for customer retention and product recommendation.

Key Projects
Gravty (SaaS for Customer Engagement and Loyalty Management)

Designing offers and campaigns for customer attraction with cashback and points management. Customer targeting using on-fly offer creation and dynamic rules. Building Predictive and analytics model for customer, location retain and product, offer recommendation.

JavaSpring BootDynamoDBLambda +6 more
  • ● Designed scalable transaction layer for realtime and batch processing
  • ● Designed Drools based execution engine to execute custom rules created from blockly

Senior Software Development Engineer

CognitiveScale
November 2017 - June 2019
Hyderabad, Telangana, India

Building scalable and responsible AI Applications using ML models and evaluating them on various KPIs, then deploying as microservices.

Key Projects
Debt Risk Hotspots ML Model

Discover patterns of features of accounts that are conducive to bad debt.

Pythonscikit-learnApache LivySpark +2 more
  • ● Interpreting model to understand similar bad debt accounts
  • ● Defining similarities of accounts quantitatively by using feature importance
Patient Scheduling Constraint Optimizer

Recommend ideal patient schedules based on patient preferences and optimal use of resources.

PythonPandasOR-ToolsDocker
  • ● Building a Patient Scheduling model to efficiently schedule medical appointments
  • ● Using patient preferences, medical rules, time slots and facility availability as constraints
Bad Debt Risk Advisor ML Model

Building predictive model to determine bad debt for Hospital Billing and Professional Billing

Pythonpandasscikit-learnApache Livy +2 more
  • ● Building a model for predicting bad debt accounts
  • ● Developing KPI for measuring model performance
Predictive User Engagement on Tax Filing System

Improve user engagement and increase Tax Filing by using the ML model.

JavaPythonXgBoostMongoDB +2 more
  • ● Building model to handle historical data and click through data both
  • ● Building a predictive model for user intervention

Senior Software Development Engineer

Cadence Design Systems
February 2017 - October 2017
Noida, Uttar Pradesh, India

Developed EDM (Enterprise Data Management) solutions for collaborative library and design data management.

Key Projects
Allegro EDM Solutions

Collaborative Library and design data management system

J2SESwingTinker PopSQLg +2 more
  • ● Involved in development of EDM (Enterprise Data Management)
  • ● Designed DAO layer to support SQL, NoSQL and Graph Databases

Senior Software Development Engineer

Intel Security (McAfee)
July 2015 - February 2017
Gurgaon, Haryana, India

Worked on Application Control and Change Control with Global Threat Intelligence systems.

Key Projects
Application Control and Change Control

Global Threat Intelligence system for application security

J2EEMFS (Spring based Framework)JavaScriptMS SQL
  • ● Drove several features independently and contributed to end-to-end delivery
  • ● Led feature to support SHA-256 for Rule Groups and Policy Discovery

Research Projects

Academic research in Machine Learning, Computer Vision, and NLP during my M.Tech at IIT Delhi and independent ML research projects with open-source contributions.

M.Tech Thesis
πŸŽ“

Assessment of Autism Spectrum Disorder in Toddlers using Speech Features

Supervisor: Dr. Santanu Chaudhury, Dept. of Electrical Engg, IIT Delhi

Designing an Android App to collect voice sample and store in the cloud

Key Contributions
  • β€’ Analysis of Speech Samples using Spectrogram and Scalogram
  • β€’ Feature Extraction using Discrete Wavelet Transform and Discrete Wavelet Packet Analysis
  • β€’ Classification of speech samples using SVM, Random Forest, HMM, CNN classifiers
  • β€’ Application of Deep Learning Convolutional Neural Network for Feature Learning and Classification
Pythonnumpyscipyscikit-learncaffepylearn
Fake Profile Detection using Machine Learning
Research Project

Fake Profile Detection using Machine Learning

Detect fake profiles in online social networks using multiple machine learning techniques

Key Contributions
  • β€’ Implemented Support Vector Machine, Neural Network, and Random Forest algorithms
  • β€’ Developed comprehensive fake profile detection system for social media platforms
  • β€’ Created Jupyter notebooks for interactive analysis and model comparison
  • β€’ Achieved high accuracy in distinguishing authentic vs fake social media profiles
Pythonscikit-learnpandasnumpymatplotlibpybrain
Twitter Sentiment Analysis using Machine Learning
Research Project

Twitter Sentiment Analysis using Machine Learning

Sentiment analysis of tweets using machine learning and natural language processing techniques

Key Contributions
  • β€’ Implemented Naive Bayes and SVM models for sentiment classification
  • β€’ Developed comprehensive text preprocessing pipeline with stop words removal
  • β€’ Created lexicon-based sentiment analysis using positive/negative word dictionaries
  • β€’ Built scalable sentiment prediction system for real-time Twitter data analysis
Pythonnltkscikit-learnpandasnumpy

Research Specializations

🧠
Machine Learning
Deep Learning, CNN, SVM, Random Forest, HMM classifiers for autism detection
🎡
Signal Processing
Speech analysis using Spectrogram, Scalogram, and Wavelet Transform
πŸ’¬
NLP & Social Media
Sentiment analysis, profile detection, and social media analytics
10+
Years Experience
4
Research Projects
11
ML models in Production
IIT
Delhi Alumni
Open Source Contributions
67+ GitHub Stars
87+ Forks
Open Source Projects

Academic Projects

Course projects and academic assignments covering web development, mobile applications, security systems, and data processing during my academic journey.

πŸ“Έ
Academic Project

Image Encryption & Decryption and Transformation

Secure & Compressed image transfer system

Key Features
  • β€’ Implemented secure image transfer with compression
JavaCryptography
πŸ‘©β€βš•οΈ
Academic Project

Aayush

A Web platform where doctors and patients can interact, so that patients could get help online and they could find best doctors around.

Key Features
  • β€’ Built complete doctor-patient interaction platform
J2EEStrutsWeb Development
🌀️
Academic Project

Weather Forecasting Application

Using Yahoo Weather API and support for Offline Queries

Key Features
  • β€’ Implemented weather forecasting with offline support
API IntegrationMobile Development
πŸ“Έ
Academic Project

Illuminance Correction

Android App for Illuminance Correction on Image using OpenCV for Android

Key Features
  • β€’ Developed mobile app for image correction
AndroidOpenCVImage Processing

Academic Skills Developed

🌐
Web Development

J2EE, Struts Framework, Full-stack development

πŸ“±
Mobile Development

Android apps, OpenCV integration, Image processing

πŸ”
Security & Encryption

Cryptography, Secure data transfer, Compression

πŸ› οΈ
API Integration

External APIs, Data processing, Offline capabilities

Want to see more projects?

Recommendations & Shout Outs

What colleagues, managers, and collaborators say about working with me across Amazon, AI startups, and enterprise software companies.

VM
πŸ“¦
Vishnu Mohan

Software Development Manager at Amazon

Amazon Former Manager
"

I strongly recommend Harshit Gupta, who was a key contributor on my team at Amazon. Harshit consistently demonstrated deep technical strength, strong ownership, and the ability to deliver measurable business impact. He led OptimalCharge and also architected the Intelligent Payment Circuit Breakerβ€”both of which became foundational systems for our payments platform. Harshit brings expertise in applied ML, production-scale systems, and rigorous experimentation practices. What truly stands out is his ability to translate complex business problems into elegant, scalable solutions while maintaining a clear focus on outcomes. Any team would be fortunate to have him

"
Key Skills Highlighted:
Applied MLSystem ArchitectureExperimentationLeadership
GR
πŸ“¦
Ganesh R

Software Development Manager and Bar Raiser at Amazon

Amazon Former Manager
"

Harshit has a perfect blend of engineering and machine learning expertise. I’ve seen him tackle complex problems with creativity and deliver highly effective solutions. He identified the optimal timing to charge sellers, significantly improving success rates and ensuring the sustainability of Amazon’s funds flow. He also introduced an innovative LLM-driven migration approach projected to save nearly 2,000 SDE weeks. Harshit is an exceptional problem-solver and a true asset to any team β€” I would highly recommend him.

"
Key Skills Highlighted:
Machine LearningProblem SolvingInnovationLeadership
SG
πŸ“¦
Siddharth Gupta

Engineering Leader | Amazon bar raiser

Amazon Former Colleague
"

I had the privilege of working with Harshit at Amazon, and our professional relationship dates back to our college days at KNIT Sultanpur. Harshit is an exceptional software engineer who combines deep technical expertise in Machine Learning and distributed systems with outstanding problem-solving abilities. At Amazon, he led ML-powered anomaly detection systems for seller disbursements, handling massive transaction volumes with remarkable reliability and innovative approaches. He's an excellent team player and mentor who elevates everyone around him, and his ability to handle complex, high-volume systems under pressure makes him invaluable. I wholeheartedly recommend Harshit for any Senior Software Development Engineer role.

"
Key Skills Highlighted:
Machine LearningDistributed SystemsProblem SolvingTeam Leadership
KT
🧠
Kranthi Tej

Program Manager

CognitiveScale Former Manager
"

Harshit is a very talented individual who comes up with innovative methodologies to solve ML problems. He is very good at implementing these models in live applications. He did an amazing job in enhancing risk prediction solution with significant amount of operational constraints. Very capable individual to be on a team dealing with challenging problems.

"
Key Skills Highlighted:
Machine LearningRisk PredictionInnovation
PK
🧠
Prajna Kandarpa

Engineering Manager

CognitiveScale Former Manager
"

Harshit is a delightful engineer and one of the most amicable people I've worked with. His experience in signal processing using deep learning techniques came in quite handy for our work at CognitiveScale. I must add that he singlehandedly built an entire data science pipeline capable of handling multiple hundred requests/sec traffic throughput. A great problem solver and very reliable. Any software engineering team would love having a member like Harshit on their roster.

"
Key Skills Highlighted:
Deep LearningSignal ProcessingData Pipeline
PJ
πŸ›‘οΈ
Pankaj Joshi

System Test & DevSecOps Strategist

Intel Security Senior Colleague
"

Harshit joined my team 1.5 years back, Harshit came with very strong fundamental and conceptual knowledge, and soon acquired good understanding of the product & process. I found him always motivated to grab the complex work and he has shown his innovative ways to simplify things. He is a problem solver and an asset to my team.

"
Key Skills Highlighted:
Problem SolvingDevSecOpsInnovation
AC
πŸ›‘οΈ
Amit Chopra

Senior Program Manager

Intel Security Team Colleague
"

Harshit is very committed, exhibits true enthusiasm at work and is a very good team player. He is technically very sound. He was a tremendous asset to our group and was always capable of handling multiple assignments. He is quick to understand things and has good debugging skills. He also shows sense of urgency and is able to complete his work on time. Harshit at many times has stretched to meet tight deadlines.

"
Key Skills Highlighted:
Team LeadershipDebuggingProject Management

πŸŽ‰ Internal Shout-Outs

Recognition from Amazon colleagues for exceptional work

πŸ“¦
Udit Khimesra
Senior Software Engineer β€’ Amazon
πŸ† Internal Recognition
October 27, 2021
"

Thanks Harshit for doing seamless delivery of JPPS Optimization Change. It was a complex change and required working closely with AE team.

"
Amazon Leadership Principles Demonstrated:
Deliver ResultsOwnershipInsist on the Highest Standards
2 likes
🎯 Complex Delivery Achievement

Professional Resume

Comprehensive overview of my qualifications, certifications, and achievements in Machine Learning, Applied Science and AI Engineering.

Technical Skills

πŸ€– Machine Learning & AI
Machine LearningDeep LearningComputer VisionNatural Language ProcessingAnomaly DetectionFeature EngineeringModel InterpretabilityFine Tuning
πŸ“š ML Frameworks
Scikit-learnPyTorchKerasHuggingFaceXgBoostCatBoost
πŸ’» Programming Languages
PythonJava 8JavaScriptR
☁️ AWS Cloud
LambdaFargateGlueStep FunctionsCloudWatchCDKBatch
πŸ—„οΈ Databases
PostgreSQLMongoDBRedisDynamoDB +3 more
πŸ“Š Big Data & Analytics
Apache HiveApache SparkApache HudiPySparkHadoop Ecosystem
πŸš€ Specializations
Distributed SystemsMicroservicesData Processing PipelinesReal-time AnalyticsPredictive Modeling

Education

πŸŽ“
2013-2015
M.Tech in Computer Technology

Indian Institute of Technology, Delhi

GPA: 8.43/10

Thesis: Assessment of Autism Spectrum Disorder in Toddlers using Speech Features
🎯
2009-2013
B.Tech in Computer Science & Engineering

Kamla Nehru Institute of Technology, Sultanpur

Grade: 81.16%

Certifications

☁️
AWS Certified AI Practitioner
Amazon Web Services (AWS)
Machine Learning
2025 View Credential β†—
🧠
Generative AI with Large Language Models
DeepLearning.AI
LLMs PEFT Fine Tuning LoRA PPO Quantization
2025 View Credential β†—
πŸ€—
AI Agents Fundamentals
Hugging Face 2025 View Credential β†—
πŸ“œ
Machine Learning
Stanford University, Coursera 2015
πŸ“œ
Algorithms: Design and Analysis, Part 1
Stanford University, Coursera 2015
πŸ“œ
Algorithms: Design and Analysis, Part 2
Stanford University, Coursera 2015
πŸ“œ
Image and Video Processing
Duke University, Coursera 2015
πŸ“œ
J2EE Struts with Hibernate
Professional Certification 2016
πŸ“œ
Getting and Cleaning Data
Data Science Certification 2016
πŸ“œ
R Programming
Statistical Computing Certification 2016

Awards & Achievements

πŸ†

BugBash for MAC 8.0 Award - Intel Security

πŸ†

3rd prize winner in Mind-Hunters event, National Level Techfest, Effluence

πŸ†

2nd prize winner in Fill Up The Code event, Tech Carnival By Computer Society Of India

πŸ†

Consolation prize in Paper Presentation on "WEB 3.0", organized by I.E.I.

πŸ†

Consolation prize in Technical Wordsworth event organized by Computer Society of India

πŸ†

State Level project on Water Resource Conservation in National Children's Science Congress

Professional Qualities

🧠

Self-motivated team player with strong analytical, problem solving, planning and resource optimization skills

πŸ’‘

Possess creativity & innovation, flexibility & adaptability and interpersonal skills with leadership qualities

πŸš€

Passionate about learning new technologies and tools and mastering those in a very short time span

Download Full Resume

Get the complete PDF version of my resume with detailed project descriptions and technical specifications.

Download PDF Resume