Hardik Nahata

馃憠 Staff ML Engineer @ PayPal

馃搷Silicon Valley, CA

I am a Staff Machine Learning Engineer based in Silicon Valley, specializing in generative AI, applied AI, and large-scale personalization systems. With a Master's in Computer Science (Artificial Intelligence) from Northeastern University, Boston, I have built deep expertise in AI-driven innovation.

At PayPal, I design and develop sophisticated shopping personalization systems and scalable machine learning infrastructure that power AI applications for millions of users. My work involves solving complex engineering challenges, optimizing AI models for real-world impact, and integrating machine learning into large-scale production systems. Having worked in both fast-paced startups and Big Tech, I understand the challenges of building and scaling AI in diverse environments.

Beyond engineering, I am passionate about mentoring and advising professionals and aspiring ML engineers, helping them break into AI and advance their careers. If you're looking to explore new opportunities, book a 1:1 call today using the link in the navigation bar.

Hardik Nahata

Skills

Generative AI / Agentic Frameworks
Large Language Models (LLMs)
Machine Learning & Deep Learning
Natural Language Processing (NLP)
Python / SQL
HTML / CSS / JS / Node

Areas of Expertise

6240

Hours Worked

312000

Lines of Code

512

Coffee Meetups

8

Awards Won

Projects



portfolio image
portfolio image

NYC 311 Complaint Type Prediction

311 is a service that New York City residents can use to make non-emergency reports. The NYC 311 Dataset was made public by NYC OpenData and has about 40 columns and around 21 Million Rows.

路 I have experimented with Traditional ML Models and Transformers to Predict the Type of the Complaint with the complaint text leveraging Natural Language Processing.

路 Please use the link below to access the Jupyter Notebooks and Code.


Technologies:


Completion:

Link:

Data Science
Python

December 2021

Github Repo

portfolio image portfolio image

Analysing User Behaviour and Optimizing the Workflow using Machine Learning

路 Created an e-Commerce website selling trendy clothing , got users to browse through it, collected their browsing data.
路 Further, applied Data Preprocessing to apply various Machine Learning Algorithms like KNN etc.
路 Used ClickStream and plotted Seaborn Heatmaps to understand the user browsing behaviour and reported observations to design team which could improve the browsing experience of the users.


Technologies:



Completion:

link:

Web Dev
Data Analytics
Machine Learning

April 2019

Coming Soon!

portfolio image portfolio image

Movie Recommender System

路 Based on a movie dataset with large number of entries, this system predicts the best recommendations for a user through his/her past ratings.

路 Used Item-Based Collaborative filtering and found the correlation values between every two movie ratings to predict the recommendations.

路 Used Pandas library for operating on the .csv files.


Technologies:


Completion:

Link:

Data Science
Python

August 2018

Coming Soon!

portfolio image portfolio image

Drowsiness Detection during Commute

路 Created a prototype for a drowsiness detection system using a Webcam.

路 An alarm is sounded when the driver falls asleep.

路 Used the concept of Eye Aspect Ratio introduced by Soukupov谩 and 膶ech in their 2016 paper, "Real-Time Eye Blink Detection Using Facial Landmarks".


Technologies:


Completion:

Link:

Computer Vision
Keras

February 2019

Github Repo

portfolio image portfolio image

License Plate Number Detection

路 Performed various image processing algorithms like Dilation, Erosion, Edge Processing to preprocess the image

路 Smoothened the Vertical Histogram through Low Pass Filtering

路 Found probable segments for License Plate and performed Region of Interest Extraction.

路 The code was developed in MATLAB.


Technologies:


Completion:

Link:

Image Processing
MATLAB

October 2017

Github Repo

portfolio image portfolio image

Handwritten Digit Recognition using CNN

路 The MNIST dataset contains a total of 70,000 handwritten digit images to train and test.

路 Trained the data through a set of layers including Convolutional layer, Rectified linear unit, MaxPool layer and Fully Connected layer.

路 Achieved an accuracy of 97% on the Test set.


Technologies:


Completion:

Link:

Deep Learning
Neural Networks

August 2019

Coming Soon!

portfolio image portfolio image

Smart Parking System

路 The Ultrasonic Sensor[HC-SR04] is placed on the parking spot which determines the presence a car.

路 The signal is indicated through a High power LED placed at a certain height or on to the ceiling.

路 The setup consists of an Arduino board, sensor and LED. Hence its highly cost effective.


Technologies:

Completion:

Link:

Arduino

March 2019

Coming Soon!

Recommendations

Get in Touch

Email

hardiknahata@gmail.com

Social Media

GitHub

hardiknahata

Location

San Francisco, CA


Organizations