Sahil Wadhwa

About

Experienced Data Scientist with 5+ years at The Home Depot, previously at BlackRock and Amazon. Proficient in data science tech, specializing in ML, data integration, and pipelines. Skilled in cloud platforms and architecture. Committed to enhancing insights, user experiences, and efficiency through data-driven solutions.




Data Scientist | Transforming Data into Insights | Unlocking the Power of Analytics


  • Experience: 6+ years
  • Phone: (217) 208 6838
  • Place: San Jose
  • Grad: MS in Statistics, UIUC
  • Email: sahil24wadhwa@gmail.com


Dedicated Data Scientist passionate about crafting cutting-edge data solutions, harnessing technical skills to drive efficiency, and elevate user experiences. Committed to empowering data-driven decision-making and collaborating with diverse teams to deliver impactful results.


“It is easy to lie with statistics. It is hard to tell the truth without it.”
― Andrejs Dunkels

Skills

With a strong foundation in statistics, algorithms, and machine learning, I have honed my skills through rigorous work as an experienced Data Scientist and Machine Learning Engineer. I am dedicated to producing top-tier data-driven solutions that not only meet but surpass the specific requirements of each project. My relentless pursuit of knowledge and adaptability in dynamic technical environments guarantee the development of impactful applications that optimize efficiency, enhance user satisfaction, and contribute to business success.

Programming Languages

Python, C++, Java, R, Scala, JavaScript

Cloud Technologies

AWS: SageMaker, ECS, EC2, S3.
NVIDIA: TensorRT.
GCP: Natural Language AI, Speech-to-Text, Text-to-Speech.


Databases

MongoDB, DynamoDB, MYSQL, RDS, PostgreSQL, and MS SQL



Operating Systems

macOS, Windows and Linux.

Technologies and Frameworks

Tensorflow,Spark,PyTorch,Flask.

Tools

Git, Jira, GitHub, R studio, Visual Studio Code, IntelliJ, and PyCharm.

Experience

Embarking on a Data-Driven Exploration: Unveiling Proficiencies in Data Science and Innovation.

Summary

Sahil Wadhwa

Experienced Data Scientist with a rich background in data analysis and machine learning. Proficient in a wide range of data science technologies and frameworks, with hands-on experience in designing and implementing advanced analytics solutions. Adept at collaborating with cross-functional teams to deliver data-driven insights and innovative solutions. Committed to leveraging technical expertise gained from roles at Amazon, BlackRock, and other industry leaders to drive impactful data science projects and contribute to organizational growth.

  • Los Angeles,CA
  • (217) 208 6838
  • sahil24wadhwa@gmail.com/li>

Education

MS in Statistics

Aug 2019 - Dec 2020

University of Illinois, Urbana-Champaign

GPA: 4.0

Statistical Modeling II, Statistical Learning, Time Series Analysis, Statistical Data Management

Bachelor of Technology, Computer Science

2012 - 2016

Jamia Millia Islamia, New Delhi

GPA: 9.6

Artificial Intelligence, Data Structures, Object Oriented Programming, Database Management Systems, Operating Systems, Advance Computer Networks.

Professional Experience

Data Science Manager AI Foundations

July ’24 - Present

Capital One

  • AI Safety – Led the development and fine-tuning of Large Language Models (LLMs) using Chain-of-Thought (CoT) reasoning techniques to establish robust guardrails for both input and output. This initiative safeguarded company-wide chatbot systems from producing or responding to inappropriate, harmful, or malicious content, enhancing the ethical standards and user safety in AI-driven interactions.
  • Red-Teaming – Conducted comprehensive red-teaming exercises on LLMs, employing advanced techniques such as Tree of Attacks with Pruning, Crescendo, and Encoding attacks to test and expose vulnerabilities systematically. These rigorous evaluations helped identify potential weaknesses in the models' responses, contributing to improved security measures and the overall resilience of AI systems.

Data Scientist Enterprise Intelligence

Jan ’23 - July '24

Home Depot

  • Creating conversational AI solutions for Interactive Voice Response (IVR) and Messaging systems within Home Depot. This involves intent classification, slot fitting, and smart reply predictions.
  • Submitted a paper to EMNLP 2024 titled ”Enhancing Task-Oriented Dialogue Systems: Semantic Aware Nearest Neighbors for Real-time Intent Classification” which aims to detect customers intent in a task-oriented dialogue system.

Applied Scientist Alexa AI

Aug ’21 - Jan ’23

Amazon

  • Developed multi-modal SoTA cascaded Neural Turing Machines (NTM) based network for multi-turn fashion image retrieval with an overall improvement of 50% over previous benchmarks.
    • Using multi-turn transactions of an image and feedback text, the model retrieves the best matched image based on the interactions between the user and machine.
    This was deployed as a chatbot feature in Alexa Fashion domain and is in beta testing. Paper accepted in ICCV 2023.
  • Worked on gesture recognition for Augmented Reality (AR) on hands using object detector, landmarks extractor, and Kalman filter tracking using mobilenetV2 and efficientnet resulting in gains of over 5%.

Data Scientist

Feb ’21 - Aug ’21

Ciitizen Corporation

  • Built framework for Named Entity Recognition (NER) of seminal entities such as Diagnostic and Therapeutic Procedures using LSTM from clinical pdfs.
  • Developed Tensorflow and Kubeflow based CNN model to segregate clinical texts on the basis of importance, thereby reducing the manual read time by 2X hours.

Machine Learning Engineer Financial Modeling Group

Feb ’18 - Aug ’19

Blackrock

  • Built end-to-end word-embedding model similar to word2vec . Enabled large scale training using Ignite for async updates across multiple nodes. Deployed the code using Docker image and Kubernetes
  • Developed a novel word-level Entity Linking/Disambiguation model by using BERT and bi-LSTMs surpassing SoTA on AIDA test dataset by 2%. Model deployed using tensorlfow-serving. Accepted in AACL.

Data Scientist

June ’16 - Sept ’17

Scry Analytics

  • Relation Extraction: Developed a Convolution Neural Network(CNN) based deep learning model to identify relationships between entities in a text. Achieved F1 score of 85% surpassing previous feature-based models by 15%.
  • Implemented Big Data pipelines in Spark for fast retrieval and processing of data residing in HBase. Reduced the pipeline execution time from 3 days to 6 hours by switching the entire pipeline from Map-Reduce to Spark.

NLP Research Intern

Aug ’19 - Dec ’20

Inference Analytics

  • Created an end to end model to perform aspect-based sentiment analysis using pre-trained encoders and topic modeling techniques (LDA, Rake).

Machine Learning Engineer Intern HAMR Write Design Team

May ’20 - Aug ’20

Inference Analytics

  • Designed a web-based Machine Learning pipeline for data segmentation, wrangling and ranking.

Contact

Location:

Los Angeles,CA

Phone:

(217) 208 6838