Shivam Choudhary

Senior Data Scientist - Natural Language Processing

About Me

Shivam Choudhary is an experienced Data Scientist specialized in NLP. A proponent of continuous learning and self improvement. Adept at handling ambiguous or undefined challenges through strong problem solving abilities. Interested to participate in cutting edge research in machine learning applications, software development projects and develop real world solutions for large scale problems.

Bio

Email
shivamchoudhary2014@gmail.com
Cell
+91 9062330628
Alternate
+91 8240805244
Location
Hyderabad, INDIA

Professional Skills

Scripting Language
Frameworks
GenAI Framework
Databases
Cloud
Operating System
Tools
IDE
Python, Linux
Tensorflow, Pytorch, FastAPI, Flask, PyUnit, GraphQL
LangChain, LlamaIndex, LangGraph, CrewAI, AutoGen
MySQL, PostgreSQL, MongoDB, Aurora DB, Document DB, VectorDB
Azure OpenAI, Cognitive API, Lambda Function, Sagemaker
Window, Linux
Git, Bitbucket, Google Colab, Jupyter Notebook, Postman, Robo3T
Spyder, PyCharm, VSCode

Certifications

 
Azure Data Scientist (DP-100)
Microsoft Certified AI Engineer
Microsoft Certified IOT Developer
AZ-900: Azure Fundamental
AI-900: AI Fundammental
Issuer
Microsoft
Microsoft
Microsoft
Microsoft
Microsoft
 
DP-900: Data Fundamental
MTA: Programming using Python
AWS Cloud Practitioner
Oracle cloud infrastructure 2020 Associate
Databricks Certified GenAI Engineer
Issuer
Microsoft
Microsoft
Amazon
Oracle
Databricks

Work Experience

Senior Data Scientist at EY
Feb, 2025 - Till-Date
  • Hands on experience in building LLM based application using LangChain, LlamaIndex, LangGraph, CrewAI, AutoGen, Multi-Agent System
    • Leveraging agent builder aws bedrock environment to build multi agentic Framework
    • Making banking & Financial service lifecycle easy to understand
    • Domain: Banking & Financial Services
    • Technology used: Python, Sagemaker, Bedrock
Data Scientist at Fractal Analytics
Jul, 2023 - Feb 2025
  • Developed a platform to analyze a high volume of chat and call transcripts, extracting insights on customer satisfaction rates, repeat call rates, and SLA adherence for the Customer Satisfaction Index. Automated approximately 85% of the process, enabling efficient generation of these insights.
    • These key performance indicators (KPIs) provided hardware/device stakeholders with critical information to inform strategic decisions and enhance customer experiences.
    • Leveraged Unsupervised techniques like: Clustering, HDBSCAN, UMAP, BERTopic
    • Used LLM and Statistical technique to support business driven decisions
    • Domain: Entertainment & Media
    • Technology used: Python, Clustering, BERTopic, ULM, Statistical technique, LLM(Gemini 1.5 Falsh)

  • Designed a cloud native analytics platform which will help business to idnetify the potential user for the product, help data scientist to bring their own data and perform complete activities from Feature engineering to Model Deployment, help Buisness Anlayst to understand the reason which led to particular inference using SHAP.
    • Domain: Insurnace
    • Technology used: Python, Sagemaker, Kubernetes, Glue, Lambda, Step Function
Applied & Data Scientist at Carelon Global Solutions
Sept, 2022 - Jul, 2023
  • Designing a product which will help enrolled members to get appropriate care plan recommendation as per short survey under the Care Management
    • Implemented FastAPI to productionize the ML models as a Microservices
    • Designed the complete pre-processing and post processing business logic in the form of ENSO data pipeline
    • Domain: Health Care
    • Technology used: FastAPI, mongoDB, RedShift, Quay, ENSO
Data Scientist at Deloitte USI
Jan, 2021 - Sept, 2022
  • Project is all about migration of Big Data Stack to AWS, Snowfalke.
    • Upskilled to Spark and Snowflake for project Delivery
    • Involved in complete Life Cycle, identified gaps & risk for smooth delivery
    • Identified and designed several layer of snowflake to achieve transformation logic for business
    • Domain: Health Care
    • Technology used: AWS, Snowflake, Scala, Spark, Microservices

  • Project is all about application modernization of one of the world largest health provider client, reduced latency to double digit metrics
    • Responsible for development and unit test of a web services (Data-access layer), python-based services (Microservices), deployed on Kubernetes cluster
    • Work with cross functional Teams to ensure proper working of endpoint developed (RESTful API)
    • Domain: Health Care
    • Technology used: AWS, mongoDB, Flask, python, RESTful, GraphQL, Pytest, Docker, Kubernetes
Data Science Associate at Infosys Ltd
Jan, 2020 - Dec, 2020
  • Project is all about Digitization of legacy based paper application:
    • Involved in complete software development life cycle (Agile Methodologies), POC, requirement gathering, development and unit test
    • Responsible for complete Data Science Solution to be implemented and migrated from legacy application
    • Implemented a data Pipeline which will take care of Data Extraction, Data Cleansing, Data Wrangling, Machine Learning Model Training
    • Automated few processes which reduce SME(Subject Matter Expert) time by creating Entity Recognition(TF-IDF, Rake-NLTK based model), Recommendation Engine and Window Batch Scripting
    • Domain: Bio-Pharmaceutical
    • Technology used: Azure, Python, PostgreSQL, Machine Learning, NLP
Trainee at Infosys Ltd
Sept, 2019 - Dec, 2019
  • Experienced world class training backed with project work:
    • Classroom Training on: Python, SQL, Automation, Scripting
    • Micro Learning Program on: Machine Learning, Deep Learning, Data Science, Azure ,Docker, Microservices and many more

Education

Master of Computer Application from University of Mysore
2022 - 2024
During this period of time deep dived into Data Warehousing, Artificial Intelligence, Data Governance and Application Security which enabled me to design the Enterprise AI enabled solution.
Bachelor of Computer Application from West Bengal University of Technology
2016 - 2019
During this period of time started with Computer Science foundation i.e, C, C++, Computer Architecture and Design, Operating System, Software Engineering, OOPS concept ,Engineering Mathematics, Computing Mathematics. Implemented one major project using .Net and two minor project using PHP and MySQl.