profile-pic

Sarika Mishra

Versatile and Results-Driven Professional with 4 years of experience in roles spanning Software Engineering, Prompt Engineering, Data Engineering, and Data Science. Proven track record in executing projects across healthcare, manufacturing, finance, and HR domains. Proficient in Python coding, utilizing tools such as BeautifulSoup, Selenium, Pandas, NumPy, Scikit, and Requests. Experience with Django framework and AWS services including S3 Bucket, Lambda, Lex, and CloudWatch. Expertise in data visualization using Tableau, MS Excel, PostgreSQL, and Power BI.

Demonstrates a strong commitment to project success by leveraging expertise and providing support to teammates. Proficient in CI/CD with GitHub and knowledgeable in SonarQube for code quality and best practices. Actively hones problem-solving skills through coding challenges on HackerRank. Committed to continuous learning, expanding leadership capabilities, and storytelling through online courses. Completed training in Data Science, AI, and Machine Learning on platforms such as Coursera, YouTube, and Udemy.

Adept at networking and building connections, excelling as a collaborative team player. Actively pursues continuous learning in leadership, storytelling, and technology advancements.

  • Role

    Gen AI Engineer

  • Years of Experience

    3.6 years

Skillsets

  • Supervised Learning
  • Data preprocessing
  • embeddings
  • ETL pipelines
  • Flask
  • gans
  • Kubernetes
  • LLMs
  • Scikitlearn
  • data augmentation
  • synthetic data generation
  • Unsupervised Learning
  • vaes
  • AWS
  • Azure
  • LangChain
  • rag
  • REST
  • Django
  • Python
  • SQL
  • Python
  • SQL
  • Docker
  • Feature Engineering
  • GCP
  • Airflow
  • Python - 4.0 Years
  • FastAPI
  • MLFlow
  • Prompt Engineering
  • PyTorch
  • TensorFlow
  • Transformers
  • chunking

Professional Summary

3.6Years
  • Feb, 2022 - Present4 yr 1 month

    Data Scientist

    HashedIn by Deloitte

Applications & Tools Known

  • icon-tool

    My SQL

  • icon-tool

    MS-Office

  • icon-tool

    Tableau

  • icon-tool

    Jupyter Notebook

  • icon-tool

    MySQL

  • icon-tool

    PostgreSQL

Work History

3.6Years

Data Scientist

HashedIn by Deloitte
Feb, 2022 - Present4 yr 1 month
    Worked across multiple high-impact client projects, delivering AI solutions that blended ML models, GenAI systems, and robust MLOps practices, directly supporting business transformation and revenue growth. Banking: Designed fraud detection pipeline with ensemble models (Random Forests, Decision Trees), increasing fraud detection accuracy by 30%. Automated ETL workflows for transactional data with data engineers, reducing pipeline latency by 40%. Deployed Power BI dashboards to risk teams, cutting investigation cycles by 20%. Insurance: Built claims triage model (Logistic Regression + Gradient Boosting) integrated with LLM-based summarizer (AWS Comprehend Medical + GPT) for contextual insights. Exposed models through FastAPI microservices on AWS Lambda for real-time claim decisioning. Delivered solution that reduced manual claim review efforts by 35%. Retail: Developed GenAI chatbot using LangChain + vector search (Milvus), enabling personalized, multi-turn product discovery. Boosted customer conversion rates by 15-20% through contextual, memory-driven conversations. Partnered with marketing to align outputs with sales KPIs. Healthcare: Built ML pipeline using Logistic Regression & Random Forests on EHR data; improved readmission risk prediction by 12%. Designed interpretable features for physicians to understand clinical drivers of readmission. Delivered insights via interactive Power BI dashboard for hospital executives. Healthcare Research: Finetuned domain-adapted BART model on biomedical corpora for summarizing research papers. Deployed solution as a Streamlit app, reducing average literature review time by 40%. Helped medical researchers accelerate discovery and improve research throughput.

Major Projects

4Projects

Agentic GenAI Claims Intelligence System

    Designed an Agentic AI system for healthcare insurance where multiple LLM-based agents autonomously handled claim intake, document understanding, validation, and prioritization. Built tool-using agents with Python and LangChain, implemented RAG pipeline, and deployed scalable backend services using FastAPI and Docker.

Agentic Conversational Recommendation System

    Developed a multi-agent conversational AI system for retail e-commerce, handling user intent detection, product retrieval, ranking, and response generation. Used embeddings-based semantic search and deployed cloud-native services on AWS.

Fraud Detection with Agent-assisted Explainability

    Built a real-time fraud detection pipeline for banking and payments, combining ML models with an Agentic AI explanation layer. Developed LLM-powered agents for transaction analysis and explanation generation.

Clinical Text Intelligence & Autonomous Research Assistant

    Designed an Agentic AI research assistant for healthcare analytics to retrieve, summarize, and contextualize clinical notes and medical literature. Implemented NLP and LLM-based techniques for document retrieval and insight generation.

Education

  • Bachelor of Technology Electronics and Communication Engineering

    Bhagwan Parshuram Institute of Technology (2022)

Certifications

  • Python master class- udemy

  • Machine learning freecodecamp

  • Sql course codecademy

  • Audio processing research internship iiit allahabad

  • Artificial intelligence internship training 1stopai

  • Html, css, github codecademy