profile-pic

Ankur Dhuriya

A results-oriented Applied AI/ML Engineer with over five years of experience in crafting sophisticated speech recognition, Natural Language Processing (NLP), and Generative AI solutions. I possess a proven ability to deliver impactful, high-quality outcomes that align with business objectives. My technical proficiency encompasses Python, PyTorch, TensorFlow, and Hugging Face, complemented by expertise in cloud environments such as AWS and Azure. I am deeply passionate about driving innovation within the AI landscape and thrive in collaborative environments with dynamic teams.A results-oriented Applied AI/ML Engineer with over five years of experience in crafting sophisticated speech recognition, Natural Language Processing (NLP), and Generative AI solutions. I possess a proven ability to deliver impactful, high-quality outcomes that align with business objectives. My technical proficiency encompasses Python, PyTorch, TensorFlow, and Hugging Face, complemented by expertise in cloud environments such as AWS and Azure. I am deeply passionate about driving innovation within the AI landscape and thrive in collaborative environments with dynamic teams.


  • Role

    Senior Data Scientist

  • Years of Experience

    5 years

Skillsets

  • Generative AI
  • Transformer Models
  • Text-to-speech
  • LangChain
  • Hugging Face
  • ETL pipelines
  • data transformation
  • C++
  • Automatic Speech Recognition
  • Generative AI
  • MLOps
  • TensorFlow
  • PyTorch
  • NLP
  • Python
  • AWS - 3 Years
  • MLOps
  • TensorFlow
  • PyTorch
  • NLP
  • Python
  • TensorFlow - 3 Years
  • PyTorch - 3 Years
  • Azure - 3 Years
  • SQL
  • Shell
  • Python - 5 Years
  • MLOps
  • Deep Learning

Professional Summary

5Years
  • Dec, 2024 - Present 10 months

    Senior Data Scientist

    GEP Worldwide
  • Dec, 2022 - Dec, 20242 yr

    Data Scientist

    Builder.ai
  • Sep, 2020 - Dec, 20222 yr 3 months

    Data Scientist

    ThoughtWorks
  • Feb, 2020 - Aug, 2020 6 months

    Data Science Intern

    ThoughtWorks

Applications & Tools Known

  • icon-tool

    NumPy

  • icon-tool

    Pandas

  • icon-tool

    Scikit-Learn

  • icon-tool

    SciPy

  • icon-tool

    Matplotlib

  • icon-tool

    PyTorch

  • icon-tool

    AWS

  • icon-tool

    Azure OpenAI

  • icon-tool

    LangChain

  • icon-tool

    CrewAI

  • icon-tool

    LlamaIndex

  • icon-tool

    Docker

  • icon-tool

    Git

  • icon-tool

    NLP

Work History

5Years

Senior Data Scientist

GEP Worldwide
Dec, 2024 - Present 10 months
    Led the implementation of an LLM agentic evaluation framework, designed guardrails and applied security measures, and optimized LLM performance using MLOps best practices.

Data Scientist

Builder.ai
Dec, 2022 - Dec, 20242 yr
    Developed LLM solutions using RAG, created multilingual AI applications, and integrated ASR frameworks with various databases for enhanced system functionalities.

Data Scientist

ThoughtWorks
Sep, 2020 - Dec, 20222 yr 3 months
    Engineered ASR solutions for 18 Indian languages, Text-to-Speech systems for 8 languages, and optimized speech-to-text transcription pipelines.

Data Science Intern

ThoughtWorks
Feb, 2020 - Aug, 2020 6 months
    Implemented a gender classification model, created clustering models for speech data, and enhanced data categorization workflows.

Achievements

  • Delivered Automatic Speech Recognition on 18 Indian Languages with error rates better than major cloud speech-to-text services.
  • Built a Toolkit for speech data processing and building speech recognition models.
  • Delivered highly efficient Text-to-Speech on 8 Indian Languages using public data TTS data.
  • Built an LLM application for accurate question answering on call transcriptions.
  • Developed services and applications involving autonomous multi-agent AI systems.

Major Projects

2Projects

CLSRIL-23: Cross Lingual Speech Representations for Indic Languages

    Research publication focusing on cross-lingual speech representations for Indian languages.

Vakyansh: ASR Toolkit for Low Resource Indic Languages

    Developed an ASR toolkit designed for low-resource Indian languages.

Education

  • MS in Machine Learning & AI

    Liverpool John Moores University
  • B. Tech in Computer Science & Engineering

    DIT University, Dehradun

Certifications

  • Data science professional certificate | ibm

  • Data science for engineers | iit madras