profile-pic

Ankur Dhuriya

Ankur Dhuriya

A results-oriented Applied AI/ML Engineer with over five years of experience in crafting sophisticated speech recognition, Natural Language Processing (NLP), and Generative AI solutions. I possess a proven ability to deliver impactful, high-quality outcomes that align with business objectives. My technical proficiency encompasses Python, PyTorch, TensorFlow, and Hugging Face, complemented by expertise in cloud environments such as AWS and Azure. I am deeply passionate about driving innovation within the AI landscape and thrive in collaborative environments with dynamic teams.A results-oriented Applied AI/ML Engineer with over five years of experience in crafting sophisticated speech recognition, Natural Language Processing (NLP), and Generative AI solutions. I possess a proven ability to deliver impactful, high-quality outcomes that align with business objectives. My technical proficiency encompasses Python, PyTorch, TensorFlow, and Hugging Face, complemented by expertise in cloud environments such as AWS and Azure. I am deeply passionate about driving innovation within the AI landscape and thrive in collaborative environments with dynamic teams.


  • Role

    Data Scientist

  • Years of Experience

    5 years

Skillsets

  • Python
  • Machine Learning
  • Data Analysis
  • Information Retrieval
  • Dcoker
  • Speech Recognition
  • Generative AI
  • Algorithms
  • LLMs
  • MLOps
  • FastAPI
  • TensorFlow
  • Pytorch
  • NLP
  • Statistics
  • AWS - 3 Years
  • LLM - 3 Years
  • Natural Language Processing - 5 Years
  • TensorFlow - 3 Years
  • Pytorch - 3 Years
  • Docker - 4 Years
  • Azure - 3 Years
  • Data structure and algorithms
  • SQL
  • Shell
  • R
  • Python - 5 Years
  • MLOps
  • Deep Learning
  • C

Professional Summary

5Years
  • Dec, 2022 - Present2 yr 6 months

    Data Scientist

    Builder.ai
  • Sep, 2020 - Dec, 20222 yr 3 months

    Data Scientist

    Thoughtworks
  • Feb, 2020 - Aug, 2020 6 months

    Data Science Intern

    Thoughtworks

Applications & Tools Known

  • icon-tool

    NumPy

  • icon-tool

    Pandas

  • icon-tool

    Scikit-Learn

  • icon-tool

    SciPy

  • icon-tool

    Matplotlib

  • icon-tool

    PyTorch

  • icon-tool

    AWS

  • icon-tool

    Azure OpenAI

  • icon-tool

    LangChain

  • icon-tool

    CrewAI

  • icon-tool

    LlamaIndex

  • icon-tool

    Docker

  • icon-tool

    Git

  • icon-tool

    NLP

Work History

5Years

Data Scientist

Builder.ai
Dec, 2022 - Present2 yr 6 months
    Built an LLM application for accurate question answering on call transcriptions. Developed services and applications involving autonomous multi-agent AI systems, contributed to code creation and suggesting improvements. Created an LLM application for summarizing call transcriptions, generating meeting minutes, and implementing fastapi celery services. Applied Louvain community detection for enhanced analysis and insights. Contributed to the application of Call Diarization techniques utilizing Azure Cognitive Service and AWS Speech Service.

Data Scientist

Thoughtworks
Sep, 2020 - Dec, 20222 yr 3 months
    Delivered Automatic Speech Recognition on 18 Indian Languages with error rates better than major cloud speech-to-text services. Built a Toolkit for speech data processing and building speech recognition models. Delivered highly efficient Text-to-Speech on 8 Indian Languages using public data TTS data. Delivered Inverse Text Normalization for post processing of speech to text output.

Data Science Intern

Thoughtworks
Feb, 2020 - Aug, 2020 6 months
    Delivered Machine Learning model for classifying gender based on speech data embeddings with an accuracy of 98% on balanced data. Built a clustering model for grouping similar voices in speech data.

Achievements

  • Delivered Automatic Speech Recognition on 18 Indian Languages with error rates better than major cloud speech-to-text services.
  • Built a Toolkit for speech data processing and building speech recognition models.
  • Delivered highly efficient Text-to-Speech on 8 Indian Languages using public data TTS data.
  • Built an LLM application for accurate question answering on call transcriptions.
  • Developed services and applications involving autonomous multi-agent AI systems.

Major Projects

2Projects

LLM Application for Question Answering

Dec, 2022 - Present2 yr 6 months
    Built an LLM application using the RAG approach utilizing a model from Azure OpenAI, for accurate question answering on call transcriptions.

Multilingual Speech Recognition

Sep, 2020 - Dec, 20222 yr 3 months
    Delivered Automatic Speech Recognition on 18 Indian Languages with error rates better than major cloud speech-to-text services.

Education

  • MS in ML & AI

    Liverpool John Moores University (Online)
  • B.Tech in Computer Science & Engineering

    DIT University, Dehradun (2020)

Certifications

  • Data science professional certificate | ibm

  • Data science for engineers | iit madras