Venkatesan N

With 12 years of total professional experience, including 7 years in Data Science, I specialize in Natural Language Processing (NLP). My proficiency with NLP libraries and platforms such as NLTK, spaCy, OpenAI, and Hugging Face Transformers allows me to rapidly engineer and optimize models for production-grade deployment. My agile development skills, MLflow expertise, and AWS proficiency support reliable model management and strong performance. I use techniques such as Transformer architecture redesign, data augmentation, and hyperparameter optimization to improve results. My strengths lie in developing LLMs for specific languages and domains, creating custom tokenization, and implementing Named Entity Recognition (NER) within NLP pipelines. My ultimate goal is to provide innovative NLP solutions that have a tangible impact and keep pace with a constantly evolving field.
  • Role

    NLP Researcher (PhD)

  • Years of Experience

    12 years

Skillsets

  • Natural Language Processing - 7 Years

Professional Summary

12 Years
  • Feb, 2021 - Present (4 yr 4 months)

    NLP Researcher (PhD)

    PSG College of Technology
  • Feb, 2021 - Present (4 yr 4 months)

    Data Science Consultant (Freelancer)

    ProAct AI Consulting LLP
  • Aug, 2019 - Dec, 2020 (1 yr 4 months)

    Deep Learning Researcher

    KCT
  • Jun, 2011 - Jun, 2014 (3 yr)

    Technology Instructor

    KGiSL
  • Jun, 2014 - Jun, 2017 (3 yr)

    Research Associate

    SKNSITS
  • Jul, 2017 - May, 2019 (1 yr 10 months)

    Deep Learning Researcher

    MAEER's MIT

Applications & Tools Known

  • MLflow

  • AWS

  • Hugging Face Transformers

  • Atlassian JIRA

  • Azure Synapse Analytics

  • NLP

Work History

12 Years

NLP Researcher (PhD)

PSG College of Technology
Feb, 2021 - Present (4 yr 4 months)
    Engineered and deployed an information retrieval (IR) system combining TF-IDF, BM25, and language modeling to improve e-commerce catalog searchability. Created a named entity recognition (NER) model using spaCy to identify medication names in the ICD-10 dataset. Trained a Llama model on the NER-processed ICD-10 dataset to predict disease diagnoses from prescriptions. With NER integrated, the Llama model outperformed the other methods evaluated on a held-out test set, demonstrating the potential of LLMs for this task.

Data Science Consultant (Freelancer)

ProAct AI Consulting LLP
Feb, 2021 - Present (4 yr 4 months)
    Created a ready-to-use text corpus for low-resource Dravidian languages to support research on deep learning models. Worked with the AI development team to transfer research work into production. Taught graduate courses on High-Performance Computing, Software Modelling and Design, and Machine Learning, using practical examples throughout.

Deep Learning Researcher

KCT
Aug, 2019 - Dec, 2020 (1 yr 4 months)
    Developed ambiguity resolution techniques for natural language processing (NLP) that improve accuracy by more than 2% in applications such as sentiment analysis and question answering. Enhanced low-resource language data through augmentation, customized LSTM/RNN models, and fine-tuning, yielding performance gains of 5% or more, depending on the dataset.

Deep Learning Researcher

MAEER's MIT
Jul, 2017 - May, 2019 (1 yr 10 months)
    Developed ambiguity resolution techniques for natural language processing (NLP) that improve accuracy by more than 2% in applications such as sentiment analysis and question answering. Enhanced low-resource language data through augmentation, customized LSTM/RNN models, and fine-tuning, yielding performance gains of 5% or more, depending on the dataset.

Research Associate

SKNSITS
Jun, 2014 - Jun, 2017 (3 yr)
    Contributed to research on the adaptability of deep learning algorithms in self-driving cars and developed techniques for lane-changing algorithms. Focused on perception, object recognition, and decision-making for autonomous vehicles.

Technology Instructor

KGiSL
Jun, 2011 - Jun, 2014 (3 yr)
    Taught Java Programming and Oracle Database Development as a Technology Instructor. Provided technical expertise and support for the implementation and maintenance of the eCampus Management System, collaborating with stakeholders and the development team.

Achievements

  • International Publications: 10+
  • Patent Publications: 1

Education

  • Ph.D.

    Anna University
  • M.E.

    Anna University (2011)

Certifications

  • Deep learning

  • Natural language processing

  • Machine learning