profile-pic
Vetted Talent

Sai Vignan Malyala

Vetted Talent

Principal Data Scientist/ Head of AI / Mentor with vast experience in building AI use-cases from scratch and deploying them to production. Amazing experience in GenAi, LLm fine-tuning , RAG, vector databases, NLP, Machine learning, Deep learning, transfer learning, working with LLM, MLOPS, deployment using aws, airflow, data bricks, pyspark, pipelining, containerization. Effective and proactive communicator with experience in leading teams and projects. Expertise in Computer Vision for OCR related information extraction from images, pdf parser, XML parser, box detection, entity

detection and recognition, Data Mining, Data tagging, Data Analysis, Feature Selection & Model Selection, Model Building, Model Validation, Model threshold validation, log analysis.

  • Role

    Head LLM Engineer - Gen AI Architect

  • Years of Experience

    10.4 years

  • Professional Portfolio

    View here

Skillsets

  • LLM Fine-tuning
  • vector search
  • text generation
  • Statistics & probability
  • semantic search
  • Regression Analysis
  • Recommendation Systems
  • Neural Networks
  • Multi-Agent Systems
  • MLOps
  • Machine Learning
  • Deep Learning - 7 Years
  • Information Extraction
  • Generative AI
  • Feature Engineering
  • Data Science
  • data augmentation
  • Data Analysis
  • Active Learning
  • Natural Language Processing - 9 Years
  • Prompt Engineering - 2 Years
  • Transfer Learning

Vetted For

18Skills
  • Roles & Skills
  • Results
  • Details
  • icon-skill_image
    Senior Generative AI EngineerAI Screening
  • 59%
    icon-arrow-down
  • Skills assessed :BERT, Collaboration, Data Engineering, Excellent Communication, GNN, GPT-2, graphs, Large Language Models, Natural Language Processing, Sagemaker, Deep Learning, neural network architectures, PyTorch, TensorFlow, machine_learning, Problem Solving Attitude, Python, Vertex AI
  • Score: 59/100

Professional Summary

10.4Years
  • Head LLM Engineer - Gen AI Architect

    Fortune 50 Pharma Company
  • Jan, 2024 - Nov, 2024 10 months

    Head of AI - NLP/GenAI

    XA
  • Sep, 2023 - Feb, 2024 5 months

    Principal AI Consultant & Advisory

    OrthoQuant
  • Oct, 2018 - Aug, 20223 yr 10 months

    Principal Data Scientist Head of Data Science

    Oorwin Labs
  • Aug, 2022 - Feb, 2023 6 months

    Senior Applied AI Engineer

    Work Fusion
  • May, 2023 - Jan, 2024 8 months

    Lead Data Science

    The Weather Channel
  • Consultant AI Lead

    MezmerMedia

Applications & Tools Known

  • icon-tool

    Python

  • icon-tool

    AWS (Amazon Web Services)

  • icon-tool

    ML

  • icon-tool

    NLP

  • icon-tool

    OCR

  • icon-tool

    Deep Learning

  • icon-tool

    Business Analytics

  • icon-tool

    DevOps

  • icon-tool

    Computer Vision

  • icon-tool

    Artificial Intelligence

  • icon-tool

    Data Visualization

  • icon-tool

    Generative AI

  • icon-tool

    Docker

  • icon-tool

    Azure

  • icon-tool

    AWS

  • icon-tool

    Tableau

  • icon-tool

    GraphQL

  • icon-tool

    Flask

  • icon-tool

    Gunicorn

  • icon-tool

    Solr

  • icon-tool

    Scrapy

  • icon-tool

    GCP

  • icon-tool

    Airflow

  • icon-tool

    MLFlow

  • icon-tool

    Haystack

  • icon-tool

    LangChain

  • icon-tool

    weaviate

  • icon-tool

    GPU

Work History

10.4Years

Head LLM Engineer - Gen AI Architect

Fortune 50 Pharma Company
    Designed and deployed a multi-agent Generative AI system to enhance clinical trial inspection processes by aggregating structured and unstructured data sources for areas like protocol deviations and adverse events. Integrated parallel generation, semantic caching, agent memory, and human-in-the-loop mechanisms to improve system performance. Conducted regular feedback sessions with client stakeholders to gather insights on usability and emerging use cases. Categorized feedback into key areas (usability, performance, training needs) for structured analysis. Implemented a continuous improvement loop, refining AI models and processes based on categorized feedback, issue prioritization, and regular progress updates. Delivered structured reports on feedback trends, system adjustments, and key recommendations for performance optimization. Coordinated cross-functional collaboration across stakeholders and technical teams to ensure tool refinements aligned with business goals.

Head of AI - NLP/GenAI

XA
Jan, 2024 - Nov, 2024 10 months
    Built AI and Gen AI products effective for massive real time usage. Used RAG, LLMs, AI agents, langchain, langgraph, custom retrievers, fine-tuning of smaller LLMs, evaluation of results, Azure for deployment. Engaged stakeholders through feedback sessions, encouraging participation in feedback mechanisms like in-app feedback surveys and regular check-ins. Compiled a comprehensive use case inventory, associating user feedback with potential improvements and scaling opportunities. Led feedback analysis sessions to align stakeholders on system challenges and implemented an iterative response loop for refinements.

Principal AI Consultant & Advisory

OrthoQuant
Sep, 2023 - Feb, 2024 5 months
    Led a team of 4 AI developers. Fine-tuned multiple LLMs on scale with multi-GPUs (minimum 2 nodes, each node has 8 GPUs, each GPU is A100 40GB RAM). Guided for Gen AI applications such as Custom LLM model finetuning, RAG generation, Extractive QA search, and Semantic Search applications. Facilitated regular progress reviews, presenting structured insights and improvement suggestions based on qualitative feedback analysis from internal users.

Lead Data Science

The Weather Channel
May, 2023 - Jan, 2024 8 months
    Identified First party audience on the platform based on different health predictions and classified users based on Social determinants for web, android & IOS using app usage & behavioural data. Worked on Large scale data modelling of around 400 features and 600M records. Worked with Pyspark, EMR, Sagemaker pipelines, python, Aws Athena, gin configs, etc. Predicted users on Home owners, breast cancer, business travelers, Psoriasis, Asthma based on app usage data (finding soft labels based on data distribution). Led the team of 3 working on SDoh use-cases. Applied Generative AI on Weather article checks with multiple data points. Built a Vector search platform for stakeholders that ingests data, finds right videos and articles based on search inputs and maps it to right tags.

Senior Applied AI Engineer

Work Fusion
Aug, 2022 - Feb, 2023 6 months
    Implemented Table Detection and Table Structure Detection on banking documents using CascadeTabNet, table transformer and LayoutLM. Developed cost-saving solutions for native PDF extraction and OCR models. Achieved 94% accuracy comparable to premium API solutions.

Principal Data Scientist Head of Data Science

Oorwin Labs
Oct, 2018 - Aug, 20223 yr 10 months
    Led development for Resume and JD parsers achieving 84% F1-score processes. Implemented custom scraping engines, chatbots, and data analytics use cases. Built actionable NLP-based recruitment tools, reducing operational API costs. Delivered scalable products capable of handling high throughput with low response latency.

Consultant AI Lead

MezmerMedia
    Developed domain-based solutions for article generation in Sports & Betting. Built products from scratch including an LLM chatbot for recruiters and an advanced candidate validation tool utilizing Generative AI. Delivered article generation capabilities for media companies using LLMs. Conducted use-case analysis and domain scenarios before project implementation.

Achievements

  • Oorwin Awards - Awarded for building NLP Parsers effectively in short span with atmost metrics that helped company reduce heavy costs
  • Data Science Mentor & Industry Tutor Mentored & Taught around batches (9 month courses) on weekends in Upgrad & Great Learning out of interest in teaching subject rightly AI advisory Worked as AI advisory for startups Spiritualist Have great interest in philiosphy and service for higher purpose.
  • Promoted to Head of Data Science for building AI platform, research & strategy
  • Going above & beyond award National Talent Search Exam - NTSE Achieved State wide 3rd rank in NTSE Exam and qualified Nationals
  • Awarded for building NLP Parsers effectively in short time
  • Promoted to Head of Data Science for building AI platform
  • State wide 3rd rank in National Talent Search Exam (NTSE)

Major Projects

9Projects

Head LLM Engineer - Gen AI Architect

    Designed and deployed a multi-agent Generative AI system to enhance clinical trial inspection processes by aggregating structured and unstructured data sources for areas like protocol deviations and adverse events. Integrated parallel generation, semantic caching, agent memory, and human-in-the-loop mechanisms to improve system performance. Conducted regular feedback sessions, categorized feedback into key areas, and implemented a continuous improvement loop.

Signitives Consultant AI Lead

    Built AI-powered applications from scratch, including recruiter chatbots, article-generation tools for the sports media domain, and RAG-integrated domain-based products. Fine-tuned custom LLMs for multiple use cases such as SQL query generation and customer support.

Head of AI - NLP/GenAI

Jan, 2024 - Nov, 2024 10 months
    Led the development of AI and Gen AI products for real-time usage in the automobile sector, including chatbots and repair cost estimation tools. Utilized RAG, custom retrievers, fine-tuned LLMs, and multilingual information extraction. Engaged stakeholders through feedback sessions and conducted iterative system refinements.

Principal AI Consultant & Advisory

Sep, 2023 - Feb, 2024 5 months
    Fine-tuned multiple LLMs for domain-based applications, including RAG generation, extractive QA search, and semantic search for a funded AI startup. Facilitated team building and data migration pipelines while leading progress reviews and feedback analysis sessions.

Lead Data Science (Contract)

May, 2023 - Jan, 2024 8 months
    Developed health prediction models and classified first-party audiences based on social determinants for the Weather Channel USA. Worked on large-scale data modeling with hundreds of features and millions of records. Built a vector search platform, validated embedding models, and deployed RAG-based generative AI tools for article checks.

Senior Applied AI Engineer

Aug, 2022 - Feb, 2023 6 months
    Implemented table detection systems on banking documents using advanced models such as CascadeTabNet and LayoutLM. Developed native PDF extraction pipelines and active learning workflows to reduce dependency on third-party services.

Principal Data Scientist - Head of Data Science

Oct, 2018 - Aug, 20223 yr 10 months
    Created AI-based resume and JD parsers with high accuracy and scalability, served multiple daily users with rapid processing times. Developed advanced analytics tools such as pre-screening chatbots and scraping engines for recruitment data.

NLP Scientist

Feb, 2018 - Oct, 2018 8 months
    Worked on NLP chatbots, building robust frameworks and pipelines for multiple domains such as banking and telecom. Implemented query understanding models and developed visualization systems for finance-related data.

Senior Software Engineer - Data Scientist

Jun, 2015 - Feb, 20182 yr 8 months
    Delivered data-driven projects, including candidate rankings and clustering analyses for telematic and telecom data. Conducted sentiment analysis and predictive modeling for various client outcomes using machine learning frameworks.

Education

  • MBA/PGDM (Part-time)

    Aegis School of Data Science (2017)
  • B Tech/ BE: Engineering

    SRM University (2015)

Certifications

  • Courses & Certifications

  • Product management from institute of product leadership (ipl)

  • Blockchain - advanced distributed ledger technology from iiit hyderabad

Interests

  • Watching Movies
  • Driving
  • Games