profile-pic

Parth Patel

As a highly skilled AI developer, I have spearheaded the development of multiple Generative AI (Gen AI) use cases for large enterprise clients, showcasing my ability to innovate and create solutions that exceed expectations. With a master’s degree from IIT Bombay and over 4 years of experience at IBM, I possess the technical expertise and industry knowledge to deliver exceptional results. My excellent communication and problem-solving skills enable me to work independently or as part of a team, making me a valuable asset to any organization looking to enhance its Language Modelling capabilities

  • Role

    AI Data Scientist

  • Years of Experience

    6.3 years

Skillsets

  • Microservices
  • Meta segment-anything
  • Context-aware extraction
  • vision-language models
  • Vector stores
  • transformer architecture
  • synthetic data generation
  • session management
  • Rest APIs
  • Real-time Processing
  • RAGAS
  • QLoRA
  • OCR
  • Multimodal document processing
  • model serving
  • Model Fine-tuning
  • Kubernetes
  • MCP
  • LoRA
  • LLMs
  • LangGraph
  • Hybrid search
  • GraphRAG
  • GPU
  • event-driven architecture
  • Context engineering
  • Azure
  • AI observability
  • Agentic AI
  • Generative AI
  • prompt chaining
  • LLMOps

Professional Summary

6.3Years
  • Jul, 2019 - Present6 yr 8 months

    Senior Generative AI Data Scientist

    IBM India

Applications & Tools Known

  • icon-tool

    Azure OpenAI

  • icon-tool

    Azure ML

  • icon-tool

    Streamlit

  • icon-tool

    Docker

  • icon-tool

    Kubernetes

  • icon-tool

    PyTorch

  • icon-tool

    Keras

  • icon-tool

    Pandas

  • icon-tool

    Scikit-Learn

  • icon-tool

    SciPy

  • icon-tool

    OpenCV

  • icon-tool

    NLTK

  • icon-tool

    FAISS

Work History

6.3Years

Senior Generative AI Data Scientist

IBM India
Jul, 2019 - Present6 yr 8 months
    Architected enterprise VLM system combining Azure Document Intelligence with GPT-4 Vision, achieving 95%+ accuracy in legal document processing. Pioneered image-text reconciliation approach bridging traditional OCR limitations with AI contextual understanding. Built cloud-native architecture on Azure Container Apps with zero-shot learning for new document types without retraining. Architected comprehensive GenAI platform processing 1000+ legal documents with 97.56% accuracy. Built production-scale LLM infrastructure on Kubernetes achieving 68-88% cost reduction. Engineered 14 specialized transformer models (7 NER + 7 REL) with 95-98% precision. Designed intelligent document processing system with OCR integration and 20+ category classification. Achieved 60%+ effort reduction in contract analysis through innovative prompt chaining architecture. Led benchmarking study resulting in Instructor-XL model selection with 12.8% improvement over baseline. Developed multi-modal classification incorporating visual and textual document attributes. Built GPT-4 VLM RAG pipeline achieving 90.3% weighted accuracy and 94.0% completeness. Implemented Meta's Segment-Anything Model resulting in 20% increase in text completeness. Pioneered segment reduction algorithm scaling from 100+ to <30 segments using MECE principle.

Achievements

  • Client and Partner Success Award - 2023
  • Manager Appreciation with BluePoints - 2023
  • IBMer Appreciation - 2023
  • Manager Appreciation with BluePoints - 2022
  • Manager Empowerment Fund - 2021
  • Managers Choice Award - 2019

Major Projects

2Projects

Contract Associate Toolkit

Jan, 2022 - Dec, 20231 yr 11 months
    Built and trained custom entity and relationship recognition pipeline by fine-tuning a pre-trained English RoBERTa-base transformer to achieve average 86.5% macro F1-Score (15% increase) as compared to IBM Watson Discovery.

Leveraging Alignment and Phonology for low-resource Indic to English Neural Machine Transliteration

Jul, 2019 - Dec, 20201 yr 5 months
    Modeled the impact of various segmentation techniques vis-a-vis alignment & attention mechanisms, leading to the development of an innovative Orthographic Syllable Segmentation technique. Showcased an average improvement of 26.58% over baseline models with our technique using an Encoder-Decoder model for the task of transliteration between 10 Indic and English language pairs.

Education

  • Master of Technology - Computer Science and Engineering

    Indian Institute of Technology Bombay (2019)
  • Bachelor of Technology - Computer Engineering

    Charotar University of Science and Technology (2016)

Certifications

  • Microsoft certified trainer 2023-2024

  • Microsoft certified: azure ai engineer associate

  • Microsoft certified: azure ai fundamentals

  • Financial markets industry jumpstart - bronze

  • Ibm certified advocate - cloud v1

  • Microsoft certified: azure data fundamentals

  • Financial markets industry jumpstart - silver