profile-pic

Shahid Afridi

Highly skilled in machine learning, data analysis, and cyber security, with a goal to contribute to innovative projects and enhance the efficiency of processes through the use of advanced technology.
  • Role

    ML & Python Developer

  • Years of Experience

    4.75 years

Skillsets

  • Deepseek
  • pandas
  • Scikit-learn
  • Sentence transformers
  • SQL
  • Tableau
  • TensorFlow
  • Tkinter
  • Word embeddings
  • Myscale
  • Celery
  • Claude
  • NumPy
  • Excel
  • Gemini
  • GPT
  • Grok
  • LLAMA
  • PubSub
  • PyQt5
  • Rags
  • Time Series Analysis
  • vector embeddings
  • Python - 3.0 Years
  • NLP
  • Computer Vision
  • LLMs - 3.0 Years
  • OpenCV
  • EDA
  • voice cloning
  • Bert transformers
  • Data Analysis
  • Vector databases
  • voice cloning
  • LangChain - 3.0 Years
  • Data Analysis
  • BeautifulSoup
  • C
  • ChromaDB
  • Flask
  • Jupyter Notebook
  • Keras
  • Machine Learning
  • Matplotlib
  • Mediapipe
  • NLTK

Professional Summary

4.75Years
  • Sep, 2025 - Present 7 months

    Software Engineer

    Wollmilchsau GmbH
  • Apr, 2023 - Aug, 20252 yr 4 months

    Machine Learning Engineer

    GeeksforGeeks
  • Apr, 2022 - Jan, 2023 9 months

    Data Analyst

    Galytix
  • Dec, 2020 - Jan, 2021 1 month

    Trainee

    Oil and Natural Gas Corporation
  • Apr, 2021 - Dec, 2021 8 months

    Industrial Trainee

    Ducat Education
  • Jan, 2022 - Apr, 2022 3 months

    Associate Business Analyst

    Goldstone Technologies Limited
  • Apr, 2019 - May, 2019 1 month

    Student Volunteer

    Nasscom Foundation

Applications & Tools Known

  • icon-tool

    Flask

  • icon-tool

    Tableau

  • icon-tool

    Tableau Prep

  • icon-tool

    Jupyter Notebook

  • icon-tool

    Excel

  • icon-tool

    Tableau

  • icon-tool

    Jupyter Notebook

  • icon-tool

    PyQt5

  • icon-tool

    Excel

Work History

4.75Years

Software Engineer

Wollmilchsau GmbH
Sep, 2025 - Present 7 months

Machine Learning Engineer

GeeksforGeeks
Apr, 2023 - Aug, 20252 yr 4 months
    Setup the entire ML Backend on Flask while creating RESTful APIs. Engineered a RAG global search system on the GeeksForGeeks platform, facilitating searches on products through textual queries, while replacing Google's ElasticSearch and reducing costs. Introduced a premium AI Chatbot feature enabling dynamic question answering on any visited articles page, improving articles understanding with real-time doubt clearance using AI. Pioneered a system using AI technologies to transform textual scripts into short lecture videos, featuring a human avatar for narration and text highlighting. Streamlined article review processes by automating grammar checks, formatting assessments, and error identification, thus eliminating manual intervention and increasing efficiency. Engineered a feature to generate problem statements, examples, and editorial content for coding problems, employing Language Models, thus eliminating manual writing of the content and increasing efficiency. Developed a coding problem Recommendation System for DSA articles, streamlining the process by eliminating manual intervention in mapping relevant problems to articles. Integrated a premium feature for generating concise and specific summaries of articles, enhancing readability and user experience on the platform. Developed an automated proctoring system for contests to effectively detect and prevent cheating by participants. Integrated a queuing mechanism for chatbot API to improve the throughput using Google Pub/Sub and Celery.

Data Analyst

Galytix
Apr, 2022 - Jan, 2023 9 months

Associate Business Analyst

Goldstone Technologies Limited
Jan, 2022 - Apr, 2022 3 months
    Building visualizations, dashboards using Tableau. Data modelling with Tableau Prep. Data analysis using SQL and Tableau.

Industrial Trainee

Ducat Education
Apr, 2021 - Dec, 2021 8 months

Trainee

Oil and Natural Gas Corporation
Dec, 2020 - Jan, 2021 1 month
    Learning NLP while analysing oil drilling datasets from ONGC.

Student Volunteer

Nasscom Foundation
Apr, 2019 - May, 2019 1 month
    Helping people against the misinformation spread over internet.

Achievements

  • Setup the entire ML Backend on Flask
  • Engineered global search system on the GeeksForGeeks platform
  • Introduced premium AI Chatbot
  • Pioneered system transforming scripts into lecture videos
  • Streamlined article review processes
  • Developed coding problem Recommendation System
  • Integrated premium feature for article summaries
  • Social work volunteer at Nasscom Foundation
  • Sports Club member and football team member at UPES

Major Projects

12Projects

Global Search System

    A searching feature to get the most relevant object from GeeksForGeeks products such as Articles, Coding problems, Jobs, Courses. Utilised LangChain for similarity search with retrieval chains and generated embeddings using ChromaDB on sentence transformers. Created a Flask API for integration on GeeksForGeeks webpages.

Interactive AI Chatbot for articles

    LLM based Chatbot for question answering on any articles page. Generated embeddings of articles using sentence transformers and performed vector searches in a vectorDB. Used prompt engineering for custom prompts and LLM interaction.

Contest proctor

    An automated proctoring system to detect malpractices by contestants. Utilised Yolo v7 for object detection and LLMs to identify potential malpractice from video streams.

Script to Video Generation

    Generated videos featuring a human avatar narrating input text with text highlighting. Used Google Text-to-Speech, ElevenLabs API for voice cloning, Whisper for timestamps, Wav2Lip for lip syncing, and moviepy for text highlighting.

Content Review

    Implemented an automated article content review system with predefined checks. Used ChromaDB and sentence transformers for searching similar titles, Gramformer for grammar checking, and NLTK/textstat for difficulty analysis.

Problem Statement and Editorial Generation

    Generated problem statements, examples, and editorial content for coding problems using GPT-3.5 and prompt engineering for HTML formatting.

DSA Problems Recommender for Articles

    Developed a recommendation system to suggest problems for DSA articles using embeddings and cosine similarity between articles and problems.

Summary Generator

    Crafted concise summaries of articles using OpenAI and Google's LLMs with prompt engineering and LangChain for custom formatting.

AI Gym Trainer

    An automated workout exercise analyzer using mediapipe to compute angles and assess exercise precision.

AI Gesture Volume Controller

    Developed a device volume controller utilizing hand gestures, powered by mediapipe, OpenCV, and Numpy for gesture recognition and volume adjustment.

Financial Textual Classification

    Fine-tuned BERT-Transformers for classification of financial textual data from financial reports.

Face detection and face recognition

    Implemented face detection and recognition systems by fine-tuning YOLO and VGGNet using transfer learning for real-time detection from live feeds.

Education

  • B. Tech in CSE - Cyber Security and Forensics

    University of Petroleum and Energy Studies (2022)
  • Senior Secondary (CBSE)

    Montfort School, Roorkee (2017)
  • Secondary (CBSE)

    Montfort School, Roorkee (2015)
  • Google IT automation with python

    Coursera
  • Python with Machine Learning

    Ducat, Noida (2021)

Certifications

  • Python with machine learning

  • Google it automation with python