profile-pic

Ahmed Fakhry

Data scientist and AI researcher with over four years of experience in leading machine learning and deep learning projects, creating generative and predictive models, and building high-performing teams. Demonstrated proficiency in problem-solving, business process optimization, and developing automated AI agents. Capable of working with stakeholders and cross-functional teams to improve data integrity and model performance in both academic and commercial contexts using data-driven insights.

  • Role

    AI Research Scientist

  • Years of Experience

    6.42 years

Skillsets

  • C++
  • SciPy
  • Scikit-learn
  • pandas
  • MongoDB
  • Matplotlib
  • Linux
  • Jupyter Notebook
  • GitLab
  • Git
  • C
  • TensorFlow
  • SQLite
  • PyTorch
  • Python
  • OpenCV
  • MySQL
  • Keras
  • Java

Professional Summary

6.42Years
  • Aug, 2023 - Present2 yr 9 months

    AI Research Scientist

    Kyungpook National University
  • Nov, 2022 - Aug, 2023 9 months

    Senior Data Scientist

    Turing
  • Jul, 2021 - Nov, 20221 yr 4 months

    Senior Data Scientist

    Linnaea
  • Sep, 2020 - Feb, 2021 5 months

    Machine Learning Engineer

    Virufy
  • Dec, 2020 - Dec, 20211 yr

    Machine Learning Assistant

    University of Bergen
  • Mar, 2021 - Jun, 2021 3 months

    Machine Learning Team Lead

    Virufy

Applications & Tools Known

  • icon-tool

    Git

  • icon-tool

    GitLab

  • icon-tool

    Pandas

  • icon-tool

    SciPy

  • icon-tool

    Scikit-Learn

  • icon-tool

    Matplotlib

  • icon-tool

    Jupyter Notebook

Work History

6.42Years

AI Research Scientist

Kyungpook National University
Aug, 2023 - Present2 yr 9 months
    Leveraging large language models for vision applications such as image captioning and visual question answering for surveillance applications. Developed a highly effective video anomaly detection system utilizing few-shot and meta-learning techniques, achieving an AUC of 86.0% and demonstrating robust generalization to previously unseen anomalous actions. Collaborate on multiple computer vision projects with Korean research institutes and private companies to develop innovative technologies and optimize existing surveillance systems. Analyze large datasets using statistical methods to derive insights, identify patterns, and validate models with a focus on raising efficiency, accuracy, and scalability.

Senior Data Scientist

Turing
Nov, 2022 - Aug, 2023 9 months
    Ensured high-quality outputs and results by working closely with approx. 10 AI trainers and 5 LLM prompt engineers to improve the model resilience against hallucinations and factual inaccuracies. Contributed to a data science team focused on optimizing large language models like ChatGPT 3.5 by applying Reinforcement Learning from Human Feedback (RLHF) to human prompts and review data, enhancing the quality of user interactions. Partnered with several large AI companies, leveraging collaborative efforts to integrate advanced technologies and drive innovation within the team's projects.

Senior Data Scientist

Linnaea
Jul, 2021 - Nov, 20221 yr 4 months
    Created a deep learning model specifically designed to digitize handwritten documents in English and Arabic by leading approx. 5 Data Scientists, enhancing text recognition capabilities. Refined the model's ability to recognize and digitize text accurately by training the model on thousands of handwritten documents, achieving a remarkable CER of 0.24 for English and 0.33 for Arabic. Built and supervised an in-house Annotation team to annotate handwritten text in both languages and provide high-quality training data for supervised learning. Surpassed industry benchmarks and outperformed the Google OCR tool by fine-tuning the models performance in digitizing handwritten documents. Drove B2B engagements in the medical industry across Egypt and Saudi Arabia while engaging with key stakeholders from insurance and hospital companies to foster strategic partnerships and expand business opportunities.

Machine Learning Team Lead

Virufy
Mar, 2021 - Jun, 2021 3 months
    Managed 4 Machine Learning Engineers to formulate an AI-based method for precise prediction of COVID-19 infection rates with a ROC-AUC of 77.1%, facilitating data-driven decision-making. Rendered strategic direction for the creation of a state-of-the-art deep learning model for detecting COVID-19 on the COUGHVID dataset while presenting the results through a research paper at the Interspeech Conference 2021. Acquired clean and standardized clinical datasets to progress project research and reach substantial conclusions by negotiating agreements with hospitals. Enabled optimal integration and functionality by overseeing all aspects of deploying machine learning models in production, including making decisions regarding performance, fine-tuning based on clinical datasets, and testing APIs.

Machine Learning Assistant

University of Bergen
Dec, 2020 - Dec, 20211 yr
    Participated in a research project aimed at establishing an advanced deep learning model for detecting gene-gene interactions from gene expression data, contributing to advancements in genetic research methodologies. Secured an exceptional AUROC score of 0.834 on the combined BioGRID and DREAM5 dataset, enabling the model to achieve a current ranking as the best in performance.

Machine Learning Engineer

Virufy
Sep, 2020 - Feb, 2021 5 months
    Created a multi-branch deep learning network, integrating heterogeneous features from patients, such as human cough sounds and metadata (age/gender), to detect respiratory diseases, including common cold and COVID-19. Fulfilled real-time requirements by adapting machine learning and neural network algorithms and architectures for optimal performance in dynamic environments.

Achievements

  • Developed a highly effective video anomaly detection system utilizing few-shot and meta-learning techniques, achieving an AUC of 86.0% and demonstrating robust generalization to previously unseen anomalous actions.
  • Ensured high-quality outputs and results by working closely with approx. 10 AI trainers and 5 LLM prompt engineers to improve the model resilience against hallucinations and factual inaccuracies.
  • Refined the model's ability to recognize and digitize text accurately by training the model on thousands of handwritten documents, achieving a remarkable CER of 0.24 for English and 0.33 for Arabic.
  • Secured an exceptional AUROC score of 0.834 on the combined BioGRID and DREAM5 dataset, enabling the model to achieve a current ranking as the best in performance.

Major Projects

3Projects

GENER: A Parallel Layer Deep Learning Network To Detect Gene-Gene Interactions From Gene Expression Data

    Fakhry, A., Khafagy, R. and Ludl, A.-A. (2023) GENER: A Parallel Layer Deep Learning Network To Detect Gene-Gene Interactions From Gene Expression Data, arXiv.org.

A Multi-Branch Deep Learning Network for Automated Detection of COVID-19

    Fakhry, A., Jiang, X., Xiao, J., Chaudhari, G., Han, A. (2021) A Multi-Branch Deep Learning Network for Automated Detection of COVID-19. Proc. Interspeech 2021, 4139-4143.

Virufy: Global Applicability of Crowdsourced And Clinical Datasets for AI Detection of COVID-19 from Cough

    Chaudhari, G. et al. (2021) Virufy: Global Applicability of Crowdsourced And Clinical Datasets for AI Detection of COVID-19 from Cough, arXiv.org.

Education

  • Master of Science in Computer & Information Systems

    Liverpool John Moores University
  • Bachelor of Science in Electronics & Communication Engineering

    Alexandria University (2020)