Ahmed Fakhry

Data scientist and AI researcher with over four years of experience in leading machine learning and deep learning projects, creating generative and predictive models, and building high-performing teams. Demonstrated proficiency in problem-solving, business process optimization, and developing automated AI agents. Capable of working with stakeholders and cross-functional teams to improve data integrity and model performance in both academic and commercial contexts using data-driven insights.

Role
AI Research Scientist
Years of Experience
6.42 years

Skillsets

C++
SciPy
Scikit-learn
pandas
MongoDB
Matplotlib
Linux
Jupyter Notebook
GitLab
Git
C
TensorFlow
SQLite
PyTorch
Python
OpenCV
MySQL
Keras
Java

Professional Summary

6.42Years

Aug, 2023 - Present2 yr 9 months
AI Research Scientist
Kyungpook National University
Nov, 2022 - Aug, 2023 9 months
Senior Data Scientist
Turing
Jul, 2021 - Nov, 20221 yr 4 months
Senior Data Scientist
Linnaea
Sep, 2020 - Feb, 2021 5 months
Machine Learning Engineer
Virufy
Dec, 2020 - Dec, 20211 yr
Machine Learning Assistant
University of Bergen
Mar, 2021 - Jun, 2021 3 months
Machine Learning Team Lead
Virufy

Applications & Tools Known

Git
GitLab
Pandas
SciPy
Scikit-Learn
Matplotlib
Jupyter Notebook

Work History

6.42Years

AI Research Scientist

Kyungpook National University

Aug, 2023 - Present2 yr 9 months

Leveraging large language models for vision applications such as image captioning and visual question answering for surveillance applications. Developed a highly effective video anomaly detection system utilizing few-shot and meta-learning techniques, achieving an AUC of 86.0% and demonstrating robust generalization to previously unseen anomalous actions. Collaborate on multiple computer vision projects with Korean research institutes and private companies to develop innovative technologies and optimize existing surveillance systems. Analyze large datasets using statistical methods to derive insights, identify patterns, and validate models with a focus on raising efficiency, accuracy, and scalability.

Senior Data Scientist

Turing

Nov, 2022 - Aug, 2023 9 months

Ensured high-quality outputs and results by working closely with approx. 10 AI trainers and 5 LLM prompt engineers to improve the model resilience against hallucinations and factual inaccuracies. Contributed to a data science team focused on optimizing large language models like ChatGPT 3.5 by applying Reinforcement Learning from Human Feedback (RLHF) to human prompts and review data, enhancing the quality of user interactions. Partnered with several large AI companies, leveraging collaborative efforts to integrate advanced technologies and drive innovation within the team's projects.

Senior Data Scientist

Linnaea

Jul, 2021 - Nov, 20221 yr 4 months

Created a deep learning model specifically designed to digitize handwritten documents in English and Arabic by leading approx. 5 Data Scientists, enhancing text recognition capabilities. Refined the model's ability to recognize and digitize text accurately by training the model on thousands of handwritten documents, achieving a remarkable CER of 0.24 for English and 0.33 for Arabic. Built and supervised an in-house Annotation team to annotate handwritten text in both languages and provide high-quality training data for supervised learning. Surpassed industry benchmarks and outperformed the Google OCR tool by fine-tuning the models performance in digitizing handwritten documents. Drove B2B engagements in the medical industry across Egypt and Saudi Arabia while engaging with key stakeholders from insurance and hospital companies to foster strategic partnerships and expand business opportunities.

Machine Learning Team Lead

Virufy

Mar, 2021 - Jun, 2021 3 months

Managed 4 Machine Learning Engineers to formulate an AI-based method for precise prediction of COVID-19 infection rates with a ROC-AUC of 77.1%, facilitating data-driven decision-making. Rendered strategic direction for the creation of a state-of-the-art deep learning model for detecting COVID-19 on the COUGHVID dataset while presenting the results through a research paper at the Interspeech Conference 2021. Acquired clean and standardized clinical datasets to progress project research and reach substantial conclusions by negotiating agreements with hospitals. Enabled optimal integration and functionality by overseeing all aspects of deploying machine learning models in production, including making decisions regarding performance, fine-tuning based on clinical datasets, and testing APIs.

Machine Learning Assistant

University of Bergen

Dec, 2020 - Dec, 20211 yr

Participated in a research project aimed at establishing an advanced deep learning model for detecting gene-gene interactions from gene expression data, contributing to advancements in genetic research methodologies. Secured an exceptional AUROC score of 0.834 on the combined BioGRID and DREAM5 dataset, enabling the model to achieve a current ranking as the best in performance.

Machine Learning Engineer

Virufy

Sep, 2020 - Feb, 2021 5 months

Created a multi-branch deep learning network, integrating heterogeneous features from patients, such as human cough sounds and metadata (age/gender), to detect respiratory diseases, including common cold and COVID-19. Fulfilled real-time requirements by adapting machine learning and neural network algorithms and architectures for optimal performance in dynamic environments.

Achievements

Developed a highly effective video anomaly detection system utilizing few-shot and meta-learning techniques, achieving an AUC of 86.0% and demonstrating robust generalization to previously unseen anomalous actions.
Ensured high-quality outputs and results by working closely with approx. 10 AI trainers and 5 LLM prompt engineers to improve the model resilience against hallucinations and factual inaccuracies.
Refined the model's ability to recognize and digitize text accurately by training the model on thousands of handwritten documents, achieving a remarkable CER of 0.24 for English and 0.33 for Arabic.
Secured an exceptional AUROC score of 0.834 on the combined BioGRID and DREAM5 dataset, enabling the model to achieve a current ranking as the best in performance.

Major Projects

3Projects

GENER: A Parallel Layer Deep Learning Network To Detect Gene-Gene Interactions From Gene Expression Data

Fakhry, A., Khafagy, R. and Ludl, A.-A. (2023) GENER: A Parallel Layer Deep Learning Network To Detect Gene-Gene Interactions From Gene Expression Data, arXiv.org.

A Multi-Branch Deep Learning Network for Automated Detection of COVID-19

Fakhry, A., Jiang, X., Xiao, J., Chaudhari, G., Han, A. (2021) A Multi-Branch Deep Learning Network for Automated Detection of COVID-19. Proc. Interspeech 2021, 4139-4143.

Virufy: Global Applicability of Crowdsourced And Clinical Datasets for AI Detection of COVID-19 from Cough

Chaudhari, G. et al. (2021) Virufy: Global Applicability of Crowdsourced And Clinical Datasets for AI Detection of COVID-19 from Cough, arXiv.org.

Education

Master of Science in Computer & Information Systems
Liverpool John Moores University
Bachelor of Science in Electronics & Communication Engineering
Alexandria University (2020)

Ahmed Fakhry

AI Research Scientist

6.42 years

Skillsets

Professional Summary

Applications & Tools Known

Work History

AI Research Scientist

Senior Data Scientist

Senior Data Scientist

Machine Learning Team Lead

Machine Learning Assistant

Machine Learning Engineer

Achievements

Major Projects

GENER: A Parallel Layer Deep Learning Network To Detect Gene-Gene Interactions From Gene Expression Data

A Multi-Branch Deep Learning Network for Automated Detection of COVID-19

Virufy: Global Applicability of Crowdsourced And Clinical Datasets for AI Detection of COVID-19 from Cough

Education

Master of Science in Computer & Information Systems

Bachelor of Science in Electronics & Communication Engineering