Leading an R&D team of AI/ML engineers in architecting and deploying CRAVE (Cardiac Rhythm AI Visualization Engine), a production-grade, device-agnostic ECG arrhythmia detection platform processing 432,000+ beats from multi-day Holter recordings and achieving 92%+ real-world clinical accuracy on unseen patient datasets. Engineering a 1D pre-activation ResNet rhythm model (16 residual blocks, global temporal pooling), delivering 97% accuracy in Normal vs. classification on 10-second ECG segments. Designing and implementing BeatNet, a hybrid deep learning architecture combining multi-scale convolutions, Gaussian region pooling, and lightweight MLP layers, achieving 94.5% accuracy across N/S/V/F beat classes with adaptive rhythm orchestration logic. Orchestrating a comprehensive 6-stage ML pipeline integrating ConvBiLSTM-based R-peak detection, beat classification, PQST segmentation, arrhythmia sub-classification, and morphology-based clustering achieving >85% concordance with expert clinical interpretations. Developing an advanced ECG denoising framework leveraging NeuroKit2, PyWavelets, EMD, SSA, morphology-driven Wiener filtering, dynamic Kalman filtering, and DTW-based alignment, enhancing P-QRS-T wave delineation and diagnostic accuracy. Building a device-agnostic preprocessing framework using MNE for EDF parsing across multiple manufacturers, with automated resampling, sliding-window analysis, gap detection, and daily batch processing for 8,600+ ECG segments per day. Resolving critical memory and performance constraints by reducing RAM consumption from 15 GB to <3 GB and cutting processing time for 2-day recordings to under 20 minutes through adaptive caching, cleanup strategies, and dynamic batch scaling. Optimizing cloud deployment workflows by reducing Docker image size from 4 GB to 1.8 GB, applying multi-stage builds, layer caching, and PyTorch cleanup, while integrating real-time performance monitoring for GPU, VRAM, CPU, and latency. Architecting a fault-tolerant AWS infrastructure using Step Functions, AWS Batch, EKS, ECR, S3, and CloudWatch, ensuring parallel containerized execution, automatic recovery, and continuous observability in large-scale clinical data processing.