Principal Engineer
Netradyne
Sep 2020 - Present (5 yr 6 months)
Own the end-to-end cloud platform architecture supporting 7,000+ AWS servers and 50+ production microservices, built entirely using Infrastructure as Code.
Lead and mentor a 20-member AWS and Platform Engineering team, driving operational excellence and reliability.
Architected and executed AWS landing zone migrations, SSO integrations, and infrastructure readiness for ISO and security certifications.
Executed an enterprise-scale AWS re-architecture: migrated from a single account to 20 sub-accounts, carried out a cross-region migration for cost savings, and standardised network connectivity via Transit Gateway.
Architected the Terraform-to-Terragrunt migration, including refactoring a monolithic Terraform state into multiple smaller, independently managed state files.
Designed and developed InfraBot, an AI-driven self-healing infrastructure platform in Python that performs automated incident analysis, root-cause detection, and safe remediation, reducing MTTR by ~65%.
Architected transient QA environments with a one-click Jenkins pipeline that provisions AWS infrastructure, configures services, and deploys applications end-to-end in ~45 minutes, down from several days.
Drove a 25% reduction in AWS costs through architectural optimization, right-sizing, and automation-driven governance.