profile-pic

Amith RK

Seasoned Engineering Leader with experience of driving Cloud, Platform, and Site Reliability Engineering initiatives. Proven track record of leading high-performing teams, fostering innovation, and delivering cost-effective solutions. Skilled in Agile methodologies, DevOps practices, Site Reliability Engineering, and cloud optimization. Passionate about building scalable and secure cloud platforms that empower businesses.
  • Role

    Manager Site Reliability Engineering (Platform & Tooling)

  • Years of Experience

    16.3 years

Skillsets

  • automation
  • Site Reliability Engineering
  • runbooks
  • Platform Engineering
  • Jenkins
  • IAC
  • Helm
  • GitHub Actions
  • Compliance
  • Cloud Migration
  • Cloud Cost Optimization
  • Cloud
  • canary
  • Blue-Green Deployments
  • AWS
  • Alerting
  • Virtualization
  • Spinnaker
  • CI/CD
  • ArgoCD
  • Terraform
  • Policy as Code
  • Kubernetes
  • Docker
  • Monitoring
  • GCP
  • FinOps
  • Azure

Professional Summary

16.3Years
  • Nov, 2024 - Present1 yr 4 months

    Manager Site Reliability Engineering (Platform & Tooling)

    Okta
  • May, 2023 - Sep, 20241 yr 4 months

    Senior Software Engineering Manager - DevOps & Cloud Platform

    Boeing
  • Dec, 2021 - Feb, 20231 yr 2 months

    Site Reliability Engineering Manager

    VMware
  • Jun, 2018 - Nov, 20191 yr 5 months

    Hybrid Cloud Infrastructure Architect

    Novo Nordisk
  • Nov, 2019 - Aug, 2020 9 months

    Cloud Services Manager (DevOps)

    Technicolor
  • Aug, 2020 - Dec, 20211 yr 4 months

    Team Manager, Hosting & Cloud Solutions

    Eli Lilly and Company
  • Feb, 2012 - Jun, 20186 yr 4 months

    End User Computing Specialist

    VMware
  • Jan, 2010 - Feb, 20122 yr 1 month

    Client Technical Support Associate

    Dell

Applications & Tools Known

  • icon-tool

    AWS

  • icon-tool

    Azure

  • icon-tool

    GCP

  • icon-tool

    ESXi

  • icon-tool

    RSA

  • icon-tool

    Active Directory

Work History

16.3Years

Manager Site Reliability Engineering (Platform & Tooling)

Okta
Nov, 2024 - Present1 yr 4 months
    Oversee the SRE organization focused on the Okta Cloud platform, managing Infra Delivery, Cloud Tooling Automations and Pipelines (CTAP), K8s, and Observability. Built, mentored, and led a high-performing team of SREs and Software Engineers; partnering directly with recruiting to hire top-tier talent and fostering a culture of continuous learning and "Automation First" ethics. Spearheaded the architecture and strategic rollout of the Internal Developer Platform (IDP), adopting a "Platform-as-Product" mindset to reduce cognitive load and accelerate developer velocity across the engineering organization. Integrated CI/CD orchestration (Spinnaker, ArgoCD) and self-service IaC to streamline the path to production. Transforming core infrastructure stability by shifting from reactive to proactive SRE practices; architected a predictive, self-healing cloud platform that sustains 99.99% availability for critical production systems. Championed the transition from reactive toil to proactive, code-driven fleet management. Instituted Continuous Stability practices driven by AI forecasting, predicting service degradation 30 minutes in advance and slashing Mean Time To Resolution (MTTR) by 85% (120m to 18m). Directed cloud spend optimization and resource efficiency initiatives aligned with business metrics. Achieved $310K+ in annualized savings ($180K infrastructure + $130K licensing) within the first 90 days by implementing ML-based right-sizing engines and negotiating strategic vendor agreements. Identified and mitigated bottlenecks in the development flow by deploying an ML-driven change risk assessment engine and expanding automated regression coverage from 45% to 85%. Resulted in an 18% uplift in cloud release velocity and a 60% reduction in production defects. Fortified the platform by embedding security best practices into the delivery pipeline. Deployed real-time anomaly detection to identify novel threats and automated vulnerability scanning to ensure secure, compliant releases without slowing delivery. Collaborating with cross-functional stakeholders to align platform capabilities with competing constraints of reliability, security, and delivery speed; actively governing key metrics including RPO, RTO, cloud spend, and vulnerability posture.

Senior Software Engineering Manager - DevOps & Cloud Platform

Boeing
May, 2023 - Sep, 20241 yr 4 months
    Built and scaled a Cloud Platform organization from inception to 30+ members, including architects and DevOps engineers. Established Azure Platform Engineering, FinOps, reusable cloud components, Policy as Code, Governance as Code, and observability capabilities. Increased Azure efficiency by 30%, reducing deployment timelines by over 4 weeks. Led a FinOps program delivering over $1M in cloud savings within 8 months. Cultivated a high-performance culture through goal setting, performance reviews, and mentorship. Developed a Cloud Platform roadmap aligning initiatives to business priorities and stakeholder needs. Improved developer experience and cloud onboarding, driving streamlined adoption. Maintained strong global stakeholder relationships and represented engineering with customers and vendors. Produced documentation and learning resources to accelerate platform proficiency. Advanced continuous improvement based on post-incident analyses to enhance stability. Improved code quality by expanding unit test coverage from 45% to 90%, reducing production defects by 60% within six months. Stabilized cloud platform by reducing MTTR from 2 hours to 15 minutes through proactive runbooks and automated rollback on failure enabling faster recovery.

Site Reliability Engineering Manager

VMware
Dec, 2021 - Feb, 20231 yr 2 months
    Embedded a proactive reliability mindset, anticipating issues and implementing preventive measures. Evolved platform reliability architecture to boost availability, resiliency, and performance. Integrated advanced monitoring and alerting to strengthen observability and incident readiness. Directed incident response during major events to ensure rapid and effective resolution. Managed a backlog of platform improvements enhancing availability, security, and performance. Defined policies and procedures to guide service operations and quality standards. Presented performance reports to leadership, showcasing outcomes and opportunities. Fostered collaboration and recognized technical excellence to motivate engineers. Reduced incident response time by 23% via cohesive on-call processes and automated escalation. Led initiative to consolidate incident response playbooks, reducing outage duration by 12% and saving $60k in annual operational costs organization-wide. Scaled infrastructure with automated policies to double peak capacity and sustain 99.99% uptime during traffic spikes across critical VM workloads.

Team Manager, Hosting & Cloud Solutions

Eli Lilly and Company
Aug, 2020 - Dec, 20211 yr 4 months
    Orchestrated a high-performance engineering team to deliver a next-generation cloud platform. Served as Co-Product Owner, streamlining deliverables and acceptance criteria. Selected methodologies and architectures aligned with mission and governance. Planned and governed cloud migration programs to AWS/Azure with cross-functional, global teams. Enabled teams with training and resources to meet timelines; refined forecasts via variance analysis. Determined and executed cloud transformation and migration strategies after discovery and assessment. Provided consulting and engineering services to programs and projects. Accelerated migration throughput by 40% through standardized tooling and templates.

Cloud Services Manager (DevOps)

Technicolor
Nov, 2019 - Aug, 2020 9 months
    Led major cloud and automation initiatives for Technicolor. Built and mentored a multi-cloud team managing daily operations on Azure, AWS, and GCP. Empowered engineers to deliver innovative solutions, capture requirements, and make platform decisions. Authored runbooks, implemented industry best practices, and provided 24/7 infrastructure support. Improved service uptime to 99% by instituting robust automation and runbooks.

Hybrid Cloud Infrastructure Architect

Novo Nordisk
Jun, 2018 - Nov, 20191 yr 5 months
    Worked on AWS, Azure, Private Cloud, IaaS, Virtualization, Compliance (GxP, GISP, GDPR).

End User Computing Specialist

VMware
Feb, 2012 - Jun, 20186 yr 4 months
    Provided end-user computing support.

Client Technical Support Associate

Dell
Jan, 2010 - Feb, 20122 yr 1 month
    Provided client technical support for Dell products and services.

Achievements

  • Established a DevOps & Azure Platform Engineering Team within Boeings Digital Aviation Solutions group. Realized 1M+ savings in Azure Infrastructure expenditure through a dedicated FinOps Team
  • Implemented a Platform Engineering Team at VMware Inc. focused on Tanzu Observability on AWS
  • Assembled an Engineering Team dedicated to Platform Engineering, App Modernization, Cloud Architecture, and Data Center Operations at Eli Lilly & Company
  • Successfully integrated Microsoft Azure as a secondary cloud vendor at Eli Lilly & Company, resulting in significant cost savings of over 600K per year through optimized cloud workload distribution and licensing
  • Delivered over 200K+ in annual savings on Private and Public Cloud costs at Novo Nordisk

Education

  • Master of Computer Applications (Cloud Computing)

    Jain University
  • Bachelors Degree in Computer Applications

    SSMRV College (Bangalore University)