Production Ops Engineer / Site Reliability Engineer
Cotiviti, Aug 2023 - Present (2 yr 4 months)
Create and maintain fully automated CI/CD pipelines for code deployment using Jenkins. Deploy to Docker Swarm and OpenShift using Groovy scripts and YAML. Built and deployed Docker containers to break up the monolithic application into microservices, improving developer workflow, increasing scalability, and optimizing speed. Manage Git and GitHub repositories for branching, merging, and tagging. Developed container support for Docker and cloud environments. Deployed Java applications in Docker containers and managed containers at the enterprise level. Created Docker images from Dockerfiles, worked with Docker container snapshots, removed images, and managed Docker volumes. Configured Docker containers for branching purposes and implemented a Docker automation solution for the CI/CD model. Worked with OpenShift Enterprise, which allows developers to quickly develop, host, and scale applications in a self-managed cloud environment. Manage OpenShift master and worker nodes, including upgrades: evacuate nodes to remove them from active participation, then upgrade them. Implemented persistent storage provisioning by creating and managing PVs and PVCs in OpenShift. Designed and configured Namespaces and Resource Quotas to optimize resource allocation and enforce multi-tenant workload isolation. Developed and enforced custom RBAC policies for users and groups to meet organizational security and compliance requirements. Performed troubleshooting and root cause analysis of pod failures and cluster-level issues to ensure high availability. Automated application deployments using Helm charts integrated with Service Accounts for secure and controlled access management. Handle issue tracking, project collaboration, and reporting through Jira, and documentation and knowledge sharing through Confluence.