Arghaya Mondal

Skilled DevOps Engineer with AWS DevOps and SysOps certifications. I have architected, deployed, and managed Linux-based servers and Kubernetes clusters with high uptime and load tolerance, and written scripts to automate infrastructure deployment, backups, CI/CD pipelines, serverless applications, monitoring, and reporting.


AWS: IAM, RDS, ALB, Auto Scaling, EKS, ECS, ECR, Route 53, S3, AWS CodePipeline, Lambda, CloudWatch

Tools and Technology: Kubernetes, Docker, Docker-Compose, GitLab, Ansible, Jenkins, Burp Suite, Terraform, SonarQube

Monitoring tools: New Relic, Nagios, ELK, Security Hub, GuardDuty

Domain Knowledge: DHCP, DNS, Active Directory, NFS

Databases: Redis, MySQL, MongoDB, PostgreSQL, Elasticsearch

App & Web Servers: Nginx, Apache, Tomcat

Audits: HIPAA, SSAE – SOC

Operating Systems: Linux - Ubuntu, Arch Linux, Alpine

  • Role

    DevOps Engineer

  • Years of Experience

    7.00 years

Skillsets

  • Python - 4 Years
  • SonarQube
  • Terraform
  • Tomcat
  • Ubuntu
  • ALB
  • Auto Scaling
  • ECR
  • AWS CodePipeline
  • Burp Suite
  • Security Hub
  • NFS
  • Arch Linux
  • Alpine
  • S3
  • Bash - 7 Years
  • MicroServices - 5 Years
  • Performance Optimization - 8 Years
  • Cloud Systems - 8 Years
  • Security - 5 Years
  • Networking - 5 Years
  • AWS Services - 7 Years
  • IaC - 5 Years
  • Java - 3 Years
  • AWS - 5 Years
  • C - 1 Year
  • C++ - 2 Years
  • DevOps - 6 Years
  • Jenkins
  • Ansible
  • Apache
  • CloudWatch
  • DHCP
  • DNS
  • Docker - 8 Years
  • ECS
  • EKS
  • Elasticsearch
  • ELK
  • GitLab - 5 Years
  • GuardDuty
  • IAM
  • Active Directory
  • Kubernetes - 5 Years
  • Lambda
  • Linux - 8 Years
  • MongoDB
  • MySQL
  • Nagios
  • New Relic
  • Nginx
  • Postgres
  • RDS
  • Redis
  • Route 53

Professional Summary

7.00 Years
  • Oct, 2022 - Present (2 yr 6 months)

    Sr. Site Reliability Engineer

    Grab Grecco LLP & GRXST (GxS Bank Singapore)
  • Jan, 2021 - Oct, 2022 (1 yr 9 months)

    Senior Engineer-DevOps

    SourceFuse Technologies
  • Nov, 2017 - Dec, 2020 (3 yr 1 month)

    DevOps Engineer

    Grazitti Interactive
  • Mar, 2017 - Nov, 2017 (8 months)

    Network and Server Administrator

    Toxsl Technologies

Applications & Tools Known

  • Datadog
  • PagerDuty
  • Terraform
  • SWIFT
  • EKS
  • EC2
  • Python
  • AWS CodePipeline
  • Jenkins
  • S3
  • AWS Lambda
  • SonarQube
  • Snyk
  • New Relic
  • Nagios
  • ELK
  • RDS
  • DigitalOcean
  • VMware ESXi
  • GitLab
  • Docker
  • Harbor
  • Ansible
  • CloudWatch
  • MySQL
  • PostgreSQL
  • MongoDB
  • Elasticsearch
  • Kubernetes
  • Docker-Compose
  • Burp Suite

Work History

Sr. Site Reliability Engineer

Grab Grecco LLP & GRXST (GxS Bank Singapore)
Oct, 2022 - Present (2 yr 6 months)
  • Led the observability setup and documentation, optimizing logging solutions using Datadog, refining logging pipelines, and updating attribution mappings to enhance readability and searchability of logs.
  • Achieved a 35% month-on-month cost reduction by optimizing log storage through index creation and exclusion filters in Datadog.
  • Empowered engineering teams by enabling the setup of service-specific dashboards, significantly improving outage diagnosis and resolution.
  • Developed custom monitoring and alerting tools using Datadog, providing proactive system health insights.
  • Collaborated with the engineering team to reduce nagging and false-positive alerts on PagerDuty, enhancing alert accuracy.
  • Integrated PagerDuty and Datadog using Terraform, with full documentation of the process for future reference.
  • Set up SWIFT (Society for Worldwide Interbank Financial Telecommunication) instances following guidelines from the SWIFT team, ensuring compliance and security.
  • Implemented a 1-click AMI rotation process for EKS and EC2-based solutions by creating a custom EC2 Terraform module, maintaining machine integrity and security.
  • Automated the provisioning, configuration, and deployment processes, reducing manual effort and improving efficiency.
  • Authored multiple SOPs and troubleshooting guides to support the SRE team in standardizing operations.

Senior Engineer-DevOps

SourceFuse Technologies
Jan, 2021 - Oct, 2022 (1 yr 9 months)
  • Spearheaded, configured, and maintained highly scalable infrastructure on AWS using Infrastructure as Code (IaC) tools such as Terraform, ensuring robust and efficient deployment.
  • Administered robust CI/CD pipelines through AWS CodePipeline, CodeBuild, and Jenkins, streamlining the continuous integration and deployment processes.
  • Administered highly available static websites using S3 buckets and CloudFront for CDN, ensuring optimal performance and reliability.
  • Established automated application deployment processes across production, pre-release, QA, and development environments, enhancing deployment efficiency.
  • Enforced stringent security controls on AWS by leveraging KMS, Security Hub, GuardDuty, and AWS Organizations, ensuring a secure cloud environment.
  • Configured and maintained EKS Kubernetes clusters, with autoscaling of nodes and pods to accommodate traffic fluctuations, maintaining seamless operations.
  • Promoted secure coding practices by integrating SonarQube, Snyk, and ECR vulnerability scanning into the CI pipeline, ensuring code quality and security.
  • Utilized AWS Lambda for deploying serverless applications and configured custom headers for CloudFront, enhancing application functionality.
  • Documented standard runbooks and operating procedures, ensuring consistent and efficient operational practices.
  • Managed IAM roles and policies, implemented internal controls, and adhered to the Principle of Least Privilege (PoLP) for risk management.

DevOps Engineer

Grazitti Interactive
Nov, 2017 - Dec, 2020 (3 yr 1 month)
  • Administered the SearchUnify server on AWS infrastructure, encompassing EC2, RDS, and ElastiCache, while managing troubleshooting, monitoring, configuration, optimisation, and comprehensive documentation of server operations.
  • Deployed, maintained, and monitored Linux servers across diverse platforms, ensuring seamless operations.
  • Managed AWS EC2, DigitalOcean, and in-house VM servers on VMware ESXi, providing a robust and scalable server environment.
  • Implemented CI/CD pipelines using GitLab and Jenkins, alongside microservices with Docker, Docker Hub, Harbor, and SonarQube, enabling efficient code linting in real time.
  • Leveraged Ansible for configuration management, secret management, and automation of IT infrastructure tasks, streamlining operations.
  • Migrated a monolithic codebase to microservices architecture using Docker, enabling serverless application deployment.
  • Set up CloudWatch, Nagios, and ELK for comprehensive alerting, logging, and SIEM, enhancing system observability and security.
  • Orchestrated, monitored, and maintained high-availability DB clusters, ensuring reliable uptime and performance.
  • Administered MySQL, PostgreSQL, MongoDB, and Elasticsearch on both bare-metal and AWS RDS/AWS Elasticsearch platforms, ensuring optimal database performance.
  • Contributed to external and internal audits, managed security breaches and SLA breaches, and handled client questionnaires with diligence and expertise.

Network and Server Administrator

Toxsl Technologies
Mar, 2017 - Nov, 2017 (8 months)
  • Maintained and monitored the office network, firewall, and email servers.
  • Maintained employee user accounts and email accounts using Active Directory.
  • Maintained internal VMs, hardware, and cloud VPS servers for hosting services.
  • Utilized multiple cloud servers to deploy PHP-based websites.
  • Installed the latest security patches on Linux, Windows, and macOS.

Achievements

  • Led the observability setup and documentation, optimizing logging solutions using Datadog, refining logging pipelines, and updating attribution mappings to enhance readability and searchability of logs.
  • Achieved a 35% month-on-month cost reduction by optimizing log storage through index creation and exclusion filters in Datadog.
  • Empowered engineering teams by enabling the setup of service-specific dashboards, significantly improving outage diagnosis and resolution.
  • Developed custom monitoring and alerting tools using Datadog, providing proactive system health insights.
  • Collaborated with the engineering team to reduce nagging and false-positive alerts on PagerDuty, enhancing alert accuracy.
  • Integrated PagerDuty and Datadog using Terraform, with full documentation of the process for future reference.
  • Set up SWIFT (Society for Worldwide Interbank Financial Telecommunication) instances following guidelines from the SWIFT team, ensuring compliance and security.
  • Implemented a 1-click AMI rotation process for EKS and EC2-based solutions by creating a custom EC2 Terraform module, maintaining machine integrity and security.
  • Automated the provisioning, configuration, and deployment processes, reducing manual effort and improving efficiency.
  • Authored multiple SOPs and troubleshooting guides to support the SRE team in standardizing operations.

Major Projects

1 Project

Implementation of DevSecOps Framework in Telemedicine Project Development Using AWS Cloud Provider

Oct, 2021 - Present (3 yr 6 months)
    Research paper on the implementation of a DevSecOps framework in telemedicine project development using AWS as the cloud provider.

Education

  • M.Sc Computer Science - DevOps Specialization

    Liverpool John Moores University (2024)
  • Bachelor of Technology in Computer Science and Engineering

    Lovely Professional University (2017)

Certifications

  • AWS Certified DevOps Engineer - Professional

  • AWS Certified SysOps Administrator

  • HIPAA Awareness for Business Associates

  • HashiCorp Certified: Terraform Associate