SRE · Platform · MLOps · Cloud
MILAN
SURAS
Site Reliability Engineer
Thiruvananthapuram, Kerala

2.6 years of hands-on experience managing and automating multi-cloud infrastructure across AWS and GCP. Specializing in DevOps, SRE, Platform and Observability engineering. CKA certified with 4 active cloud certifications. Currently exploring AI integration, AI Observability, and MLOps.

☁️CKA
☁️GCP DevOps Pro
☁️GCP Architect Pro
☁️GCP ACE
☁️AWS CCP
~ milan@sre:~$
whoami
Milan Suras · SRE @ Equifax Analytics
 
cat certifications.txt
CKA · GCP DevOps Pro · GCP Architect
GCP ACE · AWS Cloud Practitioner
 
kubectl get expertise
GKE / EKS / Kubernetes
Terraform / Packer / GitOps
Jenkins / GitHub Actions / ArgoCD
Prometheus / Grafana / Istio
 
echo $EXPLORING
AI Observability · MLOps · LLM Infra
2.6+
Years Experience
5
Cloud Certifications
85+
Microservices Managed
20+
Apps GitOps Onboarded
01
Technical Arsenal · Skills & Stack
☁️
Cloud Platforms
GCP · GKE · Compute Engine · Artifact Registry · Cloud Build · Dataflow · Dataproc · AWS · EKS · EC2 · S3 · Route53 · ECR
☸️
Containers & Orchestration
Kubernetes · Docker · Helm · Kustomize · Istio · HPA / Cluster Autoscaler
🔁
CI/CD & GitOps
Jenkins · GitHub Actions · ArgoCD · Cloud Build · GitOps · ArgoCD Image Updater · Argo Rollouts
🏗️
Infrastructure as Code
Terraform · Packer · VM Image Automation · Cloud Migration · IaC
📡
Monitoring & Observability
Prometheus · Grafana · Cloud Operations · Cloud Trace · Cloud Logging · Loki · Distributed Tracing
💻
Languages & OS
Bash / Shell · Python · Golang · Linux (RHEL, Ubuntu) · Git · Google Apps Script
🔒
Security & Secrets
HashiCorp Vault · Google Secret Manager · PingSafe · Cloud Compliance · RHEL Patching
🤖
MLOps / AI (Exploring)
ML Model Deployment on GKE · AI Observability · MLOps Pipelines · LLM Infrastructure
02
Production & Side Work · Projects
⚕️
GitOps · Healthcare
GitOps Pipeline for Med-Tech Client
July 2024

Designed and deployed a full GitOps pipeline for a healthcare application on EKS, with a complete observability stack and zero-touch deployments.

  • CI managed by GitHub Actions on self-hosted runner
  • EKS cluster with Prometheus, Grafana, Loki & ArgoCD via Helm + Terraform
  • HashiCorp Vault for secrets management with automatic credential rotation
  • Automated container builds, ECR artifact storage, ArgoCD Image Updater
  • Zero-touch deployments with automated rollback on health check failures
EKS · Terraform · GitHub Actions · Helm · ArgoCD · Vault · AWS
Live
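A minimal sketch of the rollback decision behind "automated rollback on health check failures", not the pipeline's actual implementation. The `Release` model and `choose_target` helper are hypothetical names introduced for illustration; in the real setup this decision would be made by the deployment tooling rather than by hand-written Python.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Release:
    """A deployed image tag and whether its health checks passed (hypothetical model)."""
    tag: str
    healthy: bool


def choose_target(history: List[Release]) -> str:
    """Return the tag that should be running once the newest release is evaluated.

    `history` is ordered oldest to newest. A healthy newest release stays in
    place; otherwise the most recent healthy release before it is selected.
    """
    if not history:
        raise ValueError("no releases recorded")
    newest = history[-1]
    if newest.healthy:
        return newest.tag
    # Walk backwards through earlier releases to find a safe rollback target.
    for release in reversed(history[:-1]):
        if release.healthy:
            return release.tag
    raise RuntimeError("no healthy release to roll back to")


if __name__ == "__main__":
    history = [
        Release("v1.4.0", True),
        Release("v1.4.1", True),
        Release("v1.5.0", False),  # failed its health checks
    ]
    print(choose_target(history))
```

Keeping the decision as a pure function over release history makes it easy to test separately from the cluster.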
🔷
Platform Engineering · GCP
Microservices Platform on GKE with Istio
October 2024

Built scalable microservices platform on GKE supporting 15+ services with Istio service mesh architecture, full observability and cost-optimised autoscaling.

  • Istio service mesh for traffic management, mTLS and observability
  • Google Cloud Operations (Stackdriver) + Prometheus + Grafana dashboards
  • Distributed tracing with Cloud Trace, log aggregation with Cloud Logging
  • HPA + Cluster Autoscaler for cost-optimised resource management
  • GitHub Actions CI/CD with automated testing and GKE deployments
GKE · Istio · Terraform · Prometheus · Grafana · Cloud Trace
Live
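For context on the HPA bullet above, here is a small sketch of the replica calculation documented for the Kubernetes Horizontal Pod Autoscaler: desired replicas = ceil(current replicas × current metric / target metric), skipped when the ratio is within a tolerance (0.1 by default). The function name, the min/max bounds and the sample numbers are illustrative assumptions, not values from this platform.

```python
import math


def desired_replicas(
    current_replicas: int,
    current_metric: float,
    target_metric: float,
    min_replicas: int = 1,    # assumed bound, for illustration only
    max_replicas: int = 10,   # assumed bound, for illustration only
    tolerance: float = 0.1,   # documented HPA default tolerance
) -> int:
    """Approximate the HPA scaling rule for a single metric."""
    if target_metric <= 0:
        raise ValueError("target_metric must be positive")
    ratio = current_metric / target_metric
    # No scaling when the observed metric is close enough to the target.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Clamp to the configured replica bounds.
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    # 4 pods averaging 90% CPU against a 60% target.
    print(desired_replicas(4, 90, 60))
```

The real controller also handles missing metrics, pod readiness and stabilization windows, which this sketch leaves out.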
🐧
Automation · Equifax
RHEL VM Image Rebuild Automation Pipeline
2025

Developed a fully-automated Jenkins + Packer pipeline for RHEL-based Linux VM image rebuilds every 21 days across NPE, UAT and PROD environments at Equifax.

  • Packer-based image builds triggered by Jenkins on a 21-day schedule
  • Automated across all 3 environments: NPE → UAT → PROD
  • Integrated compliance patching and security hardening in every build
  • Bash script for seamless HashiCorp Vault → Google Secret Manager migration
  • Disaster recovery implementation for VM-based applications
Jenkins · Packer · Terraform · RHEL · Bash · GCP
Live
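The 21-day cadence and the NPE → UAT → PROD order described above can be sketched as a small scheduling helper. This is an illustration only: the real schedule lives in Jenkins, and `soak_days` (the gap between promotions) is a hypothetical parameter that does not come from the project description.

```python
from datetime import date, timedelta
from typing import List, Tuple

REBUILD_INTERVAL_DAYS = 21                # from the project description
PROMOTION_ORDER = ["NPE", "UAT", "PROD"]  # from the project description


def rebuild_dates(start: date, count: int) -> List[date]:
    """Return the first `count` rebuild dates, spaced 21 days apart."""
    return [start + timedelta(days=REBUILD_INTERVAL_DAYS * i) for i in range(count)]


def promotion_plan(build_day: date, soak_days: int = 2) -> List[Tuple[str, date]]:
    """Pair each environment with a promotion date.

    `soak_days` is an assumed waiting period between environments.
    """
    return [
        (env, build_day + timedelta(days=soak_days * i))
        for i, env in enumerate(PROMOTION_ORDER)
    ]


if __name__ == "__main__":
    for day in rebuild_dates(date(2025, 1, 6), 3):
        print(day.isoformat(), promotion_plan(day))
```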
🤖
MLOps · Side Project
ML Model Serving on GKE (Exploring)
2025 — In Progress

Extending existing GKE expertise to ML workloads — deploying Python-based ML models with automated CI/CD pipelines, GPU node pools and AI observability dashboards.

  • Onboarding Python ML models to GKE using existing CI/CD pipelines
  • GPU node pool configuration with NVIDIA device plugins
  • AI Observability: model latency, throughput, drift metrics in Grafana
  • MLflow experiment tracking integrated with ArgoCD model rollouts
GKE · Python · MLflow · Prometheus · Grafana · ArgoCD
⚡ Building
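As a rough illustration of the latency, throughput and drift metrics listed above, the sketch below computes a nearest-rank percentile, requests per second, and a Population Stability Index (PSI). PSI is one common drift measure chosen here as an assumption; the project description does not say which drift metric is used, and all function names are hypothetical.

```python
import math
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile of `values` (pct in the range 0-100)."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def throughput(request_count: int, window_seconds: float) -> float:
    """Requests per second over an observation window."""
    return request_count / window_seconds


def psi(expected: Sequence[float], actual: Sequence[float], eps: float = 1e-6) -> float:
    """Population Stability Index between two binned distributions.

    Both inputs are per-bin proportions; `eps` guards against empty bins.
    """
    if len(expected) != len(actual):
        raise ValueError("distributions must have the same number of bins")
    total = 0.0
    for e, a in zip(expected, actual):
        e, a = max(e, eps), max(a, eps)
        total += (a - e) * math.log(a / e)
    return total


if __name__ == "__main__":
    latencies_ms = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
    print(percentile(latencies_ms, 95))
    print(throughput(1200, 60))
    print(psi([0.5, 0.5], [0.8, 0.2]))
```

In a Prometheus and Grafana setup, latency percentiles would more typically come from histogram queries; the functions here only show the arithmetic.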
03
Professional History · Experience
Site Reliability Engineer
Equifax Analytics Private Limited
📍 Thiruvananthapuram, Kerala
Jan 2025 – Present
  • Migrated container images from GCR to Artifact Registry with zero downtime for production workloads
  • Designed Jenkins pipelines for automated image builds every 90+ days and cleanup of untagged images
  • Onboarded Python-based ML model deployments to GKE clusters with automated CI/CD pipelines
  • Orchestrated on-premises to GCP cloud migration using Terraform and Packer
  • Developed a fully automated VM image rebuild pipeline for RHEL Linux VMs, running every 21 days across NPE, UAT and PROD
  • Wrote a Bash script to migrate secrets from HashiCorp Vault to Google Secret Manager
  • Automated cost fetching from Cloudability via Google Apps Script, with weekly and monthly email notifications
GKE · GCP · Jenkins · Artifact Registry · Terraform · Packer · RHEL · HashiCorp Vault · Dataflow · Dataproc · Jira · ServiceNow
DevOps Engineer
Stackgenie Consulting
📍 Bengaluru, Karnataka
Sep 2023 – Dec 2024
  • Onboarded 20+ applications to a GitOps-based deployment workflow using Jenkins and ArgoCD
  • Executed EKS cluster upgrades and worker node updates across multiple environments with zero downtime
  • Managed 85+ microservices across dev, staging and production environments
  • Collaborated with development teams to streamline application deployments through ArgoCD and Helm charts
  • Remediated cloud security misconfigurations identified by PingSafe
EKS · ArgoCD · Jenkins · Helm · Kubernetes · GitOps · AWS · PingSafe
DevOps Intern
Stackgenie Consulting
📍 Thiruvananthapuram, Kerala
Nov 2022 – Aug 2023
  • Built foundational knowledge in SDLC, Linux, Git, CI/CD, cloud and containerization
  • Supported senior engineers in maintaining CI/CD pipelines and Kubernetes cluster operations
  • Automated routine operational tasks with Bash scripts
  • Completed a GitOps-based POC for a fintech client, deploying the application to EKS (AWS) using ArgoCD and Helm
Kubernetes · ArgoCD · EKS · Bash · Linux · Git · Docker
B.Tech — Mechanical Engineering
Sree Chithra Thirunal College of Engineering
📍 Thiruvananthapuram, Kerala
Jul 2019 – Jul 2023
Graduated in Mechanical Engineering while pivoting into DevOps and cloud infrastructure through self-directed learning and internship. Began DevOps internship during final year.
B.Tech · Mechanical Engineering
04
Verified Credentials · Certifications
Certified Kubernetes Administrator (CKA)
Linux Foundation
Nov 2024 – Nov 2026
ID: LF-n9y0yisqz4
Professional Cloud DevOps Engineer
Google Cloud
Dec 2025 – Dec 2028
Professional Cloud Architect
Google Cloud
Dec 2025 – Dec 2028
Associate Cloud Engineer (ACE)
Google Cloud
Feb 2025 – Mar 2028
AWS Certified Cloud Practitioner
Amazon Web Services
Aug 2024 – Aug 2027
05
Let's Connect · Get in Touch

Open to SRE, Platform Engineering, MLOps and LLM Infrastructure roles. Based in Thiruvananthapuram, Kerala. Available at +91 7356584497.