DevOps - Cloud - SRE mindset

Building reliable
AWS & Kubernetes platforms.

I'm a DevOps Engineer with around 3 years of experience designing CI/CD pipelines, container platforms, and observability for fast-moving teams. I care about clean automation, predictable releases, and systems that don't wake you up at 3 AM.

AWS Certified Solutions Architect - Associate
Terraform - Kubernetes - Jenkins - GitHub Actions
Prometheus - Grafana - Cost Optimization

About

Who I am & how I think about DevOps.

I'm Vikash Kumar, a DevOps Engineer focused on building predictable, observable, and cost-efficient platforms on AWS. I've worked with product teams to automate deployments, migrate from EC2 to Kubernetes, and introduce monitoring that actually helps people debug issues, not drown in alerts.

  • Design CI/CD pipelines that keep shipping fast but safe.
  • Use infrastructure as code (Terraform) so infra is repeatable and reviewable.
  • Invest early in monitoring, logs, and dashboards to reduce MTTR.
  • Partner with dev teams to make DevOps a collaboration, not a ticket queue.

Skills

The tools & concepts I use daily.

I work across cloud, containers, automation, and observability, with a strong focus on AWS and Kubernetes ecosystems.

AWS (EC2, S3, IAM, VPC, CloudWatch) Kubernetes - EKS Docker - Docker Compose Terraform - IaC Jenkins - GitHub Actions - CI/CD Prometheus - Grafana Linux - Networking basics Python - Bash scripting Security & cost optimization mindset
Recently, I:
  • Migrated a legacy stack from EC2 to Kubernetes using Helm & ArgoCD.
  • Improved deployment times by ~70% via CI/CD optimizations.
  • Cut staging infra costs by ~46% with smarter scheduling & right-sizing.
  • Rolled out Prometheus & Grafana for centralized observability.

Services

How I can help your team or product.

DevOps Solutions

CI/CD - Automation - Release Engineering

Design and implement end-to-end CI/CD pipelines that keep shipping fast while staying safe, with automated testing, approvals, and instant rollbacks.

I work with teams to replace manual deploys with fully automated pipelines using tools like Jenkins and GitHub Actions. We define pipeline stages for build, test, security checks, and deployment, so every release is traceable and repeatable.

I also focus on versioning, artifact governance, and rollback strategies so it's always clear what is running in each environment and how to roll back quickly if needed.

Cloud Infrastructure Design

AWS - Kubernetes - IaC

Plan and build cloud infrastructure on AWS with infrastructure as code, focusing on security, reliability, and cost awareness from day one.

From VPCs and networking to EC2, EKS, and managed services, I help design architectures that match your product's needs without overcomplicating the stack.

Using Terraform, environments are codified and version-controlled, making changes reviewable, auditable, and easy to replicate across staging and production.

📉 FinOps & Cost Optimization

Cloud Cost Analysis & Tuning

Analyze AWS usage, right-size infrastructure, and implement smart scheduling and policies to reduce cloud bills without hurting performance.

At Dhwani, I helped reduce AWS staging costs by ~46% by auditing unused resources, applying instance scheduling, and picking more appropriate instance families and storage options.

I bring this mindset to new environments as well: measure first, then optimize with data, making cost a feature of the platform rather than a surprise at the end of the month.

📊 Monitoring & SRE Solutions

SLIs - SLOs - Observability

Set up meaningful dashboards, alerts, and logging that help teams see issues early and debug quickly instead of drowning in noise.

I've rolled out Prometheus and Grafana for services, defining SLIs like latency, error rates, and resource saturation. Alerts are based on error budgets instead of raw thresholds, so teams get notified about what actually matters.

Logs and metrics are structured to make it easy to correlate deploys, incidents, and performance regressions across environments.

🚀 Scalable & Reliable Deployments

Kubernetes - Containers - Rollouts

Containerize applications and run them on Kubernetes with clean rollout strategies, health checks, and resource tuning for stability.

I've migrated workloads from EC2 to Kubernetes using Helm and GitOps tooling like ArgoCD, enabling automated deployments across multiple environments with consistency.

Readiness and liveness probes, resource requests/limits, and progressive rollouts help keep apps healthy while you iterate quickly.

Experience

Where I've been applying these skills.

Associate DevOps Engineer

Dhwani Rural Information System - Nov 2023 to Present - Gurgaon
Current

Owning CI/CD, containerization, and observability for multiple products, with a focus on reliability and cost efficiency.

  • Migrated legacy workloads from EC2 to Kubernetes using Helm, ArgoCD & Terraform.
  • Optimized Jenkins pipelines and integrated automated tests, reducing deployment time by ~70%.
  • Reduced AWS staging costs by ~46% using instance scheduling and usage audits.
  • Implemented Prometheus & Grafana for centralized monitoring and faster incident response.
AWS Kubernetes Terraform Jenkins Prometheus - Grafana

DevOps Engineer

Solulab - Dec 2022 to Aug 2023 - Remote
Previous

Built and maintained CI/CD pipelines and cloud infrastructure for containerized applications running on AWS and Kubernetes.

  • Created CI/CD workflows with GitHub Actions for build, test & deploy.
  • Developed Shell & Python automation scripts for backups and log management.
  • Deployed and managed containerized apps on AWS & Kubernetes.
GitHub Actions Docker Kubernetes Python

Projects

Highlights of real impact work.

mGrant Platform - DevOps Enhancements

Dhwani - Core product
Infra redesign

Led DevOps initiatives to improve scalability, reliability, and operational efficiency for a major grants management product.

  • Moved from EC2-based deployments to Kubernetes with Helm & ArgoCD.
  • Redesigned staging infra, cutting costs while keeping performance intact.
  • Built Python scripts to generate daily test reports to speed up debugging.
Kubernetes ArgoCD Terraform Python

Observability & Uptime Initiative

Cross-product monitoring
Reliability

Introduced centralized monitoring and alerting patterns to reduce blind spots and improve mean time to resolution (MTTR).

  • Rolled out Prometheus + Grafana template dashboards for key services.
  • Defined SLIs around latency, error rate & resource saturation.
  • Helped teams debug issues faster by correlating logs, metrics & deploys.
Prometheus Grafana CloudWatch

Certifications

Some formal proof I take this seriously.

AWS Solutions Architect - Associate

Amazon Web Services
Cloud Architecture

Validated skills in designing and operating scalable, secure, and cost-optimized solutions on AWS.

Docker Essentials - DevOps on AWS - GCP Milestone

IBM - Coursera - Google Cloud
Continuous learning

Completed multiple courses covering containers, CI/CD, and hands-on cloud labs, reinforcing day-to-day DevOps practices.

Contact

Let's build something reliable together.

Got a role, project, or idea?

I'm open to DevOps / Platform / Cloud roles where I can own infrastructure, CI/CD, and observability. I'm also happy to collaborate on DevOps projects or help with guidance where I can add value.

📍 Gurgaon, India - Open to remote

I'll tailor my reply based on what you pick here.