Available for new opportunities

Hemanth

Site Reliability Engineer

5+ years building resilient, secure, and cost-efficient infrastructure on AWS — Kubernetes, Terraform, CI/CD, and AI-augmented workflows.

hemanth@sre — bash
hemanth@sre:~$
5 +
Years Experience
AWS & Kubernetes
30 x
Faster Analysis
AI-augmented SRE
80 %
Vuln Reduction
Automated patching
4 x
DDoS Blocked
Zero downtime

What I Do

AI-Augmented Ops

Claude & Codex skill files that cut vulnerability analysis from 60 min to under 2 min. Reasoning where scripts can't.

Platform Reliability

SLO definition, burn-rate alerting, and on-call infrastructure built to surface risk before customer impact.

Kubernetes Operations

EKS & GKE cluster management, blue-green upgrades, Karpenter node autoscaling, and Helm-based deployments.

Security & Hardening

CloudFront + WAF edge defense, secrets centralization, and automated kernel patching across the EC2 fleet.

Infrastructure as Code

Modular Terraform with remote state, mandatory tagging, and drift detection — every resource is reproducible.

CI/CD Pipelines

Designed and built GitLab CI/CD from scratch across 5+ microservices — multi-stage pipelines, environment-gated deployments, and 60% less manual intervention.

Cost Engineering

Rightsizing and resource cleanup based on utilization analysis — Karpenter over Cluster Autoscaler for 25% compute savings, combined with scheduled scaling and idle resource deletion.

Cloud Networking

Designed AWS VPC architecture with public/private subnet segmentation, NAT gateways, and security groups across dev, QA, and production environments.

Featured Projects

View All

AI-Augmented SRE Workflows

Built reusable Claude/Codex skill files at GoGuardian that automated vulnerability analysis and DDoS alert investigation — cutting 60+ minutes of manual security analysis to under 5 minutes per run.

AI/LLMClaudeCodexPythonAWS Athena +1
Read more

WealthFolio — Self-Hosted Portfolio Tracker

Production-grade Indian investment portfolio tracker built with Go and React 19. Multi-broker import (Zerodha, Groww, INDMoney), Gmail auto-import, FIFO cost basis, XIRR, TimescaleDB time-series snapshots, and AI market analysis — deployed as a single Docker binary on a Raspberry Pi.

GoReact 19TimescaleDBDockerTanStack Router +1
Read more

EKS Cluster Upgrade: v1.23 → v1.28

Blue-green EKS cluster migration from a manually-managed v1.23 cluster to a Terraform-provisioned v1.28 cluster with VPC-only access — achieving under 5 minutes of user-facing impact and 100% IaC coverage.

KubernetesAWS EKSTerraformHelmArgoCD +1
Read more

Latest Writing

View All

AI as an SRE Tool: Beyond the Hype

We moved from writing Python scripts for repetitive SRE tasks to using Claude skill files — and it changed how we think about automation. Here's what actually works, what doesn't, and why the distinction matters.

AISREAutomationDevOpsClaude +1
Read more

How We Stopped DDoS Attacks from Reaching Our Servers

After a 90-minute outage taught us that blocking traffic at the origin is already too late, we redesigned our DDoS defense around CloudFront and WAF at the edge — and haven't had a successful attack in 2 years.

AWSCloudFrontWAFDDoSSecurity +1
Read more

Let's work together

Open to SRE, DevOps, and Platform Engineering roles. Always happy to connect.