10+ years engineering systems from scratch. I build highly reliable, scalable, and observable multi-cloud infrastructure platforms.
Zero-Downtime
Tenant Upgrades
Alert Noise
Eliminated via AI
Deploy Speed Gain
(600m to <10m)
Infra Cost Saved
via Arch Migrations
I am a Staff-level engineer with a deep background in Python software engineering, SRE, and platform infrastructure. My approach bridges the gap between software development and cloud-native architecture.
Instead of just operating systems, I build and automate them. Whether it's crafting an autonomous AI SRE agent from scratch or architecting a multi-tenant observability platform capable of handling 26 million logs per minute across 600+ tenants, I focus on creating self-healing, scalable structures.
I believe in proactive over reactive engineering. My goal is always to treat infrastructure as a software problem, eliminating toil, reducing MTTR, and empowering developers to move faster with confidence.
Atlan | Platform Engineering
Architected and built an autonomous AI SRE agent (OpenClaw + Claude) that investigates production incidents end-to-end. Orchestrates parallel observability queries across Prometheus, ClickHouse, and Grafana.
Atlan | Multi-Tenant Platform
Defined and led platform reliability across AWS, Azure, and GCP, setting standards for all engineering teams across 600+ enterprise tenants.
Atlan | Observability
Led a massive migration from isolated, tenant-based ELK stacks and Prometheus to a centralized, highly scalable observability pipeline.
Careem | Core Infrastructure
Drove a complete overhaul of the deployment pipeline and container orchestration strategy for the organization's super-app ecosystem.
Careem | Engineering Metrics
Designed and scaled an organization-wide Deployment KPIs dashboard tracking critical metrics like Change Lead Time (CLT), MTTR, MTTD, and batch size across all 600+ services.
Careem | Python Automation
Built a comprehensive Python automation toolkit to programmatically convert untracked, manually-created AWS infrastructure into strictly managed Infrastructure as Code (IaC) without downtime.
Certified Kubernetes Administrator
Certified Kubernetes Security Specialist