Senior Platform & DevOps Engineer

Julius
Omoleye

I build the infrastructure layer that lets product teams ship with speed, confidence, and zero firefighting. Cloud platforms, Kubernetes, developer tooling — from first principles to production scale.

Platform Engineering Kubernetes Stabilization Cloud Architecture Internal Developer Platforms AI Ops Agents Data Streaming CI/CD Automation Disaster Recovery
6+
Years Experience
30+
Client Engagements
3
Cloud Platforms
60%
Downtime Reduction

What I Build & Operate

Across AWS, Azure, and GCP — designing infrastructure that is observable, recoverable, and developer-friendly.

Kubernetes Platform Engineering

Stabilizing, scaling, and operating production Kubernetes clusters with advanced scheduling, autoscaling, and GitOps-driven delivery.

EKSROSAKarpenter ArgoCDHelmKEDA

Internal Developer Platforms

Building self-service IDPs with Backstage — software catalogs, golden path templates, and scaffolded onboarding that reduce toil for engineering teams.

BackstageIDPService Catalog ScaffoldingSelf-Service

Observability & AI Ops

Full-stack observability with Prometheus, Grafana, and ELK. Built read-only AI agents that autonomously diagnose live incidents — reducing MTTR without human escalation.

PrometheusGrafanaELK LokiAI AgentsBedrock

Distributed Data Infrastructure

Self-hosting and operating Apache Kafka (via Strimzi) and Cassandra clusters at scale. Integrating with AWS MSK and Keyspaces for hybrid event-streaming architectures.

KafkaStrimziAWS MSK CassandraAWS Keyspaces

Cloud Architecture & Migration

Designing and executing on-prem-to-cloud migrations, multi-account AWS foundations, VPN/Direct Connect hybrid networks, and disaster recovery strategies.

AWSAzureGCP TerraformControl Tower

CI/CD & Infrastructure as Code

Building repeatable, multi-environment pipelines with Terraform, Terragrunt, and Ansible. GitOps-first delivery with ArgoCD, GitHub Actions, Jenkins, and Azure DevOps.

TerraformTerragruntAnsible GitHub ActionsJenkins

Real Outcomes, Real Impact

A selection of high-impact projects and the measurable results they produced.

01 Tramango

EC2 to EKS Migration — Nigeria's Fastest OTA

Led the full architectural redesign and migration of a high-traffic flight booking system from EC2 to Kubernetes on EKS. Redesigned the deployment model, introduced health checks with Liveness and Readiness probes, and implemented Karpenter for node autoscaling — transforming Tramango into Nigeria's fastest online travel agency.

+50% Scalability
−35% Response Time
−45% Production Incidents
02 Tramango

Disaster Recovery System with 60% Faster RTO

Designed and implemented a warm standby disaster recovery architecture for a startup operating in a high-availability travel industry. Automated replication procedures, encoded the entire DR strategy in infrastructure-as-code, and ran tabletop exercises with executive stakeholders — ultimately becoming a key factor in securing investor confidence for a $5M funding round.

−60% Recovery Time
$5M Funding Round
03 Opsfleet

AI Troubleshooting Agent for Live Environments

Built a read-only AI agent integrated with CloudWatch logs, pod states, metrics, and distributed traces. The agent autonomously analyses live production incidents, surfaces root causes, and suggests remediation steps — all without requiring write access or human escalation as a first step. Directly reduced MTTR across multiple client environments.

MTTR
Escalation Rate
04 Opsfleet

Backstage IDP — Developer Self-Service Platform

Established a full Internal Developer Platform using Backstage, including a service catalog, golden path scaffolding templates, and automated environment provisioning workflows. Reduced onboarding time for new engineers, standardised infrastructure patterns, and eliminated the "ask the platform team" bottleneck for common infrastructure tasks.

Onboarding Time
Self-Service
05 Opsfleet

Self-Hosted Kafka & Cassandra on Kubernetes

Deployed and operated production-grade Apache Kafka clusters via the Strimzi operator and Apache Cassandra for distributed, low-latency workloads — all running on Kubernetes. Managed topic replication, partition strategies, compaction, and token-aware replication. Also integrated AWS MSK and Keyspaces for teams preferring managed services in hybrid architectures.

High Throughput
Hybrid Arch
06 Multiple Clients

Terraform-Driven Provisioning Automation

Built Terraform and Terragrunt-based infrastructure automation frameworks used across 30+ client environments. Standardised multi-account AWS foundations with Control Tower, automated provisioning of VPC architectures, EKS clusters, and RDS instances — eliminating manual provisioning errors and reducing infrastructure provisioning time by over 60%.

−60% Provisioning Time
30+ Environments

Tools of the Trade

Cloud Platforms
AWSAzureGCP AWS EKSAWS ECSAWS Lambda AWS BedrockAWS MSKAWS Keyspaces AWS Control TowerAWS Organizations
Containers & Orchestration
KubernetesDockerHelm ArgoCDKarpenterKEDA StrimziExternalDNSALB Controller ExternalSecretsNode AffinityTaints & Tolerations
Data & Streaming
Apache KafkaApache Cassandra PostgreSQLMySQL MongoDBDynamoDBRedis
Infrastructure as Code & CI/CD
TerraformTerragruntAnsible CloudFormationGitHub ActionsJenkins Bitbucket PipelinesAzure DevOpsAWS CodePipeline
Observability & Developer Platforms
PrometheusGrafanaLoki ElasticsearchKibanaFilebeat DatadogSplunkCloudWatch BackstageSonarQube
Languages & Scripting
PythonBashPowerShell DjangoFlaskHCL

Where I've Worked

Jan 2024 — Present
Senior Platform / DevOps Engineer
Opsfleet — Israel

Leading cloud migrations, Kubernetes platform engineering, IDP development with Backstage, self-hosted Kafka and Cassandra operations, and AI-powered ops tooling for enterprise clients across AWS, Azure, and IBM environments.

Jan 2024 — Jul 2025
Senior AWS DevOps Consultant
Datamellon — London

Engaged with 30+ clients across diverse industries, delivering AWS Well-Architected reviews, serverless migrations, RAG systems on Bedrock, and multi-account AWS foundations with Control Tower and Organizations.

Jan 2022 — Dec 2024
Lead DevOps Engineer
Tramango — Nigeria

Spearheaded cloud infrastructure foundation, EC2-to-EKS migration, disaster recovery strategy, and CI/CD automation. Contributed directly to a $5M Series A funding round by demonstrating operational maturity.

Jan 2019 — Jan 2022
Software Engineer (DevOps)
FastRyders — Nigeria

Built production-grade payment backend in Django, led migration to Amazon ECS, developed CI/CD pipelines with AWS native tools, and created monitoring solutions tracking 200+ operational metrics.

Verified Expertise

AWS Certified Solutions Architect
AWS Certified Developer
AWS Certified DevOps Engineer Professional
AWS Certified Advanced Networking Specialty
AWS Knowledge: Migration Foundations
Microsoft Certified: Azure Fundamentals

Let's build something
solid.

Whether you're modernising a legacy system, scaling a Kubernetes platform, or want to discuss how an Internal Developer Platform could transform your engineering team — I'm open to conversations.