Work
Experience:
12+ years (with 5+ years in a hands-on architect role)
Requirements:
Cloud platforms: Deep, hands-on experience with at least two of AWS, Azure, GCP - including compute, networking, storage, identity, and managed data services. Familiarity with the third is expected.
IaC: Terraform at an advanced level (modules, workspaces, remote backends, custom providers a plus). Working knowledge of CloudFormation, ARM/Bicep, or Pulumi.
Containers & orchestration: Production experience with Docker and Kubernetes - Helm, operators, RBAC, network policies, and cluster upgrades.
Configuration management: Ansible, Chef, or Puppet for hybrid and legacy workloads.
CI/CD: Designing and operating pipelines in at least two of Jenkins, GitHub Actions, GitLab CI, Azure DevOps, or Argo Workflows.
Scripting & programming: Strong in Python, Bash, and Go or one comparable language.
Networking: TCP/IP, DNS, load balancing, VPNs, SD-WAN, and zero-trust patterns across cloud and on-prem.
Security: IAM, KMS/HSM, secrets management, WAF, vulnerability scanning, and policy-as-code.
Certifications such as AWS Solutions Architect Professional, Azure Solutions Architect Expert, GCP Professional Cloud Architect, or CKA/CKS.
Experience with event-driven architectures (Kafka, EventBridge, Pub/Sub) and serverless (Lambda, Functions, Cloud Run).
Exposure to MLOps platforms, data lakes (Databricks, Snowflake), or edge/IoT deployments.
Contributions to open-source DevOps or cloud tooling.
10+ years in infrastructure, cloud, or DevOps roles, with at least 3 years architecting solutions at scale across multiple clouds.
Demonstrated ownership of production systems — on-call experience, post-mortems, and capacity planning.
Strong written communication - able to produce clear architecture documents, RFCs, and runbooks.
Comfortable influencing senior stakeholders and pushing back constructively on unrealistic timelines or designs.
Pragmatic - chooses the simplest solution that meets the requirement, and knows when to buy vs build.
Qualifications:
Bachelor’s degree in computer science, Engineering, Information Systems, or related technical field (or equivalent practical experience).
Job Description:
Design and implement multi-cloud landing zones across AWS, Azure, and GCP, including networking (VPC/VNet, Transit Gateway, peering, hybrid connectivity), IAM, and account/subscription structures.
Build and maintain Infrastructure-as-Code modules in Terraform (or OpenTofu), including reusable modules, remote state strategy, policy-as-code (OPA/Sentinel), and drift detection.
Architect and operate Kubernetes platforms (EKS, AKS, GKE) - cluster bootstrapping, autoscaling, ingress, service mesh (Istio/Linkerd), GitOps (ArgoCD/Flux), and workload onboarding.
Design CI/CD pipelines using Jenkins, GitHub Actions, GitLab CI, or Azure DevOps, with built-in security scanning (SAST/DAST/SCA), artifact management, and progressive delivery (blue-green, canary).
Define and enforce observability standards using Prometheus, Grafana, Loki, OpenTelemetry, Datadog, or the cloud-native stacks (CloudWatch, Azure Monitor, Cloud Logging).
Lead FinOps initiatives - tagging strategy, rightsizing, savings plans/reserved capacity, and chargeback/showback reporting.
Own the cloud security posture - IAM least-privilege, secrets management (Vault, KMS, Secrets Manager), CSPM tooling, encryption, and compliance with SOC2/ISO 27001/HIPAA/PCI as applicable.
Drive migration and modernization programs - lift-and-shift, replatforming, containerization, and event-driven/serverless redesigns.
Mentor engineers, run architecture reviews, and write ADRs that capture decisions and trade-offs.
Partner with application, data, and security teams to embed reliability and DevSecOps practices into the SDLC.