Azure certifications (AZ-305, AZ-400, AZ-700, AZ-104 or equivalent).
Job Description:
We are seeking a highly experienced Azure SRE / DevOps Architect to design, implement, and optimize scalable, secure, and highly available cloud solutions. The ideal candidate will have deep expertise in Azure, automation, CI/CD, and reliability engineering practices.
Key Responsibilities:
Cloud Infrastructure & Automation
Automate provisioning and configuration using Terraform, Ansible, Puppet, or Chef.
Develop and maintain ARM templates, Bicep modules, and Infrastructure-as-Code (IaC) pipelines.
Build automation workflows using Python, PowerShell, or Shell scripting to streamline cloud operations.
CI/CD & DevOps Engineering
Design and manage CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins.
Implement automated testing, version control, and deployment strategies.
Integrate SRE practices, observability, and operational checks into pipelines.
Azure Architecture & Reliability Engineering
Design scalable architectures including microservices, serverless (Azure Functions), and containerized workloads.
Manage and optimize Azure Kubernetes Service (AKS) clusters.
Implement monitoring solutions using Azure Monitor, Application Insights, and Log Analytics.
Security, Governance & Compliance
Implement security-by-design principles across cloud solutions.
Manage security controls including Key Vault, RBAC, NSGs, and Private Endpoints.
Ensure compliance with industry standards (SOC, ISO, NIST, CIS benchmarks).
Monitoring, Reliability & Incident Response
Define and monitor SLOs, SLIs, and SLAs.
Build dashboards and alerts using Azure Monitor and Grafana.
Troubleshoot production issues and drive root cause analysis.
Participate in on-call rotations and improve system reliability (MTTR reduction).
Collaboration & Continuous Improvement
Collaborate with Dev, Security, and Operations teams.
Promote automation and cloud best practices.
Mentor junior engineers and drive continuous improvement initiatives.
Qualifications:
Required:
Bachelor’s degree in Computer Science, Engineering, or related field.
12+ years of experience in SRE, DevOps, or Cloud Engineering.
Strong expertise in Azure services (IAM, networking, storage, compute).
Proficiency in Python, PowerShell, or Shell scripting.
Hands-on experience with AKS, Docker, and container orchestration.
Experience with CI/CD pipelines, Git, and automated testing.
Strong knowledge of cloud security, identity management, and compliance.
Preferred:
Azure certifications (AZ-305, AZ-400, AZ-700, AZ-104 or equivalent).
Experience with Service Mesh (Istio, Linkerd) and API Gateways.
Knowledge of FinOps and cloud cost optimization.
Familiarity with distributed tracing tools (OpenTelemetry, Jaeger, Zipkin).