Join LeapThought's Innovative Team
Build the future of infrastructure delivery with a team that values innovation, collaboration, and excellence in transforming the built environment.
SRE Engineer (Azure)
About the Role
We are hiring an SRE Engineer to build and operate highly reliable, scalable cloud systems on Azure. You will play a key role in ensuring platform stability, performance, and operational excellence through automation and proactive monitoring.
What You'll do
-
Own reliability, availability, and performance of Azure-based systems
-
Design and implement monitoring, alerting, and observability frameworks
-
Define and track SLOs/SLAs to drive service reliability
-
Lead incident response, postmortems, and continuous improvement initiatives
-
Automate infrastructure and operational workflows using modern tooling
-
Implement resilience patterns (autoscaling, failover, redundancy)
- Partner with engineering teams to improve deployment and operational readiness
What We're Looking For
-
3–6 years in SRE, Cloud Operations, or Infrastructure Engineering
-
Strong experience with Azure (Monitor, Log Analytics, AKS, networking)
-
Hands-on experience with IaC (Terraform, Bicep) and scripting (Python, PowerShell)
-
Solid understanding of distributed systems and reliability engineering principles
-
Willingness to participate in on-call rotations
