Senior Site Reliability Engineer, Azure

🕒 April 10

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of MLabs

MLabs

51 - 200 employees

MLabs Consulting helps to setup project specification, implementation, management and maintenance of technical projects for AI, Fintech, Information Technology and more. We specialise in functional programming, compilers, AI, DevOps and full-stack development.

📋 Description

• Infrastructure Design: Architect and deploy secure, scalable Azure infrastructure tailored for production-grade distributed systems. • Automation & IaC: Develop and maintain Terraform-based infrastructure as code to enable repeatable, automated deployments across various environments. • Technical Leadership: Translate ambiguous product and customer requirements into structured technical architecture and actionable execution plans. • Platform Enhancement: Build and optimize platform services, APIs, and integrations to extend core system capabilities. • Cross-Functional Collaboration: Partner with engineering, security, and product teams to deliver enterprise-ready infrastructure solutions. • Operational Excellence: Drive improvements in reliability, observability, and incident response while providing Tier 2 infrastructure support for customer deployments.

🎯 Requirements

• Proven Track Record: Extensive experience designing and building production-grade systems specifically on the Azure stack. • Problem Solving: Ability to transform high-level requirements into scalable, delivered systems. • Communication: Strong technical communication skills with the ability to interface with both engineering teams and non-technical stakeholders. • Mindset: A high-ownership approach with a strong bias for action and accountability. • Functional Expertise: Deep knowledge of Azure networking, compute, identity, security, and storage. • Infrastructure as Code: Advanced proficiency with Terraform at production scale. • Programming: Professional experience in Go and/or Python. • Systems Engineering: Background in distributed systems, high-availability architectures, or platform engineering. • CI/CD: Experience with automation tooling for the entire infrastructure lifecycle.

🏖️ Benefits

• Equity & Tokens: Participation in the long-term growth of the project. • Performance Bonuses: Annual incentives based on individual and company milestones. • Health & Retirement: Comprehensive health insurance and 401k plans (available for US-based employees).

Apply Now

Similar Jobs

🕒 April 10

NEC Software Solutions

5001 - 10000

🏢 Enterprise

🏛️ Government

Senior DevOps Engineer managing cloud infrastructure and DevOps solutions for public service systems in a hybrid setup. Role requires AWS expertise and contributions to major national programmes.

AWS

Azure

Cloud

Kubernetes

Python

Terraform

🕒 April 9

Andromeda

11 - 50

🤖 Artificial Intelligence

🤝 B2B

🔧 Hardware

Senior Site Reliability Engineer designing and operating multi-region GPU compute clusters. Collaborating directly with customers to optimize large-scale training workloads.

Distributed Systems

Kubernetes

Linux

Python

PyTorch

Go

🕒 April 9

PostHog

11 - 50

☁️ SaaS

⚡ Productivity

🏢 Enterprise

SRE role focusing on turning fast-growing systems into predictable, reliable platforms. Join PostHog to build and automate infrastructure.

AWS

Cloud

Kubernetes

Linux

Node.js

Terraform

🕒 April 9

Cresta

51 - 200

☁️ SaaS

🤖 Artificial Intelligence

🏢 Enterprise

Senior Infrastructure Engineer/SRE responsible for building core infrastructure at AI-driven contact center company. Designing tools for developers and ensuring reliability across cloud platforms.

AWS

Azure

Cloud

DNS

EC2

Flux

Kubernetes

Postgres

Python

Terraform

Go

🕒 April 9

Toast

1001 - 5000

☁️ SaaS

🤝 B2B

Senior Software Engineer focusing on Mobile DevOps at Toast, creating innovative solutions for restaurant technology with a strong emphasis on AI tools and developer experience.

Android

Cloud

Gradle

Java

Jenkins

Kotlin

React