Lead Site Reliability Engineer

November 24

Apply Now
Logo of Masabi

Masabi

Transport • SaaS

Masabi is a company revolutionizing fare payments and public transportation systems through its Fare Payments-as-a-Service model. It offers the Justride Platform, an enterprise-ready, cloud-native solution designed to facilitate seamless integration with various transit networks. Masabi provides contactless ticketing solutions, enabling smooth, connected journeys while reducing costs for agencies and operators. By leveraging smart card, mobile ticketing, and open payment systems, Masabi enhances the passenger experience, fosters sustainable cities, and powers a connected transit ecosystem that emphasizes Mobility-as-a-Service (MaaS).

201 - 500 employees

Founded 2007

🚗 Transport

☁️ SaaS

💰 Venture Round on 2022-03

📋 Description

• Lead design discussions and make key architectural decisions for reliability, scalability, and performance. • Establish SRE standards and best practices (IaC patterns, CI/CD maturity, observability, etc.) across teams. • Design and manage infrastructure using Terraform and CloudFormation. • Build and evolve CI/CD pipelines that support fast, safe, and frequent deployments. • Automate manual tasks to reduce operational load and enable faster delivery. • Help expand our infrastructure globally, scaling up new environments with care. • Define and maintain SLIs, SLOs, and alerting strategies aligned with user experience. • Implement monitoring solutions that give us clear, early signals during incidents. • Lead capacity planning and performance tuning as our systems and teams grow. • Own incident response, root cause analysis, and post-incident reviews. • Design and maintain disaster recovery and failover strategies. • Mentor others in areas like observability, incident readiness, and infrastructure-as-code. • Document systems and processes clearly to support learning and long-term success.

🎯 Requirements

• 8+ years of experience in DevOps/Site Reliability Engineer roles. • Proven experience designing and evolving production-grade systems for scale and resilience. • Comfortable designing and operating in AWS, with strong knowledge of cloud architecture, networking and security (VPC design, IAM, least privilege). • Hands-on experience with Terraform, infrastructure automation, and CI/CD systems. • Experience with observability, performance, incident command and/or reliability (distributed tracing, log correlation, metrics maturity, etc). • Clear communication and ability to drive cross-functional reliability improvements in distributed, async-first teams. • A commitment to helping others grow in a collaborative engineering culture.

🏖️ Benefits

• 15 days paid vacation for each year plus 18 public holidays • Private Healthcare • Monthly team bonding allowance • Menopause support • Choice of a workstation • Ability to work for up to 3 months per year from any country in the world • Fun and collaborative environment with a focus on making a difference in the world. • Training allowance of up to $750 USD • $250 USD to spend on your home office every year

Apply Now

Similar Jobs

November 20

Join Solvd, a growing AI-native firm, as a Mid to Sr Software Engineer with a focus on backend code and DevOps processes.

Azure

Cloud

SQL

Terraform

November 20

Cloud DevOps Engineer at Logical Media Group managing cloud infrastructure deployment. Collaborating with development teams for performance tuning, scaling, and cost optimization.

AWS

Cloud

Docker

Google Cloud Platform

JavaScript

Node.js

Python

November 12

DevOps Engineer ensuring client websites' uptime and performance for a BCorp company. Collaborating with teams to keep client sites online and optimized within a fast-paced agile environment.

Azure

Cyber Security

DNS

Grafana

JavaScript

Kubernetes

Node.js

Sitecore

SQL

Terraform

TypeScript

Vault

.NET

November 10

Cloud DevOps Engineer responsible for deployment, scaling, and optimizing cloud infrastructure. Join Logical Media Group, a digital marketing agency, for a remote role based in Colombia.

AWS

Cloud

Docker

Google Cloud Platform

JavaScript

Node.js

Python

November 9

Senior DevOps Engineer at Gorilla Logic focusing on CI/CD and cloud infrastructure on Google Cloud Platform. Collaborating within teams to implement best practices and tools for DevOps.

Cloud

Docker

Google Cloud Platform

Gradle

Jenkins

Kubernetes

Linux

Maven

Python

Terraform

Yarn

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com