Lead Site Reliability Engineer

November 24

🇨🇴 Colombia – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Apply Now
Logo of Masabi

Masabi

Transport • SaaS

Masabi is a company revolutionizing fare payments and public transportation systems through its Fare Payments-as-a-Service model. It offers the Justride Platform, an enterprise-ready, cloud-native solution designed to facilitate seamless integration with various transit networks. Masabi provides contactless ticketing solutions, enabling smooth, connected journeys while reducing costs for agencies and operators. By leveraging smart card, mobile ticketing, and open payment systems, Masabi enhances the passenger experience, fosters sustainable cities, and powers a connected transit ecosystem that emphasizes Mobility-as-a-Service (MaaS).

201 - 500 employees

Founded 2007

🚗 Transport

☁️ SaaS

💰 Venture Round on 2022-03

📋 Description

• Lead design discussions and make key architectural decisions for reliability, scalability, and performance. • Establish SRE standards and best practices (IaC patterns, CI/CD maturity, observability, etc.) across teams. • Design and manage infrastructure using Terraform and CloudFormation. • Build and evolve CI/CD pipelines that support fast, safe, and frequent deployments. • Automate manual tasks to reduce operational load and enable faster delivery. • Help expand our infrastructure globally, scaling up new environments with care. • Define and maintain SLIs, SLOs, and alerting strategies aligned with user experience. • Implement monitoring solutions that give us clear, early signals during incidents. • Lead capacity planning and performance tuning as our systems and teams grow. • Own incident response, root cause analysis, and post-incident reviews. • Design and maintain disaster recovery and failover strategies. • Mentor others in areas like observability, incident readiness, and infrastructure-as-code. • Document systems and processes clearly to support learning and long-term success.

🎯 Requirements

• 8+ years of experience in DevOps/Site Reliability Engineer roles. • Proven experience designing and evolving production-grade systems for scale and resilience. • Comfortable designing and operating in AWS, with strong knowledge of cloud architecture, networking and security (VPC design, IAM, least privilege). • Hands-on experience with Terraform, infrastructure automation, and CI/CD systems. • Experience with observability, performance, incident command and/or reliability (distributed tracing, log correlation, metrics maturity, etc). • Clear communication and ability to drive cross-functional reliability improvements in distributed, async-first teams. • A commitment to helping others grow in a collaborative engineering culture.

🏖️ Benefits

• 15 days paid vacation for each year plus 18 public holidays • Private Healthcare • Monthly team bonding allowance • Menopause support • Choice of a workstation • Ability to work for up to 3 months per year from any country in the world • Fun and collaborative environment with a focus on making a difference in the world. • Training allowance of up to $750 USD • $250 USD to spend on your home office every year

Apply Now

Similar Jobs

November 20

Solvd, Inc.

501 - 1000

☁️ SaaS

🤝 B2B

🏢 Enterprise

Join Solvd, a growing AI-native firm, as a Mid to Sr Software Engineer with a focus on backend code and DevOps processes.

🇨🇴 Colombia – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 20

Logical Media Group

11 - 50

📱 Media

Cloud DevOps Engineer at Logical Media Group managing cloud infrastructure deployment. Collaborating with development teams for performance tuning, scaling, and cost optimization.

🇨🇴 Colombia – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 12

Sharesource

51 - 200

🎯 Recruiter

🏢 Enterprise

👥 HR Tech

DevOps Engineer ensuring client websites' uptime and performance for a BCorp company. Collaborating with teams to keep client sites online and optimized within a fast-paced agile environment.

🇨🇴 Colombia – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 10

Logical Media Group

11 - 50

📱 Media

Cloud DevOps Engineer responsible for deployment, scaling, and optimizing cloud infrastructure. Join Logical Media Group, a digital marketing agency, for a remote role based in Colombia.

🇨🇴 Colombia – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 9

Gorilla Logic

501 - 1000

☁️ SaaS

🏢 Enterprise

🤖 Artificial Intelligence

Senior DevOps Engineer at Gorilla Logic focusing on CI/CD and cloud infrastructure on Google Cloud Platform. Collaborating within teams to implement best practices and tools for DevOps.

🇨🇴 Colombia – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com