Senior/Staff Infrastructure, Site Reliability Engineer (SRE)

🕒 May 13

Oscilar

51 - 200 employees

Founded 2021

💳 Fintech

🏦 Banking

📋 Compliance

Oscilar is a risk management platform that focuses on fraud defense, credit underwriting, onboarding risk, and AML compliance for financial institutions. Through its advanced AI Risk Decisioning™, Oscilar enables organizations to make faster and more intelligent risk decisions, monitor customer journeys, and ensure regulatory compliance seamlessly. Oscilar's platform provides comprehensive analytics and proactive detection capabilities, tailored to meet the unique risk needs of banks, fintechs, and credit unions.

📋 Description

• Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes).
• Lead initiatives to improve availability, latency, and performance at scale.
• Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability.
• Define the metrics, alerts, and runbooks that form our observability backbone.
• Run chaos experiments and failure simulations to harden the platform.
• Mentor engineers and set best practices for SRE across the company.

🎯 Requirements

• Proven track record as a senior SRE or Infrastructure Engineer in high-scale environments.
• Expert-level skills in AWS and Infrastructure as Code (Pulumi, Terraform).
• Strong programming ability in Go or Python. We use Go.
• Deep understanding of distributed systems (Kafka, ClickHouse) and microservices architecture.
• Mastery of container orchestration (Kubernetes) and production debugging.
• Strong sense of ownership, and the judgment to balance velocity with reliability.

🏖️ Benefits

• Compensation: Competitive salary and equity packages, including a 401(k) plan
• Flexibility: Remote-first culture; work from anywhere
• Health: 100% employer-covered comprehensive health, dental, and vision insurance with a top-tier plan for you and your dependents (US)
• Balance: Unlimited PTO policy
• Technical: AI-first company; both co-founders are engineers at heart, and over 50% of the company is in Engineering and Product
• Culture: Family-friendly environment; regular team events and offsites
• Development: Unparalleled learning and professional development opportunities
• Impact: Making the internet safer by protecting online transactions
