Director, Software Engineering – Site Reliability Engineering

🕒 February 19

Affirm

1001 - 5000 employees

Founded 2012

💳 Fintech

👥 B2C

🛍️ eCommerce

💰 Post-IPO Equity on 2021-01

Affirm is a financial technology company that offers a 'Buy Now, Pay Later' service, allowing consumers to make purchases and pay for them over time with flexible payment plans. Affirm eliminates hidden fees and compound interest, providing clear terms and conditions for its users. The company also offers the Affirm Card, a debit card that allows users to request to pay over time for larger purchases or pay in full for smaller ones. Affirm partners with various retailers across multiple categories, including electronics, apparel, and travel, providing customers with the convenience of paying over time at checkout both online and in physical stores. Affirm's services are integrated with Apple Pay, enabling customers to make payments seamlessly from their iPhone or iPad.

📋 Description

• Set the vision and drive execution for Reliability Engineering at Affirm
• Own and coordinate the delivery of high availability for Affirm’s core services, so that we meet our service level standards and the expectations of external partners
• Iterate on and maintain a best-in-industry global incident response and lifecycle program
• Build the software and program management structure needed to perform continual risk management across the entire Affirm system and Engineering organization
• Run a robust development lifecycle that establishes a culture of operational excellence while experimenting and failing fast
• Work with a wide variety of cross-functional partners outside of engineering, including product, enterprise risk, security, legal, and compliance
• Hire and build a global team of SREs, system engineers, and full stack engineers
• Cultivate a respectful and supportive environment for all team members that effectively demonstrates the diversity of the team

🎯 Requirements

• 15+ years of relevant experience in software and site reliability engineering
• Experience leading SRE, systems engineering, and full stack engineering teams
• Successful track record of delivering key outcomes that drive the company’s success
• Comfortable partnering across disciplines and influencing a wide variety of leaders
• World-class communicator with excellent instincts for empathetic messaging
• Keen technical mind, comfortable reading and understanding full-stack code
• Proven track record of establishing and growing teams and retaining talent, and comfort working with ambiguity
• This position requires either equivalent practical experience or a Bachelor’s degree in a related field

🏖️ Benefits

• Health care coverage - Affirm covers all premiums for all levels of coverage for you and your dependents
• Flexible Spending Wallets - generous stipends for spending on Technology, Food, various Lifestyle needs, and family forming expenses
• Time off - competitive vacation and holiday schedules allowing you to take time off to rest and recharge
• ESPP - An employee stock purchase plan enabling you to buy shares of Affirm at a discount
