Site Reliability Engineer

11 - 50 employees

💳 Fintech

🤝 B2B

💸 Finance

Fintech • B2B • Finance

Tern is a company focused on providing flexible cards and accessible fintech tools designed to increase revenue and streamline processes for businesses and enterprises. They offer a variety of embedded banking solutions such as virtual and physical prepaid cards, bank transfers, cross-border transactions, and compliance support. Tern's platform is equipped with low code/no code solutions, APIs, and intelligent data analytics, making it user-friendly and efficient for companies looking to quickly launch financial products. With a goal to democratize fintech services, Tern aims to make these tools available to a broad audience, breaking down barriers and fostering innovation in the financial technology space.

Site Reliability Engineer

Job not on LinkedIn

🕒 June 4

🇺🇸 United States – Remote

💵 $175k - $200k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

BigQuery

Cloud

Distributed Systems

Google Cloud Platform

Heroku

Postgres

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Tern

11 - 50 employees

💳 Fintech

🤝 B2B

💸 Finance

Fintech • B2B • Finance

📋 Description

• Own the migration from Heroku to Google Cloud Platform, architecture, execution, and a cutover that doesn't surprise anyone • Build and maintain the Postgres core, Fivetran pipeline, BigQuery data layer, and Hex reporting infrastructure • Optimize the hot paths that matter most: key backend code paths and our heaviest third-party syncs, so performance holds as volume climbs • Own monitoring, alerting, cost reduction, and proactive scaling: surface problems early, keep spend sane, and stay ahead of growth rather than reacting to it • Lead incident response and write post-mortems that turn an outage into a permanent fix and a smarter team • Set the operational bar across engineering and pull others up to it

🎯 Requirements

• Production reliability ownership: Track record of personally owning production reliability at meaningful scale. Concrete stories of incidents you led, fixed, and prevented from recurring, not just participated in. This is a primary responsibility, not something you've done on the side. • Infrastructure migrations: Real experience owning a cloud migration end to end, not just contributing to one. Fluent in GCP (or a comparable cloud), infrastructure-as-code, and the failure modes of distributed systems. • Observability and proactive operations: You build monitoring and alerting that surfaces problems before users find them. You know what to instrument, what to alert on, and what's just noise. • High agency: You find the highest leverage reliability problems and go fix them without being assigned to them. You don't wait for an outage to justify the work. • AI in your working habits: Specific examples of how AI has made your debugging, automation, or operational workflows faster or more reliable.

🏖️ Benefits

• Own the migration from Heroku to Google Cloud Platform • Build and maintain the Postgres core, Fivetran pipeline, BigQuery data layer, and Hex reporting infrastructure • Optimize the hot paths that matter most: key backend code paths and our heaviest third-party syncs, so performance holds as volume climbs • Own monitoring, alerting, cost reduction, and proactive scaling: surface problems early, keep spend sane, and stay ahead of growth rather than reacting to it • Lead incident response and write post-mortems that turn an outage into a permanent fix and a smarter team • Set the operational bar across engineering and pull others up to it

Apply Now

Similar Jobs

Senior Site Reliability Engineer

🕒 June 3

AuthZed

11 - 50

🔌 API

🔒 Cybersecurity

☁️ SaaS

Site Reliability Engineer responsible for maintaining systems reliability and performance at AuthZed. Collaborate globally while developing scalable infrastructure solutions for a cutting-edge authorization platform.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Cloud

Docker

Grafana

Java

Kubernetes

Node.js

Prometheus

Python

Ruby

SQL

Terraform

Senior Site Reliability Engineer

🕒 June 3

Amwell

501 - 1000

Senior Systems Engineer managing cloud and on-prem infrastructure at Amwell. Enhancing operational efficiency through automation and system management tools.

🇺🇸 United States – Remote

💵 $129.3k - $140k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Ansible

AWS

Azure

Cloud

ElasticSearch

Google Cloud Platform

Linux

Logstash

Puppet

Python

TCP/IP

Terraform

Senior DevSecOps Engineer – Tech Lead

🕒 June 3

Real

51 - 200

🏠 Real Estate

🛍️ eCommerce

💳 Fintech

Senior DevSecOps Engineer leading DevOps & Security team at Real. Managing AWS, Kubernetes, and infrastructure as code with a focus on security practices.

🇺🇸 United States – Remote

💵 $184k - $230k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

AWS

Cloud

Java

Kubernetes

Terraform

Senior DevOps Engineer, Applications

🕒 June 3

Endeavor

5001 - 10000

📱 Media

⚽ Sports

Senior DevOps Engineer for WME building Azure infrastructure and improving system reliability in entertainment industry. Leading cloud resources, Kubernetes deployments, and CI/CD pipelines.

🇺🇸 United States – Remote

💵 $131.3k - $175k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Azure

Docker

Grafana

Kubernetes

Microservices

Postgres

Terraform

Vault

DevOps Architect

🕒 June 3

o9 Solutions, Inc.

1001 - 5000