Site Reliability Engineer

Job not on LinkedIn

🔥 0 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Tern

Tern

11 - 50 employees

💳 Fintech

🤝 B2B

💸 Finance

Fintech • B2B • Finance

Tern is a company focused on providing flexible cards and accessible fintech tools designed to increase revenue and streamline processes for businesses and enterprises. They offer a variety of embedded banking solutions such as virtual and physical prepaid cards, bank transfers, cross-border transactions, and compliance support. Tern's platform is equipped with low code/no code solutions, APIs, and intelligent data analytics, making it user-friendly and efficient for companies looking to quickly launch financial products. With a goal to democratize fintech services, Tern aims to make these tools available to a broad audience, breaking down barriers and fostering innovation in the financial technology space.

📋 Description

• Own the migration from Heroku to Google Cloud Platform, architecture, execution, and a cutover that doesn't surprise anyone • Build and maintain the Postgres core, Fivetran pipeline, BigQuery data layer, and Hex reporting infrastructure • Optimize the hot paths that matter most: key backend code paths and our heaviest third-party syncs, so performance holds as volume climbs • Own monitoring, alerting, cost reduction, and proactive scaling: surface problems early, keep spend sane, and stay ahead of growth rather than reacting to it • Lead incident response and write post-mortems that turn an outage into a permanent fix and a smarter team • Set the operational bar across engineering and pull others up to it

🎯 Requirements

• Production reliability ownership: Track record of personally owning production reliability at meaningful scale. Concrete stories of incidents you led, fixed, and prevented from recurring, not just participated in. This is a primary responsibility, not something you've done on the side. • Infrastructure migrations: Real experience owning a cloud migration end to end, not just contributing to one. Fluent in GCP (or a comparable cloud), infrastructure-as-code, and the failure modes of distributed systems. • Observability and proactive operations: You build monitoring and alerting that surfaces problems before users find them. You know what to instrument, what to alert on, and what's just noise. • High agency: You find the highest leverage reliability problems and go fix them without being assigned to them. You don't wait for an outage to justify the work. • AI in your working habits: Specific examples of how AI has made your debugging, automation, or operational workflows faster or more reliable.

🏖️ Benefits

• Own the migration from Heroku to Google Cloud Platform • Build and maintain the Postgres core, Fivetran pipeline, BigQuery data layer, and Hex reporting infrastructure • Optimize the hot paths that matter most: key backend code paths and our heaviest third-party syncs, so performance holds as volume climbs • Own monitoring, alerting, cost reduction, and proactive scaling: surface problems early, keep spend sane, and stay ahead of growth rather than reacting to it • Lead incident response and write post-mortems that turn an outage into a permanent fix and a smarter team • Set the operational bar across engineering and pull others up to it

Apply Now

Similar Jobs

🔥 6 hours ago

The Amatriot Group

201 - 500

🎯 Recruiter

🏛️ Government

🔒 Cybersecurity

DevSecOps Engineer designing and operating enterprise CI/CD pipelines at Amatriot Group. Integrating security/compliance controls and maintaining documentation for large-scale environments.

🔥 11 hours ago

Applied Research Solutions

501 - 1000

🏛️ Government

🔒 Cybersecurity

Senior DevOps Engineer responsible for cloud application administration and integration engineering. Collaborating with cross-functional teams to ensure seamless data flow and architecture.

🔥 13 hours ago

EXL

10,000+ employees

Forward Deployment Engineer responsible for deploying EXLdata.ai in client cloud environments. Collaborating with client teams to ensure successful deployment and adoption.

🇺🇸 United States – Remote

💵 $130k - $150k / year

💰 $2M Venture Round on 2015-01

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🔥 16 hours ago

AuthZed

11 - 50

🔌 API

🔒 Cybersecurity

☁️ SaaS

Site Reliability Engineer responsible for maintaining systems reliability and performance at AuthZed. Collaborate globally while developing scalable infrastructure solutions for a cutting-edge authorization platform.

🔥 16 hours ago

Bellese Technologies

51 - 200

⚕️ Healthcare Insurance

Engineer II, DevOps developing software solutions for healthcare, enhancing public health outcomes and quality patient care.