Site Reliability Engineer – SRE

Job not on LinkedIn

October 2

Apply Now
Logo of OpenFX

OpenFX

Fintech • Banking • eCommerce

OpenFX is a modern financial infrastructure company that specializes in real-time cross-border payments. It provides a comprehensive solution for finance teams, enabling quick and efficient money movement across various global corridors with support for over 40 currency pairs. OpenFX offers features such as instant settlements, transparent pricing, and 24/7 availability, allowing clients like payment service providers and neo banks to optimize their operations without historical banking limitations.

1 - 10 employees

Founded 2024

💳 Fintech

🏦 Banking

🛍️ eCommerce

📋 Description

• Serve as first responder for production incidents during U.S. operating hours (±2h EST). • Lead triage during outages, analyzing logs, metrics, and traces to identify root causes. • Drive incident postmortems and follow-ups to prevent recurrence. • Communicate clearly and quickly during incidents to internal stakeholders. • Own reliability outcomes across all OpenFX systems, with a focus on uptime, latency, and error budgets. • Enhance observability through logging, metrics, alerting, and dashboards. • Optimize on-call processes and ensure smooth handoffs across IST, EST, and PST coverage. • Partner with DevOps and engineering pods to implement fixes or approve production changes. • Proactively identify systemic reliability risks and propose improvements. • Contribute automation and tooling to reduce manual incident handling. • Champion best practices in reliability engineering and operational excellence.

🎯 Requirements

• 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering. • Proven experience leading incident response, running postmortems, and communicating during outages. • Strong background with cloud infrastructure (AWS preferred), container orchestration (Kubernetes, ECS), and Infrastructure-as-Code (Terraform, CloudFormation). • Familiarity with observability stacks (e.g., Prometheus, Grafana, Datadog, ELK, OpenTelemetry). • Ability to triage errors at both the infrastructure and application level, and escalate effectively when deeper intervention is required. • Ownership mindset with strong communication skills in high-pressure situations.

🏖️ Benefits

• Competitive salary and benefits package. • Equity in a rapidly growing company. • Opportunity to work on mission-critical infrastructure in fintech. • A collaborative team culture with a bias toward ownership and outcomes. • The chance to make a direct impact on the resilience of global financial infrastructure.

Apply Now

Similar Jobs

October 2

SDL

1001 - 5000

☁️ SaaS

Senior DevOps Engineer leading cloud infrastructure initiatives at SDL. Designing and implementing Azure and GCP architectures while optimizing CI/CD pipelines and ensuring security compliance.

October 1

Catio

2 - 10

Senior SRE / DevOps Engineer at Catio, building AWS cloud operations and infrastructure-as-code strategy while promoting automation and reliability.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

October 1

The Voleon Group

51 - 200

💸 Finance

🤖 Artificial Intelligence

Senior Cluster Site Reliability Engineer at Voleon scaling research compute clusters using machine learning techniques in finance. Ensuring uptime, reliability, and performance of HPC platforms.

🇺🇸 United States – Remote

💵 $205k - $235k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

October 1

Domyn

51 - 200

🤖 Artificial Intelligence

💳 Fintech

⚕️ Healthcare Insurance

Senior DevOps Engineer at Domyn managing cloud and on-prem infrastructure for enterprise AI. Optimize deployments across GCP, Azure, AWS and ensure security, reliability, and high availability.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

September 30

Mission Box Solutions

11 - 50

👥 HR Tech

🎯 Recruiter

⚕️ Healthcare Insurance

Talent-pool for DevOps-specialist roles at Mission Box Solutions. Connecting veteran-owned recruiting agency candidates with hiring companies across DevOps specializations.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com