Senior Site Reliability Engineer

Tilt (formerly Empower)

201 - 500 employees

💳 Fintech

👥 B2C

💸 Finance

Tilt is a financial technology company that provides cash advances, lines of credit, and credit cards aimed at empowering customers financially. With an emphasis on accessibility, Tilt offers flexible repayment plans and eligibility irrespective of a customer's credit history, so individuals can qualify based on their real-time financial habits. The platform is user-friendly, with positive customer experiences and a high rating, making it a convenient option for those seeking to manage their finances and build credit effectively.

📋 Description

• Define the APAC infrastructure architecture. This isn't a maintenance role. You'll design and own the Azure footprint for our APAC businesses end-to-end (compute, networking, storage, identity, containerisation, and application hosting) and set the patterns the region builds on. New regional environments for our Philippines and India businesses are live work from day one.
• Own cross-region connectivity. Take ownership of the network paths connecting our cloud environments to local partners and HQ: site-to-site VPN tunnels, partner IP allow-listing, edge and CDN custom domains, certificate management, and the DNS and TLS hygiene that keeps customer-facing endpoints healthy.
• Drive the migration and hardening programs. Lead infrastructure migrations across clouds safely and at scale, including moving sensitive customer data, and own the APAC side of our PCI-DSS hardening program. These are active, in-flight programs, not future roadmap items.
• Lead incident response for the region. When something breaks during APAC hours, you run the response: calmly, methodically, and all the way through. That means root cause analysis, blameless post-mortems, and durable fixes. The on-call posture is follow-the-sun: APAC business hours, not 3am pages.
• Operate and improve the platform over time. Keep hardening reliability, deploy-pipeline confidence, alert quality, and Azure spend right-sizing. Security and compliance are baked into how we operate; you'll partner with those teams on PCI-DSS scope, vulnerability scanning, least-privilege access, and secret management.
• Raise the bar across the team. Through code, runbooks, architecture decisions, and peer collaboration with our US-hours Platform engineers, your work sets the standard for how infrastructure gets done in the region.
• Work AI-first. Use AI tooling (coding assistants, observability assistants, IaC helpers) to investigate faster, cut toil, and ship more reliable work. Our Platform team is actively reshaping how engineering and ops are done with AI, and we expect you to bring that same instinct.

🎯 Requirements

• 5+ years in SRE, DevOps, or infrastructure roles supporting production systems, with genuine ownership of critical infrastructure, not just contributing to it.
• Strong infrastructure-as-code depth (Terraform: real modules and environments, not one-off scripts) and production cloud experience. Azure is our primary cloud; if you're coming from AWS or GCP, we expect you to ramp quickly. Transferable depth across networking, identity, compute, storage, and observability matters more than years on Azure specifically.
• Hands-on networking: site-to-site VPN, VNet design and peering, DNS, TLS and certificate management, firewall and IP allow-listing.
• You make decisions independently and back them. We're looking for someone who identifies what needs doing and moves on it, not someone who defers every judgment call upward. Own your reasoning and defend your tradeoffs.
• Incident response leadership: you run incidents calmly, find root causes, and follow through to fixes that don't come back.
• Scripting and automation fluency across PowerShell, Bash, Python, and the Azure CLI. Comfortable enough with code to build your own tooling. Familiarity with C# and .NET is a bonus.
• Containers and CI/CD: Docker, GitHub Actions or Azure DevOps, and the habit of automating and tidying as you go.
• Security and compliance awareness: you've worked in a regulated environment and understand PCI-DSS, least-privilege access, and good secret management. Fintech or financial services backgrounds are a genuine plus here.
• You actively use AI tools in your workflow, not just experiment with them. You can describe how.
• Based in APAC or able to reliably work APAC business hours; that's the core reason this role exists.

🏖️ Benefits

• Virtual-first teamwork: The Tilt team is collaborating across 14 countries, 12 time zones, and counting. You’ll get started with a WFH office reimbursement.
• Competitive pay: We're big on potential, and it's reflected in our competitive compensation packages and generous equity.
• Complete support: Find flexible health plans at every premium level, and substantial subsidies that stand up to global standards.
• Visibility is yours: You can count on direct exposure to our leadership team; we’re a team where good ideas travel quickly.
• Paid global onsites: Magic happens IRL: we gather twice yearly to reconnect over shared meals or kayaking adventures. (We’ve visited Vail, San Diego, and Mexico City, to name a few.)
• Impact is recognized: Growth opportunities follow your contributions, not rigid promotion timelines.
