Senior Site Reliability Engineer (SRE) – APAC

Pearster

51 - 200 employees

Founded 2020

Recruitment • B2B • Enterprise

Pearster is a global IT staffing and managed services firm that connects experienced software and infrastructure professionals with companies looking to scale their engineering, data, security, and IT operations teams. It offers staff augmentation, dedicated teams, and managed services, drawing on a talent pool that spans cloud, DevOps, data & AI, product development, networking, and security, with an emphasis on culture fit, time-zone alignment, and transparent pricing.

📋 Description

• Design, implement, and maintain highly available, scalable, and secure cloud infrastructure.
• Monitor production systems and proactively identify reliability, performance, and capacity issues.
• Lead incident response activities, root cause analysis (RCA), and post-incident reviews.
• Develop automation solutions to reduce operational overhead and improve system reliability.
• Build and enhance observability platforms, including monitoring, logging, tracing, and alerting systems.
• Establish and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
• Partner with development teams to improve application resilience, deployment strategies, and operational readiness.
• Support CI/CD pipelines and deployment automation initiatives.
• Participate in on-call rotations and production support activities.
• Drive infrastructure-as-code and platform engineering best practices.
• Ensure compliance with security, governance, and fintech industry standards.
• Contribute to disaster recovery, business continuity, and high-availability strategies.

🎯 Requirements

• Must be located in the APAC region and available to work during Canadian overnight hours.
• Fluent English.
• 5+ years of experience in Site Reliability Engineering, DevOps, Cloud Engineering, Platform Engineering, or related roles.
• Strong experience supporting mission-critical production environments.
• Experience with cloud platforms such as AWS, Azure, or GCP.
• Hands-on expertise with Kubernetes and containerized environments.
• Strong knowledge of Infrastructure as Code (Terraform preferred).
• Experience building and maintaining CI/CD pipelines.
• Proficiency in scripting and automation using Python, Bash, or Go.
• Strong understanding of Linux systems administration.
• Experience with monitoring and observability tools such as Prometheus, Grafana, Datadog, New Relic, Splunk, ELK, or OpenTelemetry.
• Knowledge of networking fundamentals including DNS, load balancing, TLS/SSL, VPNs, and firewalls.
• Experience managing production incidents and conducting root cause analysis.
• Strong communication skills and ability to collaborate across distributed teams.

🏖️ Benefits

• Work from anywhere with true flexibility and freedom.
• Earn in USD with compensation that matches your expertise.
• Recharge confidently with dedicated paid time off.
• Advance your career with fully covered international certifications.
• Access coworking spaces worldwide whenever you want a professional setup.
• Strengthen your English and expand your global reach.
• Connect and have fun with activities that unite our international team.
• Feel appreciated with personalized gifts and a thoughtful welcome kit.
• Grow our community and earn through our referral program.
