pod network

Website LinkedIn All Job Openings

1 - 10 employees

₿ Crypto

🌐 Web 3

Crypto • Web 3 • Software

pod network is a pioneering platform aiming to simplify and accelerate Web3 development. By introducing a unique stake-based programmable layer one, pod focuses on delivering optimal latency and seamless user experience for decentralized applications. The company's ethos centers on providing an infrastructure that enables developers to build real products with a sleek user interface, without the complexities of traditional blockchain technology. With backing from notable investors and a commitment to community engagement, pod network seeks to onboard the next billion users to a more democratic internet.

Site Reliability Engineer – APAC

Job not on LinkedIn

🕒 June 19

🇲🇾 Malaysia – Remote

💵 $100k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Cloud

Distributed Systems

Docker

Grafana

Linux

Prometheus

Python

Rust

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

pod network

Website LinkedIn All Job Openings

1 - 10 employees

₿ Crypto

🌐 Web 3

Crypto • Web 3 • Software

📋 Description

• Monitor the health and performance of the platform • Respond to production incidents and drive them through to resolution • Investigate failures, identify root causes, and coordinate fixes • Ensure issues are detected, understood, and addressed quickly • Identify recurring operational pain points and eliminate them • Improve software, deployment processes, and operational workflows • Participate in incident reviews and help drive preventative improvements • Contribute reliability-focused changes directly to production systems • Design and maintain dashboards, metrics, alerting, and monitoring systems • Improve signal quality while reducing alert fatigue • Build automation and internal tools that make the platform easier to operate • Help establish reliability best practices across the engineering organization

🎯 Requirements

• Strong experience with Linux and cloud infrastructure • Experience operating and supporting production systems • Experience with Docker and containerized environments • Experience with observability and incident-management tools such as Grafana, Prometheus, PagerDuty, or similar • Ability to automate workflows using Rust, Python, Bash, or similar languages • Strong troubleshooting and debugging skills • A high degree of ownership and the ability to make sound decisions independently • Nice to Have: Experience with distributed systems, high-availability, low-latency services, CI/CD systems, deployment automation, designing secure operational workflows and access controls

🏖️ Benefits

• Competitive compensation (~$100k USD/year) • Meaningful token/equity allocation • Real ownership and responsibility from day one • Work from wherever you are within the target timezone range (UTC+7 to UTC+1) • Occasional travel to Europe and elsewhere for team meetups

Apply Now