Senior Site Reliability Engineer

The Leaflet

11 - 50 employees

🔌 API

The Leaflet is an open-source JavaScript library for building mobile-friendly interactive maps. It is lightweight (around 42 KB), designed for simplicity, performance and usability, and provides core mapping features such as tile layers, markers, vector layers, popups, and interaction handlers. Leaflet is highly extensible via a large plugin ecosystem, well-documented, and maintained by a broad community of contributors and organizations.

📋 Description

• Ensure the availability, reliability, and performance of high-traffic Java-based applications in a distributed environment
• Troubleshoot and resolve complex issues across production and non-production environments
• Participate in pre- and post-deployment performance testing and monitoring to continuously improve application performance
• Design, build, and operate agentic AI workflows that automate operational tasks such as alert triage and root cause analysis

🎯 Requirements

• Degree in Computer Science or related field, or equivalent professional experience
• 5+ years in SRE, DevOps, or similar infrastructure roles with experience managing large-scale, high-availability production systems
• 3+ years hands-on experience managing production Kubernetes clusters, including deep understanding of architecture, networking, storage, and security
• Advanced expertise with the Grafana observability stack: dashboards, alerting, visualization, and Grafana Alloy for telemetry collection
• Strong scripting abilities in Python, Bash, or Go, with experience building CI/CD pipelines and deployment automation
• 1+ years of practical experience building or operating AI/LLM-powered tools, agents, or workflows

🏖️ Benefits

• Fully remote position
• Opportunity to work with cutting-edge AI tools
• Collaborative team environment
