SRE Partner – Affirmative Action Position for Persons with Disabilities

🕒 May 11

🗣️🇧🇷🇵🇹 Portuguese Required

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Jusbrasil

Jusbrasil

201 - 500 employees

Informação que transforma você, a justiça e o mundo.

📋 Description

• Ensure reliability, availability and scalability of systems and services in the Product Areas (PAs) where assigned. • Develop and implement monitoring, observability and alerting solutions integrated with the Agentic Engineering Platform. • Support teams in defining and tracking SLIs, SLOs and error budgets. • Structure and evolve on-call management in the PAs: rotation, escalation, alerting tools and incident management. • Work closely with the Engineering Platform to ensure platform capabilities reach and are adopted by product teams. • Actively contribute to the evolution of the Agentic Engineering Platform by bringing real feedback from PAs about friction points, gaps and opportunities for improvement. • Participate in and influence the building of a reliability-oriented (SRE) engineering culture across the company. • Support migrations of critical systems, environment segregation and deprecation of legacy technologies.

🎯 Requirements

• Experience with cloud environments, preferably GCP. • Proficiency in observability tools and practices (Prometheus, Grafana, Loki, Thanos, Elasticsearch, AlertManager, etc.). • Strong knowledge of Kubernetes and distributed architectures. • Strong knowledge of Infrastructure as Code (IaC) and Terraform. • Hands-on experience with incident management, on-call and post-mortems. • Experience defining and tracking SLOs and error budgets. • Ability to analyze logs and the performance of distributed systems. • Strong communication and influencing skills: ability to advocate technical solutions to diverse audiences — engineers, PMs and leadership. • Data-driven mindset, using data to map risks, prioritize actions and demonstrate impact.

🏖️ Benefits

• N/A

Apply Now

Similar Jobs

🕒 May 11

CI&T

5001 - 10000

🤖 Artificial Intelligence

☁️ SaaS

Senior DevOps Engineer at CI&T creating scalable tech solutions and driving innovation in infrastructure. Collaborate with teams to design, build, and optimize cutting-edge solutions.

AWS

Cloud

Docker

Kubernetes

Python

Terraform

🕒 May 9

AmorSaúde Brasil

5001 - 10000

👥 B2C

🧘 Wellness

Cloud Engineer responsible for designing, operating, and evolving critical infrastructure for AmorSaúde. Focused on reliability and security in cloud platforms, mainly AWS.

🗣️🇧🇷🇵🇹 Portuguese Required

AWS

Cloud

Docker

Kubernetes

Postgres

Terraform

🕒 May 8

Keyrus

1001 - 5000

🤝 B2B

SRE Engineer defining practices and improving the availability and performance of systems. Working with automation and observability strategies in a diverse and inclusive environment.

🗣️🇧🇷🇵🇹 Portuguese Required

Ansible

AWS

Azure

Cloud

Google Cloud Platform

Grafana

Prometheus

Terraform

🕒 May 8

Keyrus

1001 - 5000

🤝 B2B

DevOps Analyst responsible for designing and maintaining CI/CD pipelines within Keyrus. Collaborating with teams to enhance cloud environments and implement best practices in security and reliability.

🗣️🇧🇷🇵🇹 Portuguese Required

Ansible

AWS

Azure

Cloud

Docker

Google Cloud Platform

Jenkins

Kubernetes

Linux

Terraform

🕒 May 8

Review ALL

11 - 50

🎯 Recruiter

🤝 B2B

Senior Site Reliability Engineer managing global bare metal infrastructure at Latitude. Responsibilities include monitoring, automation, and collaboration with engineers.

🗣️🇧🇷🇵🇹 Portuguese Required

Linux

Prometheus

Python

Go