🕒 May 11
🗣️🇧🇷🇵🇹 Portuguese Required
Improve your chances of getting an interview by checking your resume score before you apply.
• Ensure reliability, availability and scalability of systems and services in the Product Areas (PAs) where assigned. • Develop and implement monitoring, observability and alerting solutions integrated with the Agentic Engineering Platform. • Support teams in defining and tracking SLIs, SLOs and error budgets. • Structure and evolve on-call management in the PAs: rotation, escalation, alerting tools and incident management. • Work closely with the Engineering Platform to ensure platform capabilities reach and are adopted by product teams. • Actively contribute to the evolution of the Agentic Engineering Platform by bringing real feedback from PAs about friction points, gaps and opportunities for improvement. • Participate in and influence the building of a reliability-oriented (SRE) engineering culture across the company. • Support migrations of critical systems, environment segregation and deprecation of legacy technologies.
• Experience with cloud environments, preferably GCP. • Proficiency in observability tools and practices (Prometheus, Grafana, Loki, Thanos, Elasticsearch, AlertManager, etc.). • Strong knowledge of Kubernetes and distributed architectures. • Strong knowledge of Infrastructure as Code (IaC) and Terraform. • Hands-on experience with incident management, on-call and post-mortems. • Experience defining and tracking SLOs and error budgets. • Ability to analyze logs and the performance of distributed systems. • Strong communication and influencing skills: ability to advocate technical solutions to diverse audiences — engineers, PMs and leadership. • Data-driven mindset, using data to map risks, prioritize actions and demonstrate impact.
• N/A
Apply Now🕒 May 11
Senior DevOps Engineer at CI&T creating scalable tech solutions and driving innovation in infrastructure. Collaborate with teams to design, build, and optimize cutting-edge solutions.
🇧🇷 Brazil – Remote
💰 $5.5M Venture Round on 2014-04
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
AWS
Cloud
Docker
Kubernetes
Python
Terraform
🕒 May 9
Cloud Engineer responsible for designing, operating, and evolving critical infrastructure for AmorSaúde. Focused on reliability and security in cloud platforms, mainly AWS.
🗣️🇧🇷🇵🇹 Portuguese Required
AWS
Cloud
Docker
Kubernetes
Postgres
Terraform
🕒 May 8
SRE Engineer defining practices and improving the availability and performance of systems. Working with automation and observability strategies in a diverse and inclusive environment.
🗣️🇧🇷🇵🇹 Portuguese Required
Ansible
AWS
Azure
Cloud
Google Cloud Platform
Grafana
Prometheus
Terraform
🕒 May 8
DevOps Analyst responsible for designing and maintaining CI/CD pipelines within Keyrus. Collaborating with teams to enhance cloud environments and implement best practices in security and reliability.
🗣️🇧🇷🇵🇹 Portuguese Required
Ansible
AWS
Azure
Cloud
Docker
Google Cloud Platform
Jenkins
Kubernetes
Linux
Terraform
🕒 May 8
Senior Site Reliability Engineer managing global bare metal infrastructure at Latitude. Responsibilities include monitoring, automation, and collaboration with engineers.
🗣️🇧🇷🇵🇹 Portuguese Required
Linux
Prometheus
Python
Go