Senior DevOps / SRE

🕒 2 days ago

🗣️🇧🇷🇵🇹 Portuguese Required

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of CI&T

CI&T

5001 - 10000 employees

Founded 1995

🤖 Artificial Intelligence

☁️ SaaS

💰 $5.5M Venture Round on 2014-04

Artificial Intelligence • Cloud Services • SaaS

CI&T is a global tech transformation specialist focusing on helping organizations navigate their technology journey. With services spanning from application modernization and cloud solutions to AI-driven data analytics and customer experience, CI&T empowers businesses to accelerate their growth and maximize operational efficiency. The company emphasizes digital product design, strategy consulting, and immersive experiences, ensuring a robust support system for enterprises in various industries.

📋 Description

• Design, implement and evolve CI/CD pipelines for .NET and Next.js applications, ensuring fast, secure and traceable deliveries • Manage and advance container infrastructure with Docker and Kubernetes, including deployment configuration, autoscaling and resource management • Implement and maintain the product observability stack: metrics, logs, traces and operational dashboards • Create and maintain SRE dashboards with visibility into SLIs, SLOs and error budgets • Configure proactive alerts and runbooks for incident response • Collaborate with the development team to define code instrumentation standards (structured logging, distributed tracing) • Work with AWS and infrastructure security best practices • Support the QA team in running automated tests in ephemeral, isolated environments via containers • Contribute to engineering culture: runbook documentation, post-mortems and continuous process improvement

🎯 Requirements

• Strong experience with CI/CD (GitHub Actions, GitLab CI, Azure DevOps or equivalent) • Proficiency with Docker and Kubernetes in production environments (deployments, Services, Ingress, HPA, namespaces) • Experience with AWS — especially EKS, ECR, Secrets Manager, IAM and WAF • Knowledge of observability tools: Datadog, Grafana, Prometheus, OpenTelemetry or similar • Experience building operational dashboards focused on availability, latency, errors and saturation (RED / USE / Four Golden Signals models) • Familiarity with infrastructure as code — Terraform, Pulumi or CDK • Knowledge of databases for monitoring health (connections, slow queries, locks) • Understanding of infrastructure security: secrets rotation, least privilege, network policies • Ability to read and understand .NET / C# and TypeScript / Next.js code to support instrumentation and troubleshooting • Experience with service mesh (Istio, Linkerd) for service-to-service observability • Knowledge of distributed tracing with Jaeger, Tempo or Datadog APM • Experience with incident management and building runbooks and operational playbooks • Experience integrating performance and load testing into pipelines (k6, Gatling) • Experience with multi-tenant environments and isolating observability per client

🏖️ Benefits

• Health and dental insurance • Meal and food allowance • Childcare assistance • Extended parental leave • Partnerships with gyms and health & wellness professionals via Wellhub (Gympass) / TotalPass • Profit Sharing (PLR) • Life insurance • Continuous learning platform (CI&T University) • Discount club • Free online platform dedicated to promoting physical and mental health and well-being • Pregnancy and parental responsibility course • Partnerships with online course platforms • Language learning platform • And many more

Apply Now

Similar Jobs

🕒 2 days ago

Runtalent

501 - 1000

🤝 B2B

👥 HR Tech

☁️ SaaS

Engenheiro Devops - Sênior em trabalho remoto, projetando e gerenciando ambientes de nuvem no Agronegócio.

🗣️🇧🇷🇵🇹 Portuguese Required

Ansible

AWS

Azure

Cloud

DNS

Docker

Google Cloud Platform

Grafana

Jenkins

Kubernetes

Linux

Prometheus

Terraform

🕒 3 days ago

Experian

10,000+ employees

🤖 Artificial Intelligence

🤝 B2B

☁️ SaaS

SRE Specialist managing cloud resources and contributing to automations for productivity enhancement. Collaborating with SRE teams and participating in operational ceremonies in a supportive role.

🗣️🇧🇷🇵🇹 Portuguese Required

AWS

Azure

Cloud

Grafana

JMeter

Kubernetes

Prometheus

Terraform

🕒 3 days ago

WEX

5001 - 10000

🚗 Transport

💸 Finance

💳 Fintech

Senior Site Reliability Engineer leading initiatives for WEX, a global commerce platform. Involves designing scalable systems and mentoring engineers, enhancing service reliability and operational excellence.

AWS

Azure

Cloud

Distributed Systems

Docker

Google Cloud Platform

Grafana

Java

Kubernetes

Python

Splunk

Go

🕒 3 days ago

Tec2Cloud

51 - 200

🤝 B2B

🏢 Enterprise

Specialist in SAP OpenText VIM providing support and evolution for USA operations. Focusing on SAP ECC and S/4HANA environments with global collaboration.

🗣️🇧🇷🇵🇹 Portuguese Required

🕒 3 days ago

Ubiminds

51 - 200

DevOps Engineer focusing on modernizing CI/CD pipelines and building systems for Ubi. Join a team to improve developer productivity and delivery speed through automation.

AWS

Azure

Cloud

Docker

Jenkins

Kubernetes

Python