Monitoring and Observability Analyst – Sat, Sun, Holidays

Coderio

B2B • Enterprise • Artificial Intelligence

Coderio is a nearshore software development firm that provides on-demand engineering teams, IT staff augmentation, and fully managed software outsourcing to accelerate clients' digital transformation. It delivers front-end, back-end, mobile, cloud, data science, e-commerce, and QA solutions, and operates specialized studios for Data Governance, Machine Learning & AI, and Open Banking. Coderio offers its B2B customers enterprise-level engineering talent that is timezone-aligned, fully vetted, and proficient in English.

📋 Description

• Contribute to the definition of the company's observability strategy, aligned with industry best practices (SRE/DevOps).
• Design and implement end-to-end monitoring solutions.
• Configure alert thresholds (SLIs/SLOs) based on business impact, keeping notification noise to a minimum.
• Develop and maintain informative, visually clear dashboards (e.g., Grafana, Kibana) for real-time visibility.
• Implement and optimize monitoring automation, from agent deployment to automatic alert response (basic to intermediate AIOps).
• Administer and maintain monitoring platforms (updates, patches, cost optimization).
• Create and maintain technical documentation (runbooks, monitoring procedures, service maps).

🎯 Requirements

• Minimum 3 years of experience in Monitoring, IT Operations, or SRE roles.
• Advanced experience with one or more monitoring platforms: Prometheus/Grafana, ELK Stack, New Relic, Datadog, or similar.
• Proficiency in monitoring cloud environments (AWS/Azure/GCP) and containers (Docker, Kubernetes).
• Solid understanding of logging (Fluentd, Logstash, Loki) and distributed tracing (Jaeger, Zipkin, OpenTelemetry).
• Practical experience with scripting languages (e.g., Python, Bash) for task automation and development of custom checks.
• Deep knowledge of Linux operating systems.
• Strong analytical skills: the ability to correlate events and data from multiple sources to identify the root cause of complex problems.
• A proactive mindset: the ability to anticipate problems instead of just reacting to alerts.
• Excellent oral and written communication skills.
• Experience in a collaborative work environment with a DevOps mindset.
• Bachelor's degree in Systems Engineering, Computer Science, or a related field.

🏖️ Benefits

• 100% remote
• Long-term commitment, with autonomy and impact
• Strategic and high-visibility role in a modern engineering culture
• Collaborative international team and strong technical leadership
• Clear path to growth and leadership within Coderio
