SRE Observability Engineer

November 18

Capital.com

We are on a mission to make the world of finance more accessible, engaging and useful.

501 - 1000 employees

📋 Description

• Designing, implementing, and maintaining robust observability solutions using Prometheus, Grafana, VictoriaMetrics, and related tools.
• Developing and managing logging systems, including the OpenSearch/ELK stack, Fluent Bit/Fluentd, rsyslog, and journald, to ensure efficient log collection and analysis.
• Collaborating with teams to build and maintain automation scripts and playbooks using Ansible to streamline infrastructure management.
• Overseeing the administration of Nginx web servers, Docker containers, and virtualization platforms such as KVM/QEMU (Proxmox).
• Managing source control, CI/CD pipelines, and code deployments with Git and GitLab CI/CD.
• Providing support for network configurations, including VLANs, routing, firewalls, and multicast setups.
• Contributing to incident reporting, root cause analysis, and system documentation.
• Working cross-functionally with multiple teams to ensure seamless coordination and efficient operations.

🎯 Requirements

• 7+ years of IT experience, with a proven track record in system reliability, observability, and infrastructure management.
• Strong expertise in Linux administration (Bash scripting, base utilities, SSH).
• Solid understanding of networking concepts such as VLANs, routing, firewalls, and multicast.
• Hands-on experience with modern observability tools, monitoring frameworks, and logging systems.
• Proficiency with Ansible, Docker, and virtualization technologies.
• Familiarity with Git, GitLab CI/CD, and task management tools like Jira.
• Excellent incident reporting and documentation skills.
• English level B1 or higher, with the ability to communicate effectively in a professional environment.
• Exceptional communication and collaboration skills and a proactive mindset.

🏖️ Benefits

• Competitive Salary: We believe great work deserves great pay! Your skills and talents will be rewarded with a salary that makes you feel valued and motivated.
• Work-Life Harmony: Join a company that genuinely cares about you - because your life outside of work matters just as much as your time on the clock. #LI-Hybrid
• Annual Performance Bonus: Your hard work doesn’t go unnoticed! Celebrate your achievements with a well-deserved annual bonus tied to your performance.
• Generous Time Off: Need a breather? Our annual leave policy lets you recharge and enjoy life outside of work without a worry.
• Employee Referral Program: Love working here? Share the love! Bring your talented friends on board and get rewarded for growing our awesome team.
• Comprehensive Health & Pension Benefits: From medical insurance to pension plans, we’ve got your back. Plus, location-specific benefits and perks!
• Workation Wonderland: Live your digital nomad dreams with 30 extra days to work remotely from anywhere in the world (some restrictions apply). Adventure awaits!
• Volunteer Days: Make a difference! Take two additional paid days each year to support causes you care about and give back to the community.
