Site Reliability Engineer

6 days ago

Ciklum

Artificial Intelligence • B2B • Enterprise

Ciklum is a global digital engineering and AI-enabled product and platform services company that helps enterprises design, build, and scale AI-infused software, cloud, data, and automation solutions. It combines UX and product design with engineering, DevOps, data engineering, responsible AI, and edge/IoT capabilities to move pilots into production and deliver enterprise-ready outcomes across industries such as banking, retail, healthcare, hi-tech, automotive, and travel. Ciklum emphasizes platform-agnostic, scalable solutions—covering AI incubators, conversational AI, agentic automation, cloud and edge services, XR/AR/VR, and digital assurance—focused on transforming workflows and customer experiences for B2B enterprise clients.

📋 Description

• You will report to an Engineering Manager in DevOps
• As a Site Reliability Engineer, you will combine software and systems engineering to build and run large-scale, distributed, fault-tolerant systems
• Your primary goal will be to solve operational problems with a software engineering mindset, treating operations as a software problem and automating away toil
• Define, measure, and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs) in partnership with product and engineering teams
• Champion and help manage error budgets to guide the balance between reliability work and new feature velocity
• Participate as a partner in product roadmap planning, sprint planning, and stand-ups
• Scale systems sustainably through automation; evolve systems by pushing for changes that improve reliability and velocity
• Increase visibility into the health and durability of the platform
• Practice sustainable incident response and blameless postmortems
• Maintain existing services and tools, augmenting and replacing
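
For candidates less familiar with the SLO and error-budget responsibilities listed above, the arithmetic behind an error budget can be sketched in a few lines of Python. This is a minimal illustration, not a description of Ciklum's tooling: it assumes a request-based SLI (good requests divided by total requests), and the names `ErrorBudgetStatus` and `error_budget_status` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErrorBudgetStatus:
    """Hypothetical container for the result of an error-budget check."""
    sli: float                 # observed fraction of successful requests
    allowed_failures: float    # failures the SLO tolerates in this window
    remaining_failures: float  # budget left (negative means overspent)
    consumed_fraction: float   # share of the budget already used


def error_budget_status(slo_target: float,
                        total_requests: int,
                        failed_requests: int) -> ErrorBudgetStatus:
    """Compute error-budget usage for a request-based SLO (assumed model)."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests < 0 or not 0 <= failed_requests <= total_requests:
        raise ValueError("request counts are inconsistent")

    # The error budget is the share of requests the SLO allows to fail.
    allowed_failures = (1.0 - slo_target) * total_requests

    # With no traffic, treat the SLI as fully met and the budget as unused.
    if total_requests == 0:
        sli = 1.0
    else:
        sli = (total_requests - failed_requests) / total_requests

    if allowed_failures > 0:
        consumed_fraction = failed_requests / allowed_failures
    else:
        consumed_fraction = 0.0

    return ErrorBudgetStatus(
        sli=sli,
        allowed_failures=allowed_failures,
        remaining_failures=allowed_failures - failed_requests,
        consumed_fraction=consumed_fraction,
    )
```

Under these assumptions, a 99.9% target over 1,000,000 requests tolerates roughly 1,000 failures, so 400 failed requests would consume about 40% of the budget. A team might use a figure like this to decide whether to prioritize reliability work or feature delivery.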

🎯 Requirements

• 2+ years as a Site Reliability Engineer or Software Developer
• Advanced experience with programming/scripting languages such as JavaScript/Node.js, Python, or Bash
• Knowledge of Linux monitoring, troubleshooting, and administration
• Competence in at least one programming language
• Ability to write and evaluate code for scalability and runtime performance
• Experience with container orchestration platforms such as Kubernetes or Nomad
• Experience with monitoring, APM, and logging tooling (e.g., ELK, Grafana, Datadog, New Relic, or Splunk)
• Experience working with at least one DBMS (e.g., Postgres, MySQL, Oracle, or MongoDB)
• Experience with configuration management tools (e.g., Ansible, Puppet, Chef, or Salt). Ansible Tower or AWX is a plus
• Experience with Infrastructure-as-Code tools such as Terraform, CloudFormation, Google Deployment Manager, or Azure Resource Manager
• Experience working with at least one major cloud provider (AWS, Azure, or GCP)
• Understanding of cloud-native security requirements (e.g., WAF, security groups)

🏖️ Benefits

• Strong community: Work alongside top professionals in a friendly, open-door environment
• Growth focus: Take on large-scale projects with a global impact and expand your expertise
• Tailored learning: Boost your skills with internal events (meetups, conferences, workshops), Udemy access, language courses, and company-paid certifications
• Endless opportunities: Explore diverse domains through internal mobility, finding the best fit to gain hands-on experience with cutting-edge technologies
• Flexibility: Enjoy radical flexibility – work remotely or from an office, your choice
• Care: We’ve got you covered with company-paid medical insurance, mental health support, and financial & legal consultations
