April 11
• Building and leading processes to ensure reliability, availability, scalability, and performance of cloud infrastructure
• Collaborating with different teams to design and implement scalable, secure, and highly available distributed systems
• Incident management, post-mortem analysis, and continuous improvement of ClickHouse services
• Developing software platforms and tools to optimize operational and engineering efficiencies of ClickHouse Cloud
• Bachelor’s or Master’s degree in Computer Science or a related field
• 8 years of experience in Site Reliability Engineering or a related field
• Previous experience using ClickHouse in production
• Hands-on experience with Go and/or Python
• Strong knowledge of cloud computing platforms (AWS, Azure, Google Cloud)
• Excellent understanding of distributed databases and SQL, especially ClickHouse
• Hands-on experience with container orchestration tools like Kubernetes or Docker Swarm
• Strong experience with automation and configuration management tools such as Ansible, Terraform, or Puppet
• Solid problem-solving and production debugging skills
• Passion for efficiency, availability, scalability, and data governance
• Ability to thrive in a fast-paced environment and partner with the business towards shared goals
• High level of responsibility, ownership, and accountability
• Excellent communication and interpersonal skills
• Cash compensation and stock options grant
• Flexible work environment - remote-first work
• Healthcare - employer contributions towards healthcare