May 9
• Collaborate closely with engineering teams to design and develop highly resilient and performant systems at scale. • Use your understanding of distributed systems to identify and resolve low-level challenges quickly. Dive into complex issues such as networking, load balancing, and hardware maintenance and demonstrate your troubleshooting and problem-solving skills. • Identify bottlenecks and repetitive patterns in existing support processes and reduce costs by introducing better automation. • Participate in existing cluster operations, including monitoring cluster health, investigating issues, and resolving bugs. This commitment extends to providing on-call support. • Contribute to our Open-Source repos
• 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role. • Experience with highly distributed systems and databases/data stores. • Proficiency in Python or Go. • Experience with alerting and monitoring tools such as Prometheus. • Strong working knowledge of Linux and containers, Bash and administration tools. • A comprehensive knowledge of Kubernetes. • Readiness to occasionally read code in C++ for reference and better understanding of our internals. • Knowledge of standard algorithms and data structures.
• Flexible work environment - ClickHouse is a distributed company offering remote-first work to all employees • Healthcare - Employer contributions towards your healthcare. • Equity in the company - Every new team member who joins our company receives stock options. • Time off - Flexible time off in the US, generous entitlement in all countries. • A $500 Home office setup if you’re a remote employee. • Employee-driven international mobility - we enable you to relocate internationally if you wish (within certain countries and timelines and subject to role requirements, time zones and work permit considerations)
Apply Now