Database Reliability Engineer

September 10

ClickHouse

SaaS • Enterprise • Artificial Intelligence

ClickHouse is a fast, resource-efficient real-time data warehouse and open-source database designed to deliver high query performance for mission-critical, time-sensitive applications. It is available as a cloud service on AWS, GCP, and Azure, with a "Bring Your Own Cloud" option and a wide range of integrations for diverse tech stacks. ClickHouse is used for real-time analytics, machine learning, business intelligence, and observability, in use cases such as financial services, fraud detection, and gaming analytics. It supports developer-friendly SQL, offers cost-effective storage, and provides an open-source alternative to traditional databases. Companies like Sony, Lyft, Cisco, GitLab, and Twilio use ClickHouse for its scalability, efficiency, and ease of use.

51 - 200 employees

Founded 2016

📋 Description

• Build and lead processes to ensure and improve the reliability, availability, scalability, and performance of ClickHouse Core.
• Collaborate with the Control Plane, Data Plane, Security, Support, and Operations teams to implement ClickHouse best practices for customers.
• Own engineering escalation management and response, investigations, and post-mortem analysis, including running blameless post-mortems, and drive continuous improvement of how ClickHouse is run and optimized in the cloud.
• Improve and create metrics and alerts for ClickHouse to identify and prevent production problems before they affect customers.
• Dig into common customer problems in ClickHouse Core to identify root causes, submit bug fixes and issue reports, and suggest improvements.
• Enhance and refine incident response processes and post-mortem analysis for outages related to ClickHouse Core, and communicate with impacted customers.
• Plan, enable, and drive chaos engineering initiatives across Engineering teams based on internal priorities.
• Manage on-call processes to respond to performance and reliability issues, and establish best practices for escalation coordination.

🎯 Requirements

• Bachelor’s or Master’s degree in Computer Science or a related field.
• At least 5 years of experience in reliability engineering, QA, or customer-facing engineering.
• Previous experience operating ClickHouse or other SQL databases in production.
• Excellent understanding of distributed database internals and SQL; knowledge of ClickHouse in particular is a major plus.
• Scripting experience with Shell or Python, and the ability to read and understand C++ code.
• Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform.
• Strong problem-solving ability and solid production debugging skills.
• Ability to thrive in a fast-paced environment as part of a global team and to partner with the business.
• High level of responsibility, ownership, and accountability.
• Excellent communication skills.

🏖️ Benefits

• Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries.
• Healthcare - Employer contributions towards your healthcare.
• Equity in the company - Every new team member who joins our company receives stock options.
• Time off - Flexible time off in the US, generous entitlement in other countries.
• Home office setup - $500 if you’re a remote employee.
• Global Gatherings - Opportunities to engage with colleagues at company-wide offsites.
