Customer Reliability Engineer

September 4

deepset

Artificial Intelligence • SaaS • Enterprise

deepset is a leader in framework and platform technology that accelerates AI application development with large language models (LLMs). As the creator of the deepset AI Platform and the Haystack open-source framework, deepset powers custom AI solutions in production across industries and government, earning recognition as a Gartner Cool Vendor in AI Engineering. Their offerings include building AI agents, retrieval augmented generation, enterprise search, and intelligent document processing, tailored for various sectors such as finance, legal, media, and healthcare.

51 - 200 employees

Founded 2018

💰 Series B on 2023-08

📋 Description

Design & Land
• Own technical outcomes from POC → production: integrations, data connectors, workflows, and infra-as-code (Kubernetes/Terraform/Helm).
• Produce reference architectures and reusable templates; upstream patterns to Product to reduce future “custom” work.
• Unblock enterprise environments: identity (OIDC/SAML), networking, storage, GPU scheduling, observability hooks.

Run & Harden
• Define SLOs/Error Budgets with customers; implement end-to-end observability (logs/metrics/traces) and dashboards.
• Create runbooks/playbooks; lead L3 incident response and RCAs; drive reliability roadmaps to closure.
• Plan/execute upgrades and security patches in change windows; ensure rollback and post-upgrade verification.
• Be an active member of the on-call rotation to make sure we deliver an excellent customer experience.

Partner & Enable
• Train customer teams on operations and emergency procedures; hand off cleanly to Support/CSM.
• Prioritize the reliability and “productization” backlog with Product/Engineering based on field signal.
• Document clearly: setup guides, diagrams, SLOs, testing/DR procedures, and “golden path” standards.

🎯 Requirements

• Hands-on programming experience in Python (needed for improvements, bug fixes, and small feature builds).
• 5+ years across SRE/Platform/Solutions/FDE, with evidence of shipping customer-facing builds and operating production systems.
• Strong with Kubernetes, containers, Linux, IaC (Terraform/Helm), CI/CD, networking (TLS, DNS, ingress/LB), backup/restore.
• Observability stacks (Prometheus/Grafana/OpenTelemetry/ELK); scripting (Python/Bash).
• Enterprise integration experience (SSO, secrets, compliance); confident communicator with execs and engineers under time pressure.
• Must be a resident of the European Union with an EU passport.

🏖️ Benefits

• Remote-first setup with flexible hours & tech of your choice
• 30 days vacation + extra days for family sick leave
• Competitive salary & stock options for every team member
• Monthly sports & mental health support allowance with Oliva
• Annual learning & development budget
• Monthly team socials & in-person meetups
• Dog-friendly Berlin HQ
