Senior Incident Manager

November 10

Apply Now
Logo of Databricks

Databricks

Artificial Intelligence • Enterprise • SaaS

Databricks is a data and AI company that provides a unified platform for data engineering, machine learning, and analytics. It focuses on optimizing big data processing and helps organizations leverage Apache Spark to deliver deeper insights and powerful data-driven applications. Databricks also offers robust tools and seamless integration for machine learning operations.

1001 - 5000 employees

Founded 2013

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

💰 $1.6G Series H on 2021-08

📋 Description

• Lead critical incidents — coordinate multi-disciplinary response efforts across Databricks’ cloud-based services to rapidly mitigate impact and restore operations. • Drive technical root cause analysis and Reliability improvements: • collaborate with engineering teams to trace and document underlying causes across distributed systems, services, and data stores. • Summarize key learnings, clearly communicate action items, and ensure that technical and procedural improvements are followed through. • Own communications during incidents — deliver frequent, high-quality updates to internal stakeholders (executives, engineering leadership, support) and compose and publish customer-facing notifications that are accurate, timely, and empathetic. • Mentor and train peers in both incident communication and technical response disciplines to raise the overall quality of Databricks’ incident response.

🎯 Requirements

• 5+ years of experience in incident management, site reliability engineering, or production operations supporting large-scale, cloud-native systems. • Proven ability to lead and coordinate high-severity incidents, including identifying impact, isolating fault domains, and managing multi-team response efforts. • Strong understanding of cloud infrastructure (AWS, Azure, or GCP) — including compute, networking, storage, and observability components. • Deep expertise in log analysis and debugging: • Familiarity with log aggregation and search tools (e.g., Datadog, Elasticsearch, Splunk, Cloud Logging, or OpenTelemetry). • Hands-on experience with observability systems — metrics, logging, and tracing frameworks (Prometheus, Grafana, OpenTelemetry, etc.). • Proficiency in at least one major programming or scripting language (Python, Go, or Bash) for automating diagnostics, data collection, or analysis. • Experience developing and maintaining incident playbooks and communication templates to ensure consistent, timely updates. • Excellent contextual interpretation and writing skills, as well as the ability to effectively summarize and communicate to both technical and business audiences, are required. • BS, Master's or other advanced degree in Computer Science or Computer Engineering, or related Engineering field.

🏖️ Benefits

• At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees.

Apply Now

Similar Jobs

November 10

HighLevel

201 - 500

☁️ SaaS

🤝 B2B

Sr. SEC Reporting Manager at HighLevel overseeing public company readiness and SEC reporting. Leading financial reporting processes and compliance across an innovative, global AI platform.

🇺🇸 United States – Remote

💰 Series A on 2021-11

⏰ Full Time

🟠 Senior

👔 Manager

November 10

Colibri Group

1001 - 5000

📚 Education

🤝 B2B

💸 Finance

Strategy & Corporate Development Manager shaping Colibri's strategic direction through M&A activities and growth projects. Collaborates with senior leaders to deliver high-impact strategic initiatives.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

👔 Manager

November 10

Crest Industries

1001 - 5000

⚡ Energy

Construction Manager Team Leader supervising high voltage construction projects across the US, ensuring quality and safety while managing teams.

🇺🇸 United States – Remote

💵 $102k - $140k / year

⏰ Full Time

🟠 Senior

👔 Manager

November 10

Angi

1001 - 5000

🏪 Marketplace

Senior Manager, Internal Audit managing IT audit efforts at Angi. Leading cross-functional teams to identify risks and drive improvements in controls and operations.

🇺🇸 United States – Remote

💵 $135k - $185k / year

⏰ Full Time

🟠 Senior

👔 Manager

🦅 H1B Visa Sponsor

November 9

PolyNovo Limited

201 - 500

🧬 Biotechnology

🔬 Science

Area Business Manager leading sales efforts for PolyNovo’s Novosorb technology in the advanced wound/burn space. Responsible for revenue growth and managing a team in the territory.

🇺🇸 United States – Remote

💵 $157.5k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

👔 Manager

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com