Staff Software Engineer – Alerting, Observability

October 7

Apply Now
Logo of Cribl

Cribl

SaaS • Cloud

Cribl is a company providing a cloud-based service, allowing users to manage and analyze their data through a web application. The service includes features for user accounts and integration with Google for authentication.

501 - 1000 employees

Founded 2017

☁️ SaaS

📋 Description

• Design and build sophisticated alerting systems that enable proactive monitoring and incident detection across distributed systems • Develop query-based alert rules and expressions using PromQL, SQL, and other query languages to surface meaningful insights • Create intelligent alert routing, deduplication, and correlation mechanisms to reduce noise and improve signal quality • Build scalable backend services for alert evaluation, notification delivery, and alert management workflows • Optimize time-series data storage and query performance for high-volume metrics and telemetry data • Develop intuitive interfaces for alert configuration, visualization, and management using React and modern frontend technologies • Collaborate with cross-functional teams to understand monitoring requirements and deliver comprehensive alerting solutions • Mentor and guide engineers on best practices for observability and alerting architecture

🎯 Requirements

• Strong proficiency in TypeScript/Node.js with a proven track record of building production-grade services • Experience with query languages for metrics and monitoring (PromQL, SQL, or similar) and ability to write complex queries for data analysis • Hands-on experience building or maintaining alerting systems, including rule evaluation engines and notification pipelines • Experience with time-series databases and columnar storage systems (ClickHouse experience is a plus) • Frontend development skills with React and modern JavaScript frameworks for building data visualization and management interfaces • Strong understanding of distributed systems, data structures, and algorithms • Experience with observability concepts including metrics, logs, traces, and their correlation • Ability to work independently with minimal supervision and a track record of learning quickly • Dedication to writing clean, maintainable, and well-tested code • Prometheus ecosystem, including AlertManager • Background in building rule engines or expression evaluation systems • Experience with notification systems and integrations (PagerDuty, Slack, webhooks, etc.) • Familiarity with observability tools like Grafana, ELK stack, or similar solutions • Experience with CI/CD pipelines such as BitBucket, Jenkins, CircleCI, etc. • Understanding of alert fatigue mitigation strategies and intelligent alerting patterns • Experience with high cardinality data and performance optimization • Willingness to speak your mind and share ideas • Appreciation for humor and a love for goats • Comfort working remotely

🏖️ Benefits

• health, dental, vision insurance • short-term disability and life insurance • paid holidays and paid time off • fertility treatment benefit • 401(k) • equity • eligibility for a discretionary company-wide bonus

Apply Now

Similar Jobs

October 7

Toast

1001 - 5000

☁️ SaaS

🤝 B2B

Principal Software Engineer on Orders Cloud Sync Team at Toast. Designing scalable systems and leading projects to improve order processing for restaurants and businesses.

🇺🇸 United States – Remote

💵 $188k - $301k / year

⏰ Full Time

🔴 Lead

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

October 7

NBCUniversal

10,000+ employees

📱 Media

Staff Software Engineer developing SAP Fiori applications for NBCUniversal's transformation project. Responsible for design, development, and maintaining user-centric interfaces.

🇺🇸 United States – Remote

💵 $140k - $180k / year

⏰ Full Time

🔴 Lead

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

October 4

Descript

51 - 200

Staff full-stack Software Engineer at Descript, building and scaling foundational systems for AI-powered audio and video content creation. Collaborating across full stack with critical user functionality.

🇺🇸 United States – Remote

💵 $215k - $270k / year

💰 Corporate Round on 2022-10

⏰ Full Time

🔴 Lead

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

October 3

CGI

10,000+ employees

🏢 Enterprise

🤖 Artificial Intelligence

🔒 Cybersecurity

Staff Software Engineer building AI-native cloud infrastructure for data-intensive products at ReadySet. Join a fully-remote team to tackle enterprise-scale AI system challenges.

🇺🇸 United States – Remote

💵 $190k - $240k / year

⏰ Full Time

🔴 Lead

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

October 3

Diligent Robotics

2 - 10

⚕️ Healthcare Insurance

🤖 Artificial Intelligence

Staff Simulation Software Engineer responsible for designing simulation systems for robotics. Building environments for evaluating AI performance and ensuring robot reliability in real-world settings.

🇺🇸 United States – Remote

💰 $30M Series B on 2022-04

⏰ Full Time

🔴 Lead

🧑‍💻 Full-stack Engineer

🦅 H1B Visa Sponsor

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com