Staff Backend Engineer – Databases

🕒 April 27

Grafana Labs

501 - 1000 employees

Founded 2014

🏢 Enterprise

☁️ SaaS

🤖 Artificial Intelligence

Grafana Labs specializes in open-source observability technologies and solutions. It offers a suite of tools for logging, metrics, tracing, and profiling, with products like Grafana, Loki, Tempo, and Mimir. Its offerings help businesses visualize, monitor, and alert on data from various sources, with capabilities such as anomaly detection, root cause analysis, and service level objective management using AI/ML insights. Grafana Labs provides both cloud-based and self-managed solutions for infrastructure, application, and frontend observability. Its platform also integrates with data sources like Prometheus and OpenTelemetry, making it a key player in the observability and infrastructure monitoring space.

📋 Description

• Lead multi-quarter technical initiatives from problem framing through rollout, e.g., trace aggregation APIs, Limitless Tempo, autoscaling cells and customer limits, or query engine improvements.
• Own the architecture of core Tempo components: ingestion, storage, query, and metrics generation. Drive design reviews, make sharp trade-offs on performance, cost, and complexity, and document the “why” for the team.
• Design APIs for humans and agents. Shape the next generation of Tempo’s interfaces (structured, deterministic, discoverable) so that Act 3 products, LLM-driven assistants, and external integrators can build on Tempo reliably.
• Own outcomes against concrete SLOs (P99 write latency, incident recurrence, TCO per ingested GB) and push the team toward Zero Ops through automation, parameterized rollouts, and actionable alerts.
• Work closely with PMs and with App Observability, Asserts, Drilldown, and Grafana Assistant teams to understand how Tempo gets consumed and to ship what unblocks them.
• Raise the engineering bar through code review, design feedback, pairing on hard problems, and writing that leaves the team smarter than you found it.
• Participate in on-call for the services you help build, and be a force multiplier in incident response and post-incident learning.
• Tempo is OSS. You will engage the community, review external contributions, and help steer the project in the open.

🎯 Requirements

• A track record of leading complex, multi-quarter initiatives that spanned design, delivery, and operations, and made the teams around you better.
• Substantial hands-on experience building and operating distributed data systems in production: ingestion pipelines, storage engines, query execution, or similar.
• You write clean, robust, performant software that others can maintain, and you know when to optimize vs. when to ship.
• We write Tempo in Go. Deep experience in other systems languages (Rust, C, C++) translates well.
• You’ve owned production services, carried a pager, reduced toil, and treated SLOs as a product feature, not a chore.
• You break complex problems into short feedback loops: analyze, design, deliver an MVP, learn, iterate.
• You lead through design docs, reviews, and shipped code, not hierarchy. You communicate clearly in a fully remote, asynchronous environment.

🏖️ Benefits

• 100% Remote, Global Culture
• Scaling Organization
• Transparent Communication
• Innovation-Driven
• Open Source Roots
• Empowered Teams
• Career Growth Pathways
• Approachable Leadership
• Passionate People
• In-Person Onboarding
• Balance is Key
