Staff Site Reliability Engineer

October 9

Apply Now
Logo of AlphaSense

AlphaSense

Artificial Intelligence • Finance • Enterprise

AlphaSense is a market intelligence and search platform that empowers companies to unlock critical insights across an extensive universe of public and private content, including company filings, broker research, expert calls, and market trends. Its AI-driven technology affords users the ability to conduct comprehensive due diligence with speed and accuracy, reducing uncertainty and blind spots in decision-making. Trusted by major corporations, financial institutions, and asset management firms globally, AlphaSense serves sectors such as financial services, health care, and technology by providing generative AI-powered solutions that integrate internal proprietary data with external premium content.

1001 - 5000 employees

Founded 2011

🤖 Artificial Intelligence

💸 Finance

🏢 Enterprise

💰 Debt Financing on 2022-06

📋 Description

• Architect Reliability Paved Paths: Build frameworks and self-service tooling that let teams own the reliability of their services in a “You Build It, You Run It” culture. • Lead AI-Driven Reliability: Drive our AIOps strategy — automating diagnostics, remediation, and proactive failure prevention. • Champion Reliability Culture: Embed SRE practices across engineering via design reviews, production readiness, and operational standards. • Incident Leadership: Act as Incident Commander during critical events, modeling operational excellence, and ensuring blameless postmortems lead to lasting improvements. • Advance Observability: Deliver end-to-end monitoring, tracing, and profiling (Prometheus, Grafana, OTEL, Continuous Profiling) to optimize performance proactively. • Mentor & Multiply: Elevate engineers across SRE and product teams through mentorship, technical guidance, and knowledge sharing.

🎯 Requirements

• 8+ years of experience in Site Reliability Engineering, DevOps, or a similar role, with at least 3+ of those years operating in a Senior+ SRE position • Strong background in running production SaaS systems at scale. • Proficiency in at least one programming/scripting language (Python, Go, or similar). • Hands-on expertise with cloud platforms (AWS, GCP, or Azure) and Kubernetes. • Deep understanding of networking fundamentals (TCP/IP, DNS, HTTP/S, load balancing). • Experience with monitoring & alerting (Prometheus, Grafana, Datadog, ELK). • Familiarity with advanced observability (OTEL, continuous profiling). • Proven incident management experience, including leading high-severity incidents and postmortems. • Strong troubleshooting skills across the full stack. • Excellent communication and collaboration skills.

🏖️ Benefits

• You may also be offered equity, and a generous benefits program.

Apply Now

Similar Jobs

October 7

Professional Services DevOps Architect guiding strategic customers’ DevOps journeys at JFrog. Collaborating with various teams to implement CI/CD pipelines and DevSecOps platforms.

Ansible

AWS

Azure

Cloud

Docker

Google Cloud Platform

Java

Jenkins

Kubernetes

Linux

Maven

Open Source

Terraform

October 6

Kin

2 - 10

Staff DevOps Engineer developing infrastructure and delivery solutions at Kin's platform operations team. Enhancing deployment efficiency and supporting secure innovation for engineers.

AWS

Docker

EC2

Redis

Shell Scripting

Terraform

October 3

Principal Site Reliability Engineer at Expel focusing on service reliability through collaboration and coding. Leading projects on platform features and mentoring junior engineers in high-availability systems.

AWS

Cloud

Google Cloud Platform

JavaScript

Kubernetes

Linux

Python

Go

October 1

Principal Site Reliability Engineer at Blue River Technology creating hybrid infrastructure for edge devices and cloud resources. Focused on optimizing performance, cost, and collaboration across teams.

AWS

Cloud

EC2

Jenkins

Kubernetes

Linux

Python

Terraform

Go

September 25

NBCUniversal

10,000+ employees

📱 Media

Lead SAP BTP platform reliability and integrations for NBCUniversal. Manage incidents, offshore teams, deployments, architecture, and governance for finance transformation.

Cloud

SOAP

Go

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com