Staff Software Engineer, Infrastructure

🕒 February 25

🏢🏡 San Francisco – Hybrid

💵 $300k - $430k / year

⏰ Full Time

🔴 Lead

🧑‍💻 Full-stack Engineer

Decagon

11 - 50 employees

Trusted by world-class companies, Decagon is the most advanced AI platform for customer support.

📋 Description

• Design and implement critical infrastructure services with strong SLOs, clear runbooks, and actionable telemetry.
• Partner with research and product teams to architect solutions, set up prototypes, evaluate performance, and scale new features.
• Tune service latencies: optimize networking paths, apply smart caching/queuing, and tune CPU/memory/I/O for tight p95/p99s.
• Evolve CI/CD, golden paths, and self-service tooling to improve developer velocity and safety.
• Support various deployment architectures for customers with robust observability and upgrade paths.
• Lead infrastructure-as-code (Terraform) and GitOps practices; reduce drift with reusable modules and policy-as-code.
• Participate in on-call and drive down toil through automation and elimination of recurring issues.

🎯 Requirements

• 8+ years building and operating production infrastructure at scale.
• Depth in at least one area across Core/Data/AI-ML/Platform/Voice, with curiosity to learn the rest.
• Proven track record meeting high availability and low latency targets (owning SLOs, p95/p99, and load testing).
• Excellent observability chops (OpenTelemetry, Prometheus/Grafana, Datadog) and incident response (PagerDuty, SLO/error budgets).
• Clear written communication and the ability to turn ambiguous requirements into simple, reliable designs.
• Experience being an early backend/platform/infrastructure engineer at another company.
• Strong Kubernetes experience (GKE/EKS/AKS) and experience across multiple cloud providers (GCP, AWS, and Azure).
• Experience with customer-managed deployments.

🏖️ Benefits

• Medical, dental, and vision benefits
• Take-what-you-need vacation policy
• Daily lunches, dinners, and snacks in the office to keep you at your best

Similar Jobs

🕒 February 24

Autodesk

10,000+ employees

📱 Media

Principal Full Stack Engineer on Autodesk's Connected Delivery platform team. Leading development of scalable web applications while guiding software best practices in an agile environment.

🏢🏡 San Francisco – Hybrid

💵 $139k - $249.3k / year

⏰ Full Time

🔴 Lead

🧑‍💻 Full-stack Engineer

AWS

Cloud

JavaScript

Microservices

Next.js

TypeScript

🕒 February 23

Zigsaw

11 - 50 employees

Principal Engineer for a shared compute platform driving innovation and efficiency at Pinterest. Leading technical solutions for AI workloads and distributed systems in a fast-paced environment.

🏢🏡 San Francisco – Hybrid

💵 $242.6k - $499.5k / year

⏰ Full Time

🔴 Lead

🧑‍💻 Full-stack Engineer

Cloud

Distributed Systems

Kubernetes

🕒 February 20

Unify

11 - 50 employees

🤝 B2B

🤖 Artificial Intelligence

☁️ SaaS

Staff Software Engineer focusing on scaling an AI-powered revenue platform for outbound efficiency. Collaborating with founders to establish engineering standards for growth.

Postgres

🕒 February 19

Modern Treasury

51 - 200 employees

💸 Finance

💳 Fintech

☁️ SaaS

Senior Software Engineer at Modern Treasury designing AWS infrastructure and optimizing payment orchestration. Leading technical direction and contributing to payment rails strategy.

AWS

Distributed Systems

Docker

Ruby on Rails

Terraform

🕒 February 18

Sentry

201 - 500 employees

☁️ SaaS

🏢 Enterprise

Staff Software Engineer designing high-scale distributed systems at Sentry. Architecting systems for Issue Workflow and leading technical strategy in a hybrid work environment.

Distributed Systems

Kafka

Postgres

Python

TypeScript