Senior Site Reliability Engineer

🕒 April 21

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of RapidSOS

RapidSOS

51 - 200 employees

Founded 2013

🔌 API

💰 $75M Venture Round on 2022-10

Emergency Services • Public Safety • API

RapidSOS is a company that connects data from various devices directly to emergency services like 911 to enhance public safety. They offer a range of products designed to provide real-time incident data to emergency communication centers and field responders, improving the speed and effectiveness of emergency response. Their solutions include call transcription, translation, and connecting safety data from devices, apps, and sensors directly to 911. RapidSOS collaborates with businesses, public safety agencies, and vendors to integrate their API into safety solutions, enabling features like text-to-911 and digital alert systems for schools, rail safety, home security, and more.

📋 Description

• Own performance and reliability outcomes: Ownership of how application-level decisions create system-level impact • Design for system resilience: Responsibility for strengthening reliability through proactive design decisions • Build observability into system behavior: Proactively instrument services with structured logging • Own incidents from signal to resolution: Ownership of production issues from first signal through resolution • Work across the stack without a permission slip: You’ll work across infrastructure-as-code, container orchestration, CI/CD pipelines, and service-level application code

🎯 Requirements

• 5+ years of professional engineering experience with deep expertise in Python • Real cloud infrastructure experience with AWS: networking, managed databases, cost implications of traffic routing decisions, IAM, DNS-based routing and failover • Hands-on kubernetes experience with containerized workloads in production across EKS, ECS, or Fargate • Strong understanding of distributed systems and how they fail • Experience operating high-throughput messaging systems (RabbitMQ, Kafka, AWS SNS / SQS, etc.) • Experience building or improving observability through logging, metrics, and alerting • Demonstrable experience in using AI to safely and securely enhance velocity, improve reliability and recoverability of services • Strong proficiency in coding best practices – ability to write clean, maintainable, and testable code • Demonstrated expertise in problem solving

🏖️ Benefits

• Competitive salary and benefits and equity participation • A dynamic, flexible and fun start-up work environment with a highly talented team

Apply Now

Similar Jobs

🕒 April 21

Mistral AI

11 - 50

Join Mistral AI as a Site Reliability Engineer focusing on optimization and reliability. Collaborate with teams to enhance platform performance and ensure system availability.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 April 21

TrueML

51 - 200

💳 Fintech

💸 Finance

👥 B2C

Senior Manager, DevOps leading infrastructure and platform engineering efforts at TrueML. Focus on cloud architecture and CI/CD standards for machine learning-driven products.

🇺🇸 United States – Remote

💵 $150k - $220k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 April 21

Sweed POS

11 - 50

🛒 Retail

🛍️ eCommerce

🤝 B2B

DevOps Engineer optimizing infrastructure and implementing automation for Sweed's cannabis retail platform. Collaborate with global teams to enhance development and deployment processes.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 April 21

Cyngn

51 - 200

🚗 Transport

☁️ SaaS

🔧 Hardware

Deployment Engineer optimizing autonomy for Cyngn's autonomous robotic systems deployed across North America. Leading on-site deployments and ensuring customer satisfaction in a diverse team environment.

🇺🇸 United States – Remote

💵 $100k - $125k / year

💰 $20M Post-IPO Equity on 2022-04

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

info

🕒 April 20

URBN (Urban Outfitters, Anthropologie Group, Free People & Nuuly)

10,000+ employees

👥 B2C

🛒 Retail

👗 Fashion

Senior DevOps Engineer optimizing cloud infrastructure on GCP for Nuuly. Leading CI/CD initiatives and collaborating with developers to enhance system performance.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)