Staff Software Engineer, Site Reliability, SRE

Job not on LinkedIn

August 29

Apply Now
Logo of Optimal Ways

Optimal Ways

eCommerce • Digital Analytics • Training

Optimal Ways is an agency specializing in digital analytics and optimization for eCommerce. Based in Paris and Lille, the company focuses on enhancing the quality of data, analyzing customer journeys, and optimizing conversion rates on eCommerce sites and applications. With a team of experts in digital analytics, they provide strategic support and training to eCommerce teams, helping them maximize their performance and leverage data effectively. Optimal Ways is committed to sustainable eCommerce and is recognized for its responsible business practices through B Corp certification.

2 - 10 employees

Founded 2011

🛍️ eCommerce

📋 Description

• Reliability: Own the company-wide incident lifecycle; standards for detection, escalation, incident command, customer comms, and high-quality postmortems with action tracking. • Define and drive SLIs/SLOs for core services; build guardrails and dashboards that make reliability visible and actionable. • Lead production readiness reviews, capacity/performance planning, load testing, disaster recovery exercises, and resilience engineering (failure testing/chaos where appropriate). • Level-up on-call: right-sizing rotations, paging hygiene, runbooks, auto-remediation, and continuous improvement of MTTA/MTTR. • Security: Embed security into the delivery pipeline: dependency and image scanning, least-privilege/IAM baselines, secrets management, and service-to-service auth. • SOC 2-aligned controls as code; audit-friendly evidence generation in everyday engineering. • Drive secure-by-default patterns in the platform (network posture, data protection, runtime policies). • Platform & DevEx: Build and evolve paved roads for deploys, config, and runtime operations in our monorepo (Bazel) and CI/CD (AWS CodePipeline/CodeBuild). • Partner with product teams to make the secure default the easiest path—templates, tooling, libraries, and automation. • Improve observability end-to-end (traces, logs, metrics, alerts).

🎯 Requirements

• Experienced: Staff-level IC who has led reliability programs at meaningful scale and owned incident response standards. • Technically Grounded: Deep, hands-on experience with infrastructure at scale, cloud, containerization, and more: • AWS (multi-service) • ECS and/or Kubernetes containerization workloads • CICD & IaC (Terraform) • Production Networking/Fundamentals • Python Proficient: You can read/review service code and land operational improvements. • Data Driven: In your approach to SLOs, capacity, performance, and cost efficiency with strong observability chops • Influential: Able to shape direction and create simple, durable standards • Communicative: Excels in both technical and interpersonal communication, with strong written and verbal skills • Nice To Have: FinOps, SOC 2, Data Science/ML collaboration, monorepo frameworks (bazel, buck)

🏖️ Benefits

• Competitive compensation, including Series C level equity • Health / Dental / Vision 100% covered for employee and 50% for dependents • Life Insurance, with optional supplemental insurance • Flexible Spending Account (FSA) • Health Spending Account (HSA) • 401(k) with match • Unlimited PTO (vacation, personal days, sick days, jury duty, military leave, bereavement) • 11 Holidays • Paid Parental Leave for all employees • Short-term and Long-Term Disability Insurances, and AD&D Insurance • Fitness membership reimbursement • Commuter benefits

Apply Now

Similar Jobs

August 20

Syniti

1001 - 5000

🤝 B2B

🏢 Enterprise

Principal SRE for SAP BTP on AWS/Azure; leads runtime architecture and security. Collaborates with SAP and cloud teams to ensure performance and compliance.

🇺🇸 United States – Remote

💰 Private Equity Round on 2017-08

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

August 18

SAPSOL Technologies Inc. : Systems and Process Solutions for your Enterprise

201 - 500

🏢 Enterprise

☁️ SaaS

🤖 Artificial Intelligence

DevOps Architect leading CI/CD for Kubernetes platforms; Jenkins, EKS, Helm, IaC, reliability and security; collaborates with SREs and developers.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

August 12

Agero, Inc.

1001 - 5000

🚗 Transport

Lead AWS cloud DevOps at Agero; guide architecture, automation, and CI/CD. Drive scalable, secure platforms with IaC and cross-functional collaboration.

🇺🇸 United States – Remote

💵 $122.2k - $165k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

June 20

Liatrio

51 - 200

🏢 Enterprise

☁️ SaaS

Consulting firm looking for a technical principal to enhance DevOps delivery and client relationships.

🇺🇸 United States – Remote

💵 $170k - $200k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

May 8

Believe Solutions

51 - 200

🤝 B2B

🏢 Enterprise

☁️ SaaS

Join as a DevOps Engineer to automate deployments and optimize performance in a global infrastructure team.

🇺🇸 United States – Remote

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com