February 14
• Responsible for enabling and ensuring the health, reliability, scalability, and performance of Stack AV’s infrastructure • Contributing to a culture of continuous learning • Providing consultation on architecting for high-availability • Driving the uptime and performance of systems • Building a culture of blameless postmortems and continuous learning • Implementing a standard incident management framework and process • Building observability and alerting for hardware systems • Debugging issues with a methodical approach • Working in a diverse and distributed team • Ensuring required performance and reliability within budget
• Passionate about delivering self-driving (L4) products • Deep experience in fast-paced, rapidly growing tech development environments • Experience building a centralized observability stack • Familiarity with Kubernetes, etcd, and Prometheus • Experience working with hybrid cloud environments • Experience building observability and alerting for hardware systems • Fundamental understanding of Linux OS internals, TCP/IP networking stack, and storage systems • Ability to work in a diverse and distributed team • Desire to achieve required performance and reliability within budget
• Revolutionizing the way businesses transport goods • Creating better and smarter supply chains • Improving business outcomes • Delivering goods faster • Moving the trucking industry forward • Building a culture of inclusion, entrepreneurship, and innovation
Apply NowFebruary 13
501 - 1000
February 13
501 - 1000
February 10
11 - 50
🇺🇸 United States – Remote
💵 $110k - $140k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
🖥 DevOps & Production Engineering
February 10
1001 - 5000