Staff Software Engineer – Infrastructure

🕒 January 29

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Rad AI

Rad AI

51 - 200 employees

Founded 2018

🔧 Hardware

💰 $25M Series A on 2021-11

Manufacturing • Hardware • Engineering

Rad AI is a company that specializes in custom manufacturing and processing solutions, particularly in the fabrication of components such as casters and material handling devices. They offer a range of products including adjustable casters and OEM parts designed for various industrial applications. Based in Somerset, Michigan, Rad AI is committed to high-quality engineering and manufacturing services.

📋 Description

• Influence the technical direction for infrastructure and platform capabilities that support our rapidly growing AI product suite. • Architect and evolve our cloud infrastructure (primarily on AWS) across container orchestration (Kubernetes, Elastic Container Service), serverless (e.g., Lambda), virtual machines (e.g., EC2), and data stores to support current and future products. • Work closely with Platform leadership, product engineering, data, and ML teams to design systems that are robust, observable, and compliant in a healthcare environment. • Define and drive infrastructure strategy for the Platform org—partnering with engineering leadership to align roadmaps, set standards, and sequence work for maximum business impact. • Secure networking, identity, and access patterns across environments. • Improve reliability and operational excellence by defining SLOs, SLIs, and error budgets for core platform services. • Leading and participating in blameless post-incident reviews and translating learnings into systemic improvements. • Own observability and monitoring strategy across logging, metrics, and tracing, ensuring we can detect, debug, and prevent issues efficiently. • Mentor and level up engineers across Platform and product teams—reviewing design docs, guiding architecture decisions, and modeling high standards for reliability, security, and maintainability. • Partner with security and compliance stakeholders to ensure our infrastructure and operational practices meet HIPAA and other healthcare requirements. • Advocate for and implement developer experience improvements, such as better CI/CD workflows, faster feedback loops, and tooling that reduces cognitive load for product teams.

🎯 Requirements

• Bring 8+ years of hands-on infrastructure / platform development experience (or equivalent practical experience) in modern, cloud-native environments, with a track record of owning critical systems in production. • Have deep expertise with AWS (preferred) and/or GCP, including core networking, compute, storage, and managed services. • Are highly proficient in at least one programming/scripting language used for infrastructure work (Python preferred). • Extensive experience building tooling and automation for other engineers. • Have strong experience with Kubernetes, containers (Docker), and container orchestration, and understand how to operate these systems reliably at scale. • Are comfortable with Infrastructure as Code (Terraform preferred, Pulumi, or similar) and Git-based workflows. • Possess solid Linux fundamentals and are comfortable debugging issues at the OS, networking, and application layers. • Have demonstrable experience leading complex, cross-team initiatives from design through rollout—communicating tradeoffs, aligning stakeholders, de-risking launches, and measuring impact. • Communicate clearly and empathetically with both technical and non-technical partners, and enjoy mentoring engineers at multiple levels. • Take a data-informed, pragmatic approach to decision-making—balancing ideal architecture with business needs, delivery timelines, and team capacity.

🏖️ Benefits

• Comprehensive Medical, Dental, Vision & Life insurance • HSA (with employer match), FSA, & DCFSA • 401(k) • 11 Paid Company Holidays • Location Flexibility (Remote-first company!) • Flexible PTO policy • Annual company-wide offsite • Periodic team offsites • Annual equipment stipend • For roles based outside the US, your recruiter can share more details

Apply Now

Similar Jobs

🕒 January 29

Timely Recruit Ltd

11 - 50

🎯 Recruiter

⚡ Energy

Senior engineer leading product and technical discovery for Timely's educational scheduling software. Building prototypes and engaging with stakeholders to deliver valuable solutions.

🕒 January 28

BayNova

11 - 50

🤝 B2B

🔒 Cybersecurity

☁️ SaaS

Full Stack Developer responsible for design, development, and support of solutions for federal clients. Collaborating with cross-functional teams in a fully remote environment.

Angular

AWS

Azure

Cloud

Cypress

Django

Java

JavaScript

Linux

Maven

Node.js

NoSQL

Python

React

React Native

Spring

Spring Boot

SpringBoot

SQL

Terraform

Vue.js

Webpack

🕒 January 27

Included Health

1001 - 5000

☁️ SaaS

🤝 B2B

👥 HR Tech

Staff Software Engineer shaping product platform architecture and driving growth initiatives at Included Health. Combining engineering expertise with innovative mindset for impactful contributions.

🕒 January 27

Toast

1001 - 5000

☁️ SaaS

🤝 B2B

Staff Software Engineer on the Orders Cloud Sync Team for Toast. Designing and implementing scalable systems critical to restaurant operations and customer workflows.

Distributed Systems

🕒 January 26

Domino Data Lab

201 - 500

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

Staff Software Engineer developing AI solutions for Domino Data Lab. Designing backend systems for governance features and improving user experience.

Cloud

Docker

GraphQL

Hadoop

Java

Kafka

Kubernetes

Python

Scala

Spark

Go