Infrastructure Engineer

August 8

Apply Now
Logo of Roboflow

Roboflow

Artificial Intelligence • Software • Computer Vision

Roboflow is a comprehensive platform designed for developers to build and deploy computer vision applications. Offering tools for image annotation, model training, and deployment, it supports a collaborative workflow optimized for speed and efficiency. With over 16,000 organizations using Roboflow, including more than half of the Fortune 100, the platform provides a streamlined process for creating datasets, training models, and integrating AI solutions across various industries.

📋 Description

• As a member of our infrastructure team, you'll be at the heart of a fast-paced startup environment. • Your primary focus will be on striking the right balance between rapid delivery, high reliability, and robust security. • This isn't a traditional, siloed role; you'll need to wear many hats—acting as an infrastructure engineer one moment, and a developer, or even a security analyst. • You will be securing, scaling, and maintaining the core infrastructure that powers our product. • This includes our cloud architecture, databases, file storage, search clusters, microservices, and machine learning pipelines. • You'll work closely with our product team and collaborate across the company on product, operations, and customer-facing projects, constantly context-switching to solve the next critical challenge. • In this role, you will: • Design, secure, and maintain cloud infrastructure powering production SaaS and ML workloads across AWS and/or GCP • Build and operate scalable, containerized applications using Kubernetes, Helm, and Docker • Develop and manage infrastructure-as-code solutions using Terraform, Bash, and Python • Work directly with customers and internal teams to meet security, compliance, and reliability requirements (SOC 2, HIPAA, GDPR) • Improve observability, reliability, and on-call processes, including SLO/SLAs and incident response • Automate CI/CD workflows with tools like GitHub Actions and Spacelift • Contribute code (Python, Node.js) to product features and platform infrastructure • Identify and act on cost-optimization opportunities across the tech stack

🎯 Requirements

• Experience: 5+ years of hands-on infrastructure or DevOps engineering experience, ideally in fast-paced or startup environments • Cloud & Containers: Strong experience with AWS or GCP, Kubernetes in production, Docker, and Helm • Infrastructure as Code: Proficient with Terraform, scripting (e.g. Bash), and Python for automation • Software Development: Comfortable reading and contributing to application code (Node.js, Python) • Security-Minded: Familiar with security best practices and compliance standards (SOC 2, HIPAA, etc.) in cloud-native environments • Startup Versatility: Thrives in high-ownership environments where priorities shift quickly; able to balance speed with long-term reliability • Collaborative Communicator: Experience working cross-functionally with developers, product teams, and customers • Preferred Background: Experience in early- to mid-stage startups, especially those with AI/ML infrastructure or SaaS platforms

🏖️ Benefits

• $4000/yr Travel Stipend to travel anywhere anytime to work alongside other Roboflowers • $350/mo Productivity stipend to spend on things that make your work environment more productive, like high-speed internet at home or a co-working space • Cover up to 100% of your health insurance costs for you and your partner or family • Remote first/flexible schedule allowing you to work collaboratively with other team members and asynchronously • Unlimited PTO- with an annual 2 week minimum, we encourage you to take time off for yourself • 12 weeks parental leave

Apply Now

Similar Jobs

August 8

Join LanceDB to engineer AI infrastructure for multimodal applications. Drive feature engineering for scalable AI projects.

Airflow

Apache

AWS

Azure

Cloud

Docker

EC2

Google Cloud Platform

Grafana

HBase

HDFS

Kafka

Kubernetes

Open Source

Pandas

Prometheus

Python

PyTorch

Ray

Rust

Spark

Tensorflow

Terraform

August 7

Partner with engineering teams to create reliable infrastructure that transforms data into insights for mineral exploration.

AWS

Cloud

Distributed Systems

Docker

Java

JavaScript

Kubernetes

Node.js

Python

Terraform

Go

August 7

Looking for a Senior Infrastructure Engineer to build scalable architecture for AI systems at Meshy.

AWS

Cloud

EC2

Grafana

Kubernetes

Prometheus

Python

Go

August 4

Vultr is seeking a Data Center Infrastructure Architect to design and implement networking and server infrastructures.

Cloud

Linux

July 10

Join NVIDIA to enhance AI infrastructure. Work on automation and management of machine learning systems.

Cloud

Distributed Systems

Kubernetes

Linux

Python

Go

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com