Software Engineer – Compute Infrastructure

🕒 2 days ago

Render

11 - 50 employees

Founded 2018

☁️ SaaS

SaaS • Cloud Computing • Web Hosting

Render is a modern cloud platform for developers and teams that simplifies the process of building, deploying, and scaling applications. It offers a range of services including web services, static sites, cron jobs, background workers, PostgreSQL databases, and Redis management. Render's platform is designed for ease of use with features such as automatic deployments, load-based autoscaling, and infrastructure as code. It also provides robust security measures like DDoS protection and data privacy compliance. Render caters to a wide range of development stacks with support for various programming languages and Docker images, providing developers with a seamless environment to manage and scale their infrastructure. With a focus on speed, reliability, and collaboration, Render helps developers efficiently manage their applications and infrastructure from development to production.

📋 Description

• Own Render's core compute infrastructure across multiple cloud providers, regions, and data centers. You'll shape how we evolve our compute platform as we rapidly scale.
• Design and build capabilities that give users greater performance and flexibility in how their services are built, deployed, perform, and stay available even when underlying resources go down.
• Investigate challenging cloud and compute issues across the stack, from the kernel and data plane to our Kubernetes cluster, control plane, and other orchestration mechanisms.
• Improve the performance and reliability of our infrastructure through systematic profiling, experimentation, and tuning.
• Partner with engineers across the company to build a platform that is stable, predictable, and secure.
• Participate in our on-call rotation. Help continuously improve how we detect, respond to, and learn from incidents.

🎯 Requirements

• At least 7 years of experience building and operating large-scale platform or compute infrastructure.
• Deep expertise in operating, scaling, and enhancing Kubernetes clusters or a similar resource or container orchestration system.
• Experience using Go, Rust, or similar languages to develop custom infrastructure components, such as scheduling logic and controllers, that apply business logic to resource management.
• Comfort going broad and deep in complex systems, making tradeoffs to improve performance and efficiency without sacrificing reliability.
• Strong experience designing, debugging, and operating distributed systems.
• Experience planning and executing rapid, high-risk upgrades and changes with minimal downtime to user services.

🏖️ Benefits

• 4 weeks of paid vacation.
• 14 weeks of fully paid parental leave for all parents to bond with a newly born, adopted, or fostered child. We will also work with you to create a supportive plan of return.
• Long-term disability, life insurance, and 401K plans.
• 100% employer-paid medical coverage and 99% employer-paid dental and vision coverage for you and a dependent. FSAs and HSAs are available as well.
• Monthly lifestyle stipend for wellness, mental health and therapy, hobbies, etc.
• Monthly cell phone and internet subsidy.
• Commuter benefits for Renders in the Bay Area, and home office stipends for remote Renders.
• Continuous learning benefits & related support.
