Senior Engineer: Kubernetes Infrastructure

September 29, 2023

CoreWeave

CoreWeave is a specialized cloud provider, delivering a massive range of GPU compute resources on demand and at scale.

Cloud • Kubernetes • VFX Rendering • Deadline • Maya

11 - 50

💰 $100M Debt Financing on 2022-12

Description

• An engineering practice is only as healthy as its foundational dependencies and CoreWeave’s Kubernetes Infrastructure Team supports the platform and tools that underpin nearly every part of the cloud. Responsible for our internal Kubernetes-on-metal clusters in each datacenter, engineers on this team have the mission to manage and scale Kubernetes in one of one of the fastest growing clouds in the world. The domain of bare-metal day-0+ reliability engineering offers unique and rewarding challenges in orchestration, fleet operations, testing, observability and automation and every team member will have opportunities to develop their skills with Kuberenetes in an environment unique to being a cloud-builder, not just a cloud-consumer. • We are seeking a Senior Engineer to join the Kubernetes Infrastructure team and help us grow our orchestration platforms in scale, reliability, and featureset. This individual will join a team of 4-6 mixed-skill engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. • As a member of the Kubernetes Infrastructure Team, you would have the opportunity to: - Design and implement solutions to fascinating problems of scale for provisioning and managing (many) bare-metal Kubernetes clusters in a hands-free, growing environment. - Develop a toolchain and program for testing and developing against a complex cloud environment at a scale that remains agile. - Create custom Kubernetes interfaces, gateways, and orchestrators all managed using Gitops tools such as Argo CD and Helm. - Improve the performance, security, and reliability of our internal Kubernetes platforms and participate in the Kubernetes Infrastructure on-call rotation. - Build dashboards, alerts, and insights into the customer experience using Grafana and Prometheus ecosystem tools. - Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.

Requirements

• You have four or more years of experience in a software or infrastructure engineering industry • You have experience operating services in production and at scale. • You have some experience using Kubernetes with a conceptual understanding of its major components and/or have administered unmanaged (eg, not EKS/GKE) Kubernetes clusters with some form of automation such as KubeSpray. • You’re comfortable with the idea of using Go as your primary programming language. • You know your way around a Linux distro, shell scripting, and/or the Linux storage and networking stacks. • You’re interested in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role observability, and fault-tolerant design. • You can transform problems in elastic architectures, decompose them into achievable tasks, and socialize both to your teammates. • You’re excited about being part of a team of diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.

Benefits

• Medical, dental and vision insurance - 100% paid for the employee • Life Insurance • Short and long-term disability insurance • Flexible Spending Account • Flexible, full-service childcare support with Kinside • 401(k) with a generous employer match • Flexible PTO • Catered lunch each day in our offices • Weekly massages in NJ office • A casual work environment • Work culture focused on innovative disruption

Apply Now

Similar Jobs

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com
Jobs by Title
Remote Account Executive jobsRemote Accounting, Payroll & Financial Planning jobsRemote Administration jobsRemote Android Engineer jobsRemote Backend Engineer jobsRemote Business Operations & Strategy jobsRemote Chief of Staff jobsRemote Compliance jobsRemote Content Marketing jobsRemote Content Writer jobsRemote Copywriter jobsRemote Customer Success jobsRemote Customer Support jobsRemote Data Analyst jobsRemote Data Engineer jobsRemote Data Scientist jobsRemote DevOps jobsRemote Ecommerce jobsRemote Engineering Manager jobsRemote Executive Assistant jobsRemote Full-stack Engineer jobsRemote Frontend Engineer jobsRemote Game Engineer jobsRemote Graphics Designer jobsRemote Growth Marketing jobsRemote Hardware Engineer jobsRemote Human Resources jobsRemote iOS Engineer jobsRemote Infrastructure Engineer jobsRemote IT Support jobsRemote Legal jobsRemote Machine Learning Engineer jobsRemote Marketing jobsRemote Operations jobsRemote Performance Marketing jobsRemote Product Analyst jobsRemote Product Designer jobsRemote Product Manager jobsRemote Project & Program Management jobsRemote Product Marketing jobsRemote QA Engineer jobsRemote SDET jobsRemote Recruitment jobsRemote Risk jobsRemote Sales jobsRemote Scrum Master + Agile Coach jobsRemote Security Engineer jobsRemote SEO Marketing jobsRemote Social Media & Community jobsRemote Software Engineer jobsRemote Solutions Engineer jobsRemote Support Engineer jobsRemote Technical Writer jobsRemote Technical Product Manager jobsRemote User Researcher jobs