Senior AI Infrastructure Engineer, Cloud Partnerships

October 24

Apply Now
Logo of NVIDIA

NVIDIA

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

📋 Description

• Architect unified systems for integrating infrastructure provider maintenance events into NVIDIA engineering systems • Drive the adoption of operational excellence best practices across all infrastructure providers, partnering with SRE, infra, product, and security teams • Define and operationalize governance models for engineering support engagements, infrastructure maintenance lifecycles, and incident escalation paths • Measure provider availability against projected maintenance schedules using Service Level Objectives (SLOs) • Collaborate with AI/ML teams to integrate intelligent automation into maintenance workflows, such as projecting job capacity impact based on scheduled resource availability and suggesting infrastructure reallocations for high-profile initiatives • Develop a long-term roadmap to guide infrastructure providers in progressively adopting best practices for reliability and production hygiene across existing and new product introductions

🎯 Requirements

• 8+ years of experience in infrastructure architecture, cloud native, or large-scale platform/reliability roles • Bachelor's degree or equivalent experience • Experience designing scalable, maintainable backend systems and writing clear design documentation • Strong understanding of multiple cloud infrastructure provider resource offerings • Demonstrated experience in normalizing and unifying diverse data sources from a variety of systems into broadly applicable schemas, enabling efficient querying and analysis • Proven ability to lead and influence cross-functional technical initiatives at scale across vendors and external partners, especially in reliability or platform domains • Demonstrated ability to design and implement maintainable APIs for internal and external customers • Proficiency in Kubernetes administration, modern CI/CD techniques and Infrastructure as Code (IaC) • Experience building resilient production systems using Golang, Python or Ruby

🏖️ Benefits

• equity • benefits

Apply Now

Similar Jobs

October 24

AbacusNext

201 - 500

☁️ SaaS

🤝 B2B

Senior Infrastructure Engineer responsible for managing IT infrastructure systems for a remote organization. Ensuring stability, security, and performance across multiple platforms.

🇺🇸 United States – Remote

💵 $130k - $160k / year

⏰ Full Time

🟠 Senior

👷 Infrastructure Engineer

Ansible

Azure

Firewalls

Terraform

VMware

October 23

General Dynamics Information Technology

10,000+ employees

🔒 Cybersecurity

🤖 Artificial Intelligence

Infrastructure Engineer providing system administration and support for CNIC applications at GDIT. Transforming technology into opportunity with focus on innovation and operational improvements.

🇺🇸 United States – Remote

💵 $125.5k - $169.8k / year

⏰ Full Time

🟠 Senior

🔴 Lead

👷 Infrastructure Engineer

🦅 H1B Visa Sponsor

October 23

ValidMind

11 - 50

Infrastructure Engineer managing cloud infrastructure and implementing infrastructure-as-code at ValidMind. Supporting engineering teams with reliable and scalable infrastructure.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

👷 Infrastructure Engineer

October 23

Cobalt AI

51 - 200

🤖 Artificial Intelligence

🔐 Security

🏢 Enterprise

Infrastructure Engineer managing Kubernetes infrastructure for global AI-enabled cameras and Edge Processors. Fostering a positive culture while ensuring security and operational excellence.

🇺🇸 United States – Remote

💵 $160k - $200k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

👷 Infrastructure Engineer

October 23

Tempo

11 - 50

₿ Crypto

💳 Fintech

Build out Tempo’s infrastructure stack for a blockchain design partner focused on stablecoins and payments. Ensure efficient processes and deployment within engineering teams.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

👷 Infrastructure Engineer

🦅 H1B Visa Sponsor

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com