Distinguished Engineer – GPU Fleet Operations Automation

🕒 January 2

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of NVIDIA

NVIDIA

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.

📋 Description

• Various Architectural Work: define and drive the technical implementation for DGX Cloud operations practice for GPU fleet lifecycle. • Collaborate on Cross Domain Disciplines: drive the technical strategy and awareness for best practices and technical capabilities into DGX Cloud engineering practices. • Accelerate Integration: Guide the technical delivery into DGX Cloud across all delivery environments: enterprise, public cloud, and high security, isolated, sovereign. • Engage Stakeholders: Collaborate with customers, infrastructure providers, and partners to ensure NVIDIA’s solutions set the industry standard for operational excellence. • Full Software and System Lifecycle: From ideation to architecture, design, development, deployment, operations, and full lifecycle management, lead all technical aspects of planning and continuous evolution of large technical scope.

🎯 Requirements

• 15-18+ overall years in technical roles with a focus on operations and automation for cloud infrastructure, platforms, and applications. • 5-10+ years of lead experience • BS/MS or higher or equivalent experience in systems / software engineering, or related engineering fields • Technical proficiency in multi-tenant data center and cloud-native architectures, with bare metal, virtualization, containerization, and higher level abstractions (IaaS, Kubernetes, Slurm), AI/ML platforms and applications. • Shown success delivering high-impact technically complex solutions that achieve high levels of transparency into resource utilization, performance, and operational insights. • Technical Leadership: Ability to synthesize multi-functional needs into architecture and design while guiding internal execution across complementary teams. • Communication and Partnership: Strong collaboration and influence skills, capable of leading engineering engagement, presenting with peers, partners, and working with high performance accelerated computing customers.

🏖️ Benefits

• equity • benefits

Apply Now

Similar Jobs

🕒 December 27, 2025

Obsidian Therapeutics

51 - 200

🧬 Biotechnology

💊 Pharmaceuticals

Associate Director leading IT operations and optimizing Digital Solutions at Obsidian Therapeutics. Delivering secure technology services for clinical-stage biotech teams.

🕒 December 25, 2025

Spherix Global Insights

51 - 200

🧬 Biotechnology

💊 Pharmaceuticals

⚕️ Healthcare Insurance

VP, Operations leading operational workflows and market research initiatives at Spherix. Fostering cross-functional collaboration for efficient delivery and scalable growth.

🕒 December 22, 2025

State of Florida

10,000+ employees

🏛️ Government

📚 Education

QSI Assessor conducting evidence-based assessments for persons with disabilities at the Agency for Persons with Disabilities. Responsible for interviews and assessments to support developmental disability services.

🕒 December 19, 2025

State of Florida

10,000+ employees

🏛️ Government

📚 Education

QSI Assessor conducting evidence-based assessments for the Agency for Persons with Disabilities. Engaging directly with clients and using online systems to record results and referrals.

🕒 December 17, 2025

CertifyOS

51 - 200

⚕️ Healthcare Insurance

☁️ SaaS

📋 Compliance

Operations Analyst managing provider credentials, licenses, and payor enrollment for healthcare efficiency. Conducting research and collaborating across teams to ensure smooth operations.

🇺🇸 United States – Remote

💵 $60k - $80k / year

💰 $14.5M Series A on 2022-09

⏰ Full Time

🟡 Mid-level

🟠 Senior

⚙️ Operations