Senior Software Engineer – AI Research Clusters

🕒 April 30

NVIDIA

10,000+ employees

Founded 1993

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphics processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on the gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulation and rendering. Its applications span many industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, as well as AI-driven analytics and workflows.

📋 Description

• Propose and implement engineering solutions to ensure delivery of functional, reliable, secure, and performance-optimal GPU clusters to internal researchers.
• Design, develop, and maintain engineering solutions to understand the pain points of validating, monitoring, and operating GPU clusters at scale.
• Research traditional AIOps and emerging agentic AI, and apply them to further reduce operational toil.
• Participate in on-call support for the systems and platforms built and owned by the team.

🎯 Requirements

• BS/MS in Computer Science, Engineering, or equivalent experience.
• 5+ years in software/platform engineering, including 3+ years in ML infrastructure or distributed systems.
• Experience with the software development lifecycle on Linux-based platforms.
• Strong coding skills in languages such as Python, C++, or Rust.
• Experience with Docker, Kubernetes, GitLab CI, and automated deployments.
• Experience applying AIOps or agentic AI successfully in a production environment.

🏖️ Benefits

• Equity
• Benefits
