Senior Software Architect – Deep Learning, HPC Communications

🔥 21 minutes ago

NVIDIA

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphics processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on the gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, as well as AI-driven analytics and workflows.

📋 Description

• Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems.
• Design and implement new communication technologies to accelerate AI and HPC workloads.
• Explore innovative solutions in HW and SW for our next-generation platforms as part of co-design efforts involving GPU, Networking, and SW architects.
• Build proofs-of-concept, conduct experiments, and perform quantitative modeling to evaluate and drive new innovations.
• Use simulation to explore the performance of large GPU clusters (think scales of hundreds of thousands of GPUs).

🎯 Requirements

• M.S./Ph.D. degree in CS/CE or equivalent experience.
• 12+ years of relevant experience.
• Excellent C/C++ programming and debugging skills.
• Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC).
• Deep understanding of operating systems and of computer and system architecture.
• Solid grasp of the fundamentals of network architecture, topology, algorithms, and communication scaling relevant to AI and HPC workloads.
• Strong experience with Linux.
• Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.

🏖️ Benefits

• Equity
• Benefits
