Senior Deep Learning Software Engineer, Inference

September 17

Apply Now
Logo of NVIDIA

NVIDIA

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

📋 Description

• Performance optimization, analysis, and tuning of DL models in various domains like LLM, Multimodal and Generative AI • Scale performance of DL models across different architectures and types of NVIDIA accelerators • Contribute features and code to NVIDIA’s inference libraries, vLLM and SGLang, FlashInfer and LLM software solutions • Work with cross-collaborative teams across frameworks, NVIDIA libraries and inference optimization innovative solutions • Implement and optimize model serving pipelines using open-source tools and plugins including CUTLASS, OAI Triton, NCCL, and CUDA kernels

🎯 Requirements

• Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI) • 5+ years of relevant software development experience • Excellent C/C++ programming and software design skills • SW Agile skills are helpful and Python experience is a plus • Prior experience with training, deploying or optimizing the inference of DL models in production is a plus • Prior background with performance modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus • Experience with Multi-GPU Communications (NCCL, NVSHMEM) is a plus • Experience building and shipping products to enterprise customers is a plus • GPU programming experience (CUDA, OAI TRITON or CUTLASS)

🏖️ Benefits

• We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility.

Apply Now

Similar Jobs

July 12

Join ClickHouse to build interfaces and dashboards for our cloud platform, ensuring reliability and security.

AWS

Azure

Cloud

Distributed Systems

Google Cloud Platform

JavaScript

Node.js

React

SQL

TypeScript

July 4

As a Fullstack Software Engineer, you'll design and develop AI-driven applications at Dataiku.

Angular

Cloud

Flask

JavaScript

Open Source

Python

React

Vue.js

June 27

As a Tech Lead, determine technical direction and solve complex IT challenges for KPN.

🗣️🇳🇱 Dutch Required

Azure

Informatica

SQL

Vault

.NET

May 1

As Tech Lead, drive the technical direction in a DevOps team at KPN, ensuring robust solutions.

🗣️🇳🇱 Dutch Required

Azure

Informatica

SQL

Vault

.NET

February 11

Join Resato Hydrogen Technology as a Software Engineer in our IoT team, developing critical systems.

🗣️🇳🇱 Dutch Required

Docker

Informatica

IoT

Go

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com