Senior Deep Learning Software Engineer, Inference

September 17

Apply Now
Logo of NVIDIA

NVIDIA

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

📋 Description

• Performance optimization, analysis, and tuning of DL models in various domains like LLM, Multimodal and Generative AI • Scale performance of DL models across different architectures and types of NVIDIA accelerators • Contribute features and code to NVIDIA’s inference libraries, vLLM and SGLang, FlashInfer and LLM software solutions • Work with cross-collaborative teams across frameworks, NVIDIA libraries and inference optimization innovative solutions • Implement and optimize model serving pipelines using open-source tools and plugins including CUTLASS, OAI Triton, NCCL, and CUDA kernels

🎯 Requirements

• Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI) • 5+ years of relevant software development experience • Excellent C/C++ programming and software design skills • SW Agile skills are helpful and Python experience is a plus • Prior experience with training, deploying or optimizing the inference of DL models in production is a plus • Prior background with performance modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus • Experience with Multi-GPU Communications (NCCL, NVSHMEM) is a plus • Experience building and shipping products to enterprise customers is a plus • GPU programming experience (CUDA, OAI TRITON or CUTLASS)

🏖️ Benefits

• We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility.

Apply Now

Similar Jobs

July 12

ClickHouse

51 - 200

☁️ SaaS

🏢 Enterprise

🤖 Artificial Intelligence

Join ClickHouse to build interfaces and dashboards for our cloud platform, ensuring reliability and security.

🇳🇱 Netherlands – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

July 4

DataIQ

11 - 50

🤖 Artificial Intelligence

☁️ SaaS

As a Fullstack Software Engineer, you'll design and develop AI-driven applications at Dataiku.

🇳🇱 Netherlands – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

June 27

KPN

10,000+ employees

📡 Telecommunications

🛍️ eCommerce

🔒 Cybersecurity

As a Tech Lead, determine technical direction and solve complex IT challenges for KPN.

🇳🇱 Netherlands – Remote

💵 €5.7k - €8.9k / month

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer

🗣️🇳🇱 Dutch Required

May 1

KPN

10,000+ employees

📡 Telecommunications

🛍️ eCommerce

🔒 Cybersecurity

As Tech Lead, drive the technical direction in a DevOps team at KPN, ensuring robust solutions.

🇳🇱 Netherlands – Remote

💵 €5.7k - €8.9k / month

⏰ Full Time

🟠 Senior

🧑‍💻 Full-stack Engineer

🗣️🇳🇱 Dutch Required

February 11

Resato Hydrogen Technology

51 - 200

⚡ Energy

🚗 Transport

☁️ SaaS

Join Resato Hydrogen Technology as a Software Engineer in our IoT team, developing critical systems.

🇳🇱 Netherlands – Remote

💵 €3.6k - €5.6k / month

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

🗣️🇳🇱 Dutch Required

Docker

Informatica

IoT

Go

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com