
Artificial Intelligence • Gaming • Automotive
NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.
September 17

Artificial Intelligence • Gaming • Automotive
NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence. NVIDIA pioneers advancements in graphical processing units (GPUs), cloud computing, data centers, and virtual reality, with a focus on gaming, automotive, healthcare, and robotics industries. The company's innovations, such as NVIDIA Omniverse, transform traditional digital processes by enabling high-fidelity simulations and rendering tasks. Their applications span various industries, from autonomous vehicles using NVIDIA DRIVE to healthcare solutions with NVIDIA Clara, and AI-driven analytics and workflows.
• Performance optimization, analysis, and tuning of DL models in various domains like LLM, Multimodal and Generative AI • Scale performance of DL models across different architectures and types of NVIDIA accelerators • Contribute features and code to NVIDIA’s inference libraries, vLLM and SGLang, FlashInfer and LLM software solutions • Work with cross-collaborative teams across frameworks, NVIDIA libraries and inference optimization innovative solutions • Implement and optimize model serving pipelines using open-source tools and plugins including CUTLASS, OAI Triton, NCCL, and CUDA kernels
• Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI) • 5+ years of relevant software development experience • Excellent C/C++ programming and software design skills • SW Agile skills are helpful and Python experience is a plus • Prior experience with training, deploying or optimizing the inference of DL models in production is a plus • Prior background with performance modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus • Experience with Multi-GPU Communications (NCCL, NVSHMEM) is a plus • Experience building and shipping products to enterprise customers is a plus • GPU programming experience (CUDA, OAI TRITON or CUTLASS)
• We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility.
Apply NowJuly 12
Join ClickHouse to build interfaces and dashboards for our cloud platform, ensuring reliability and security.
AWS
Azure
Cloud
Distributed Systems
Google Cloud Platform
JavaScript
Node.js
React
SQL
TypeScript
July 4
As a Fullstack Software Engineer, you'll design and develop AI-driven applications at Dataiku.
Angular
Cloud
Flask
JavaScript
Open Source
Python
React
Vue.js
June 27
As a Tech Lead, determine technical direction and solve complex IT challenges for KPN.
🗣️🇳🇱 Dutch Required
Azure
Informatica
SQL
Vault
.NET
May 1
As Tech Lead, drive the technical direction in a DevOps team at KPN, ensuring robust solutions.
🗣️🇳🇱 Dutch Required
Azure
Informatica
SQL
Vault
.NET
February 11
Join Resato Hydrogen Technology as a Software Engineer in our IoT team, developing critical systems.
🗣️🇳🇱 Dutch Required
Docker
Informatica
IoT
Go