Senior Software Engineer, AI Inference Systems

🕒 1 month ago

🇵🇱 Poland – Remote

💵 zł292,500 - zł650,000 / year

⏰ Full-time

🟠 Senior

🧑‍💻 Full-Stack Developer

🗣️🇺🇸🇬🇧 English required

NVIDIA

10,000+ employees

Founded 1993

🤖 Artificial Intelligence

🎮 Gaming

Artificial Intelligence • Gaming • Automotive

NVIDIA is a leading technology company specializing in accelerated computing and artificial intelligence (AI). NVIDIA drives advances in graphics processing units (GPUs), cloud computing, data centers, and virtual reality, focusing on industries such as gaming, automotive, healthcare, and robotics. Company innovations such as NVIDIA Omniverse transform traditional digital processes by enabling highly realistic simulation and rendering tasks. Applications span numerous industries, from autonomous vehicles with NVIDIA DRIVE and healthcare solutions with NVIDIA Clara to AI-powered analytics and workflows.

Description

• Contribute features to vLLM that enable the newest models to use the latest NVIDIA GPU hardware features; profile and optimize the inference framework (vLLM) with methods such as speculative decoding, data/tensor/expert/pipeline parallelism, and prefill-decode disaggregation.
• Develop, optimize, and benchmark GPU kernels (hand-tuned and compiler-generated) using techniques such as fusion, autotuning, and memory/layout optimization; build and extend high-level DSLs and compiler infrastructure to boost kernel developer productivity while approaching peak hardware utilization.
• Define and build inference benchmarking methodologies and tools; contribute both new benchmarks and NVIDIA’s submissions to the industry-leading MLPerf Inference benchmarking suite.
• Architect the scheduling and orchestration of containerized large-scale inference deployments on GPU clusters across clouds.
• Conduct and publish original research that pushes the Pareto frontier in the field of ML Systems; survey recent publications and find ways to integrate research ideas and prototypes into NVIDIA’s software products.

🎯 Requirements

• Bachelor’s degree (or equivalent experience) in Computer Science (CS), Computer Engineering (CE), or Software Engineering (SE) with 7+ years of experience; alternatively, a Master’s degree in CS/CE/SE with 5+ years of experience; or a PhD degree with a thesis and top-tier publications in ML Systems, GPU architecture, or high-performance computing.
• Strong programming skills in Python and C/C++; experience with Go or Rust is a plus; solid CS fundamentals: algorithms and data structures, operating systems, computer architecture, parallel programming, distributed systems, and deep learning theory.
• Knowledgeable and passionate about performance engineering in ML frameworks (e.g., PyTorch) and inference engines (e.g., vLLM and SGLang).
• Familiarity with GPU programming and performance: CUDA, memory hierarchy, streams, NCCL; proficiency with profiling/debug tools (e.g., Nsight Systems/Compute).
• Experience with containers and orchestration (Docker, Kubernetes, Slurm); familiarity with Linux namespaces and cgroups.
• Excellent debugging, problem-solving, and communication skills; ability to excel in a fast-paced, multi-functional setting.

Similar Jobs

🕒 1 month ago

Kyriba

501 - 1000

💳 Fintech

🏢 Enterprise

💸 Finance

Senior Software Engineer focused on developing business modules and optimizing code for global fintech company. Collaborating with teams in France and Poland, applying Agile methodologies.

🇵🇱 Poland – Remote

💵 zł233,500 - zł350,350 / year

⏰ Full-time

🟠 Senior

🧑‍💻 Full-Stack Developer

🗣️🇺🇸🇬🇧 English required

🕒 1 month ago

Akamai Technologies

5001 - 10000

🔒 Cybersecurity

Software Engineer responsible for developing reliable software solutions using Akamai products. Focused on building tools utilizing Terraform and CLI while collaborating with other teams and customers.

🇵🇱 Poland – Remote

💰 Post-IPO Equity in 2001-07

⏰ Full-time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-Stack Developer

🗣️🇺🇸🇬🇧 English required

🕒 1 month ago

Akamai Technologies

5001 - 10000

🔒 Cybersecurity

Software Engineer II developing scalable software solutions for network automation at Akamai. Collaborating with global teams and solving complex engineering challenges.

🇵🇱 Poland – Remote

💰 Post-IPO Equity in 2001-07

⏰ Full-time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-Stack Developer

🗣️🇺🇸🇬🇧 English required

🕒 1 month ago

Sigma Software Group

1001 - 5000

🎮 Gaming

📡 Telecommunications

AI-augmented Software Developer designing and optimizing agentic AI systems for Sigma Software, working with advanced AI tools and collaborating with top-tier engineers.

🇵🇱 Poland – Remote

⏰ Full-time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-Stack Developer

🗣️🇺🇸🇬🇧 English required

🕒 1 month ago

Eskimi

201 - 500

📱 Media

🤝 B2B

☁️ SaaS

Software Engineer focused on AI tools and real-time bidding platform at Eskimi. Part of a team tackling complex adtech challenges and delivering business value quickly.

🇵🇱 Poland – Remote

⏰ Full-time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-Stack Developer

🗣️🇺🇸🇬🇧 English required