
AI ⢠Enterprise ⢠SaaS
Nebius Group is building one of the worldâs leading AI infrastructure companies, focusing on providing the necessary compute, storage, and tools for developers in the AI space. Based in Europe and listed on Nasdaq, Nebius has a global presence with R&D centers across Europe, North America, and Israel. The company's primary offering is an AI-centric cloud platform designed for intensive AI workloads, complemented by various other businesses involved in generative AI development, edtech, and autonomous technology.
July 15

AI ⢠Enterprise ⢠SaaS
Nebius Group is building one of the worldâs leading AI infrastructure companies, focusing on providing the necessary compute, storage, and tools for developers in the AI space. Based in Europe and listed on Nasdaq, Nebius has a global presence with R&D centers across Europe, North America, and Israel. The company's primary offering is an AI-centric cloud platform designed for intensive AI workloads, complemented by various other businesses involved in generative AI development, edtech, and autonomous technology.
⢠Develop and optimize low-level kernels and runtime components for AI inference. ⢠Improve performance of inference engines GPU platforms. ⢠Profile and debug system-level and hardware-level performance issues. ⢠Integrate support for new hardware architectures (Hopper, Blackwell, Rubin). ⢠Collaborate with ML and backend teams to optimize end-to-end execution.
⢠Strong proficiency in C++, OR expertise in GPU programming with a focus on low-level high-performance coding and memory management. ⢠Experience in GPU programming or systems-level software development, e.g. operating system internals, kernel modules, or device drivers. ⢠Hands-on experience with profiling and debugging tools to identify performance issues on both CPUs and GPUs, and the ability to optimize code based on those findings. ⢠Solid understanding of CPU/GPU architecture and memory hierarchy. ⢠Experience with GPU computing programming: CUDA, ROCm, CUTLASS, Cute, ThunderKittens, Triton, Pallas, Mosaic GPU. ⢠Familiarity with ML inference runtimes (e.g. TensorRT, TVM). ⢠Knowledge of Linux internals, drivers, or compiler toolchains. ⢠Experience with tools like perf, VTune, Nsight, or ROCm profiler. ⢠Familiarity with popular inference engines (e.g. such as vLLM, sglang, TGI).
⢠Competitive salary and comprehensive benefits package. ⢠Opportunities for professional growth within Nebius. ⢠Hybrid working arrangements. ⢠A dynamic and collaborative work environment that values initiative and innovation.
Apply Now