HPC Specialist, Solutions Architect

Vaga não está no LinkedIn

🕒 Fevereiro 5

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $225.000 - $315.000 / ano

⏰ Tempo Integral

🟡 Pleno

🟠 Sênior

💻 Engenheiro de Soluções

🗣️🇺🇸🇬🇧 Inglês obrigatório

Candidatar-se
Encontrar Vagas Remotas Similares

📊 Verifique sua pontuação de currículo para esta vaga

Melhore suas chances de conseguir uma entrevista verificando sua pontuação de currículo antes de se candidatar.

Logo of Nebius Group

Nebius Group

1001 - 5000 funcionários

🏢 Corporativo

☁️ SaaS

AI • Enterprise • SaaS

O Nebius Group está construindo uma das principais empresas de infraestrutura de IA do mundo, focando em fornecer os recursos de computação, armazenamento e ferramentas necessários para desenvolvedores no espaço de IA. Com sede na Europa e listado na Nasdaq, o Nebius tem uma presença global com centros de P&D em toda a Europa, América do Norte e Israel. A principal oferta da empresa é uma plataforma de nuvem centrada em IA projetada para cargas de trabalho intensivas de IA, complementada por vários outros negócios envolvidos em desenvolvimento de IA generativa, edtech e tecnologia autônoma.

Descrição

• Architect and implement scalable HPC clusters optimized for AI, simulation, and distributed training, leveraging container orchestration frameworks and schedulers (e.g., Kubernetes, Slurm). • Design and integrate GPU-accelerated compute infrastructures featuring NVIDIA Hopper, Blackwell architectures, NVLink/NVSwitch, and InfiniBand/RoCE Interconnects. • Deploy, and manage GPU Operator and Network Operator stacks for automated lifecycle management of GPU and high-speed networking components. • Design and validate cloud HPC environments, focusing on low-latency, high-bandwidth networking, multi-GPU scaling, and efficient workload scheduling. • Lead reference architectures for AI/ML model training, data pipelines, and MLOps integrations using modern observability and CI/CD tooling. • Collaborate with hardware vendors (e.g., NVIDIA) and cloud providers to evaluate and optimize emerging HPC and GPU technologies. • Benchmark system performance, identify bottlenecks, and tune resource utilization across compute, network, and storage tiers. • Provide expert-level technical guidance to customers, internal teams, and partners on HPC architecture patterns, operational excellence reviews and customer engagements

🎯 Requisitos

• Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field (Ph.D. a plus) • 3+ years of hands-on experience architecting HPC or large-scale GPU clusters. • Expertise in Linux systems, Kubernetes, container runtimes (containers, CRI-O, Docker), and related CI/CD practices. • Strong understanding of HPC networking protocols and RDMA stacks (InfiniBand, NVLink/NVSwitch) • Deep understanding of storage and I/O optimization for large datasets (Ceph, Lustre, NFS, GPUDirect Storage) • Familiarity with Terraform, Ansible, Helm, and GitOps workflows. • Strong scripting skills in Python or Bash for automation and tool integration. • Excellent communication and documentation skills; ability to lead design reviews and customer engagements.

🏖️ Benefícios

• Health Insurance: 100% company-paid medical, dental, and vision coverage for employees and families. • 401(k) Plan: Up to 4% company match with immediate vesting. • Parental Leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers. • Remote Work Reimbursement: Up to $85/month for mobile and internet. • Disability & Life Insurance: Company-paid short-term, long-term, and life insurance coverage.

Candidatar-se

Vagas Similares

🕒 Fevereiro 4

NVIDIA

10.000+ funcionários

🤖 Inteligência Artificial

🎮 Jogos

Solutions Architect at NVIDIA supporting innovative companies in AdTech and Media. Collaborating with teams to optimize workflows and drive technology adoption with advanced computing.

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $224.000 - $356.500 / ano

⏰ Tempo Integral

🟠 Sênior

💻 Engenheiro de Soluções

🦅 Patrocina Visto H1B

info

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Fevereiro 4

Aquila

1001 - 5000

☁️ SaaS

🏢 Corporativo

Software Implementation Consultant for treasury management at SymPro, helping public sector clients manage financial workflows. Leading software implementations and providing training and ongoing support.

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Fevereiro 4

KnowBe4

1001 - 5000

🔒 Cibersegurança

☁️ SaaS

📚 Educação

Solution Architect designing integrations and architecture for Salesforce and other SaaS platforms. Leading AI/ML strategy and collaborating with teams on technical designs.

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $150.000 - $170.000 / ano

⏰ Tempo Integral

🟠 Sênior

🔴 Especialista

💻 Engenheiro de Soluções

🦅 Patrocina Visto H1B

info

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Fevereiro 3

Solutions Engineer providing technical solutions and support in the cybersecurity sector. Collaborating with sales to meet customer needs and enhance product implementation.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟡 Pleno

🟠 Sênior

💻 Engenheiro de Soluções

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Fevereiro 3

First Quality

1001 - 5000

⚕️ Seguro de Saúde

🛒 Varejo

⚡ Produtividade

Solution Engineer designing, building, and deploying IT infrastructure for First Quality's data center services. Collaborating with team members and ensuring high-quality service delivery.

🗣️🇺🇸🇬🇧 Inglês obrigatório

Cloud

DNS

ITSM

Linux

Terraform

VMware