HPC Specialist, Solutions Architect

Emploi pas sur LinkedIn

🕒 il y a 4 mois

🇺🇸 États-Unis – Télétravail

💵 $225 000 - $315 000 / an

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

💻 Ingénieur Solutions

🗣️🇺🇸🇬🇧 Anglais requis

Postuler Maintenant
Trouver des Emplois à Distance Similaires

📊 Vérifiez votre score de CV pour ce poste

Améliorez vos chances d'obtenir un entretien en vérifiant votre score de CV avant de postuler.

Logo of Nebius Group

Nebius Group

1001 - 5000 employés

🏢 Entreprise

☁️ SaaS

AI • Enterprise • SaaS

Le groupe Nebius construit l’une des principales entreprises d’infrastructure AI au monde, en se concentrant sur la fourniture de la puissance de calcul, du stockage et des outils nécessaires aux développeurs dans le domaine de l’AI. Basée en Europe et cotée au Nasdaq, Nebius dispose d’une présence mondiale avec des centres de R&D en Europe, en Amérique du Nord et en Israël. L’offre principale de l’entreprise est une plateforme cloud centrée sur l’AI, conçue pour des workloads AI intensifs, complétée par diverses autres activités impliquées dans le développement de l’AI générative, l’edtech et les technologies autonomes.

Description

• Architect and implement scalable HPC clusters optimized for AI, simulation, and distributed training, leveraging container orchestration frameworks and schedulers (e.g., Kubernetes, Slurm). • Design and integrate GPU-accelerated compute infrastructures featuring NVIDIA Hopper, Blackwell architectures, NVLink/NVSwitch, and InfiniBand/RoCE Interconnects. • Deploy, and manage GPU Operator and Network Operator stacks for automated lifecycle management of GPU and high-speed networking components. • Design and validate cloud HPC environments, focusing on low-latency, high-bandwidth networking, multi-GPU scaling, and efficient workload scheduling. • Lead reference architectures for AI/ML model training, data pipelines, and MLOps integrations using modern observability and CI/CD tooling. • Collaborate with hardware vendors (e.g., NVIDIA) and cloud providers to evaluate and optimize emerging HPC and GPU technologies. • Benchmark system performance, identify bottlenecks, and tune resource utilization across compute, network, and storage tiers. • Provide expert-level technical guidance to customers, internal teams, and partners on HPC architecture patterns, operational excellence reviews and customer engagements

🎯 Exigences

• Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field (Ph.D. a plus) • 3+ years of hands-on experience architecting HPC or large-scale GPU clusters. • Expertise in Linux systems, Kubernetes, container runtimes (containers, CRI-O, Docker), and related CI/CD practices. • Strong understanding of HPC networking protocols and RDMA stacks (InfiniBand, NVLink/NVSwitch) • Deep understanding of storage and I/O optimization for large datasets (Ceph, Lustre, NFS, GPUDirect Storage) • Familiarity with Terraform, Ansible, Helm, and GitOps workflows. • Strong scripting skills in Python or Bash for automation and tool integration. • Excellent communication and documentation skills; ability to lead design reviews and customer engagements.

🏖️ Avantages

• Health Insurance: 100% company-paid medical, dental, and vision coverage for employees and families. • 401(k) Plan: Up to 4% company match with immediate vesting. • Parental Leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers. • Remote Work Reimbursement: Up to $85/month for mobile and internet. • Disability & Life Insurance: Company-paid short-term, long-term, and life insurance coverage.

Postuler Maintenant

Emplois Similaires

🕒 il y a 4 mois

NVIDIA

10 000+ employés

🤖 Intelligence artificielle

🎮 Jeux vidéo

Solutions Architect at NVIDIA supporting innovative companies in AdTech and Media. Collaborating with teams to optimize workflows and drive technology adoption with advanced computing.

🇺🇸 États-Unis – Télétravail

💵 $224 000 - $356 500 / an

⏰ Temps Plein

🟠 Senior

💻 Ingénieur Solutions

🦅 Parrain de Visa H1B

info

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 4 mois

Aquila

1001 - 5000

☁️ SaaS

🏢 Entreprise

Software Implementation Consultant for treasury management at SymPro, helping public sector clients manage financial workflows. Leading software implementations and providing training and ongoing support.

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 4 mois

KnowBe4

1001 - 5000

🔒 Cybersecurity

☁️ SaaS

📚 Éducation

Solution Architect designing integrations and architecture for Salesforce and other SaaS platforms. Leading AI/ML strategy and collaborating with teams on technical designs.

🇺🇸 États-Unis – Télétravail

💵 $150 000 - $170 000 / an

⏰ Temps Plein

🟠 Senior

🔴 Expert

💻 Ingénieur Solutions

🦅 Parrain de Visa H1B

info

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 4 mois

Solutions Engineer providing technical solutions and support in the cybersecurity sector. Collaborating with sales to meet customer needs and enhance product implementation.

🇺🇸 États-Unis – Télétravail

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

💻 Ingénieur Solutions

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 4 mois

First Quality

1001 - 5000

⚕️ Assurance santé

🛒 Commerce de détail

⚡ Productivité

Solution Engineer designing, building, and deploying IT infrastructure for First Quality's data center services. Collaborating with team members and ensuring high-quality service delivery.

🗣️🇺🇸🇬🇧 Anglais requis

Cloud

DNS

ITSM

Linux

Terraform

VMware