AI Infrastructure, Platform Operations Engineer


501 - 1000 employees

🏢 Enterprise

☁️ SaaS

Cloud Computing • Enterprise • SaaS

Mirantis is a company specializing in container management and cloud infrastructure solutions. Its portfolio includes Mirantis Kubernetes Engine (MKE), Mirantis OpenStack for Kubernetes (MOSK), and Mirantis Container Cloud (MCC), platforms for enterprise-grade Kubernetes and container management. Mirantis also develops tools for secure software supply chains, such as Mirantis Container Runtime (MCR) and Mirantis Secure Registry (MSR). As an advocate of open-source technologies, Mirantis supports various projects and provides resources such as Lens Desktop, a popular Kubernetes IDE, along with technical support for companies adopting cloud-native technologies. Its solutions target sectors such as the public sector, financial services, and SaaS and technology services.


🔥 18 minutes ago

🇪🇺 Europe – Remote

⏰ Full-time

🟡 Mid-level

🟠 Senior

👷 IT Infrastructure Engineer

🗣️🇺🇸🇬🇧 English required

Cloud

Distributed Systems

Grafana

Kubernetes

Linux

Prometheus


Description

• Monitor, operate, and support production AI infrastructure platforms.
• Investigate and resolve infrastructure, networking, hardware, and platform-related incidents.
• Support NVIDIA GPU infrastructure and associated platform services.
• Monitor and troubleshoot Kubernetes-based environments.
• Investigate performance, availability, and reliability issues across infrastructure and platform components.
• Collaborate with engineering teams, hardware vendors, datacenter personnel, and service delivery teams to resolve technical issues.
• Participate in incident response, root cause analysis, and operational improvement activities.
• Contribute to improvements in monitoring, observability, automation, and operational processes.
• Maintain operational documentation, runbooks, and knowledge articles.

🎯 Requirements

• 3+ years of experience in infrastructure operations, platform operations, network operations, site reliability engineering, cloud operations, datacenter operations, or related technical roles.
• Strong Linux administration and troubleshooting skills.
• Good understanding of networking concepts and experience diagnosing infrastructure-related issues.
• Working knowledge of Kubernetes in production environments.
• Experience supporting production infrastructure and services.
• Strong analytical and problem-solving skills.
• Experience working within structured operational and incident management processes.
• Excellent communication and collaboration skills.
• Ability to work within a shift-based operational environment.
• Experience in one or more of the following areas is highly desirable:
  • NVIDIA GPU infrastructure and accelerated computing platforms.
  • InfiniBand networking and NVIDIA UFM.
  • Kubernetes platform operations.
  • AI infrastructure or HPC environments.
  • Site Reliability Engineering (SRE) or Platform Engineering.
  • Observability platforms such as Grafana, Prometheus, ELK, or OpenTelemetry.
  • Infrastructure automation technologies and Infrastructure-as-Code practices.
  • Large-scale distributed systems and production platforms.

🏖️ Benefits

• Work with some of the most advanced AI infrastructure environments in production today.
• Gain exposure to NVIDIA GPU technologies, Kubernetes platforms, and high-performance networking environments.
• Help define how next-generation AI infrastructure is operated and supported.
• Be part of a team shaping the future of AI-powered operations through k0rdent AI.
• Join a growing organisation investing heavily in AI infrastructure and platform services.

