AI Infrastructure & Platform Operations Engineer

501 - 1000 employees

🏢 Enterprise

☁️ SaaS

Cloud Computing • Enterprise • SaaS

Mirantis is a company specializing in container management and cloud infrastructure solutions. Its portfolio includes Mirantis Kubernetes Engine (MKE), Mirantis OpenStack for Kubernetes (MOSK), and Mirantis Container Cloud (MCC), enterprise-grade platforms for Kubernetes and container management. Mirantis also develops tools for secure software supply chains, such as Mirantis Container Runtime (MCR) and Mirantis Secure Registry (MSR). As an advocate of open-source technologies, Mirantis supports various projects and provides resources such as Lens Desktop, a popular Kubernetes IDE, as well as technical support for companies adopting cloud-native technologies. Mirantis solutions target sectors such as the public sector, financial services, and SaaS and technology services.

🔥 18 minutes ago

🇪🇺 Europe – Remote

⏰ Full-time

🟡 Mid-level

🟠 Senior

👷 IT Infrastructure Engineer

🗣️🇺🇸🇬🇧 English required

Cloud

Distributed Systems

Grafana

Kubernetes

Linux

Prometheus

Description

• Monitor, operate, and support production AI infrastructure platforms.
• Investigate and resolve infrastructure, networking, hardware, and platform-related incidents.
• Support NVIDIA GPU infrastructure and associated platform services.
• Monitor and troubleshoot Kubernetes-based environments.
• Investigate performance, availability, and reliability issues across infrastructure and platform components.
• Collaborate with engineering teams, hardware vendors, datacenter personnel, and service delivery teams to resolve technical issues.
• Participate in incident response, root cause analysis, and operational improvement activities.
• Contribute to improvements in monitoring, observability, automation, and operational processes.
• Maintain operational documentation, runbooks, and knowledge articles.

🎯 Requirements

• 3+ years of experience in infrastructure operations, platform operations, network operations, site reliability engineering, cloud operations, datacenter operations, or related technical roles.
• Strong Linux administration and troubleshooting skills.
• Good understanding of networking concepts and experience diagnosing infrastructure-related issues.
• Working knowledge of Kubernetes in production environments.
• Experience supporting production infrastructure and services.
• Strong analytical and problem-solving skills.
• Experience working within structured operational and incident management processes.
• Excellent communication and collaboration skills.
• Ability to work within a shift-based operational environment.
• Experience in one or more of the following areas is highly desirable:
  • NVIDIA GPU infrastructure and accelerated computing platforms.
  • InfiniBand networking and NVIDIA UFM.
  • Kubernetes platform operations.
  • AI infrastructure or HPC environments.
  • Site Reliability Engineering (SRE) or Platform Engineering.
  • Observability platforms such as Grafana, Prometheus, ELK, or OpenTelemetry.
  • Infrastructure automation technologies and Infrastructure-as-Code practices.
  • Large-scale distributed systems and production platforms.

🏖️ Benefits

• Work with some of the most advanced AI infrastructure environments in production today.
• Gain exposure to NVIDIA GPU technologies, Kubernetes platforms, and high-performance networking environments.
• Help define how next-generation AI infrastructure is operated and supported.
• Be part of a team shaping the future of AI-powered operations through k0rdent AI.
• Join a growing organisation investing heavily in AI infrastructure and platform services.
