T3 Operations & Support Specialist – Compute & OS

🔥 1 minute ago

🗣️🇩🇪 German Required

Interval Group

51 - 200 employees

We are a boutique consulting and recruitment firm, and we specialise in providing expert resources to enable industry-leading organisations to achieve their goals.

📋 Description

• Providing T3 operational ownership for Compute & OS services: handling complex incidents, troubleshooting and RCA, and driving permanent fixes and preventive measures
• Ensuring compute/OS readiness for releases and changes: monitoring/alerting coverage, performance baselines, hardening, patch strategy, rollback and recovery procedures, and runbooks
• Executing and improving standard operational procedures through automation to reduce toil and improve MTTR and stability
• Coordinating with Kubernetes, Data, Network and Storage SMEs to resolve cross-domain production issues
• Validating deployment artefacts from an operations perspective and enforcing quality assurance measures
• Monitoring system health, performance metrics and service availability across multi-tenant environments
• Identifying, analysing and resolving incidents to minimise service disruption, and triggering RCA and corrective actions
• Implementing monitoring and logging strategies to support audit and compliance requirements
• Performing routine security scans and remediating identified vulnerabilities

🎯 Requirements

• 5 to 10+ years in IT operations, service delivery, or platform operations
• Proven experience implementing and leading Incident, Problem, Change and Release governance in production
• Hands-on experience with VMware 8 virtualisation
• Operating Systems: Red Hat Enterprise Linux and Ubuntu
• OS tooling: Satellite, IPA, Certificate Server
• ITSM/collaboration tooling: Jira Service Management, Jira, Confluence
• Fundamental understanding of core operations processes (Incident, Change, Problem management, ITSM) and SRE concepts
• Experience gathering operational insights from monitoring/observability including SLI/SLA/SLO management and tracking
• Hands-on experience documenting procedures and enforcing clear runbooks and playbooks
• Hands-on experience with monitoring and logging tools (e.g. Prometheus, Grafana, Datadog, Mimir, Loki)
• Understanding of modern platform operations (Kubernetes/containers, automation, observability) sufficient to govern specialists
• Fluent English and German (C1 minimum in both)

🏖️ Benefits

• Flexible working hours
• Freedom to choose projects
• Access to exciting projects in various industries
• Support in advancing your career
• Competitive pay
• Dedicated team for assistance

Similar Jobs

🕒 June 4

Cafeyn

201 - 500

Customer Care Specialist responsible for handling user requests via email, phone, and social media. Contributing to service improvement and optimizing customer relationship follow-up.

🗣️🇩🇪 German Required

🕒 May 7

proSenio

51 - 200

🤝 Non-profit

🌍 Social Impact

🎯 Recruiter

Freelancer in telephone customer service for senior care inquiries, engaging with potential clients and providing support via phone. Documenting conversations and maintaining communication within specified time slots.

🗣️🇩🇪 German Required

🕒 May 5

WeFi

11 - 50

₿ Crypto

🏦 Banking

Customer Support Specialist handling customer inquiries via chat and social media. Collaborating with technical support teams to resolve issues in a remote Berlin role.

🗣️🇩🇪 German Required

🕒 March 31

Upway

51 - 200

🛍️ eCommerce

🛒 Retail

🧘 Wellness

Customer Care Specialist focusing on delivering unforgettable service during the bike purchase process. Supporting customers in selecting the right E-bike and ensuring satisfaction in after-sales service.

🗣️🇩🇪 German Required