Technical Service Operations Lead

🕒 vor 2 Monaten

🇨🇦 Kanada – Remote

💵 $120.000 - $150.000 / Jahr

⏰ Vollzeit

🟠 Senior

⚙️ Operations

🗣️🇺🇸🇬🇧 Englisch erforderlich

Jetzt Bewerben
Ähnliche Remote-Jobs finden

📊 Überprüfen Sie Ihre Lebenslauf-Bewertung für diese Stelle

Verbessern Sie Ihre Chancen auf ein Vorstellungsgespräch, indem Sie Ihre Lebenslauf-Bewertung vor der Bewerbung überprüfen.

Logo of Xsolla

Xsolla

201 - 500 Mitarbeiter

Gegründet 2005

🎮 Gaming

🛍️ eCommerce

☁️ SaaS

Gaming • eCommerce • SaaS

Xsolla ist auf Lösungen für Game-Commerce spezialisiert. Das Unternehmen stellt Tools und Services bereit, mit denen Entwickler von Videospielen ihre Spiele monetarisieren, vertreiben und finanzieren können – und so in jeder Entwicklungsphase ihren Umsatz steigern. Xsolla bietet ein breites Spektrum an Lösungen, darunter anpassbare Webshops, Zahlungsabwicklung, Betrugsprävention (Fraud-Management) sowie Social-Pakete zur Steigerung der Spielerbindung. Mit über 19 Jahren Erfahrung in der Videospielbranche arbeitet Xsolla weltweit mit Entwicklern und Publishern zusammen, um Compliance, Steuern und operative Komplexität zu managen – damit sie sich auf die Verbindung zu ihrer Spielerschaft konzentrieren können. Die Services sind für gängige Entwicklungsplattformen wie Unity, Unreal und Cocos Creator ausgelegt und unterstützen weltweit eine große Bandbreite lokaler Zahlungsmethoden.

Beschreibung

• Serve as Incident Commander for major incidents — coordinating cross-functional response teams, driving investigation, making escalation decisions, and ensuring incidents are resolved within SLA targets. • Own all incident communications: draft and send clear, timely updates to senior leadership, Customer Success, and partner/customer contacts throughout the incident lifecycle, and manage customer-facing status page updates (status.xsolla.com). • Facilitate blameless Post-Incident Reviews (PIRs) for major incidents — leading root cause identification, assigning corrective actions with clear owners and deadlines, and tracking them to closure. • During non-incident periods, proactively analyze incident trends, recurring issues, and production bugs — identify patterns, create Problem tickets, and report findings and recommendations to product and engineering teams on a regular cadence. • Enforce the incident management framework across the organization, including the severity model, priority matrix, SLA targets, escalation procedures, and deployment readiness gates. • Oversee and mentor the Operations Engineer on your shift — coaching on triage, investigation, runbook execution, and documentation quality while conducting regular knowledge transfer sessions to build depth across the service portfolio. • Produce shift handoff reports and deliver regular operational reporting: incident trends, KPI performance (MTTD, MTTA, MTTR), SLA adherence, proactive detection rate, and repeat incident analysis. • Audit service catalogue completeness on a regular cadence and govern JIRA Service Management workflows for incident, PIR, and problem management. • Cover for the Operations Engineer role during absences, breaks, or surge incidents. Participate in weekend on-call rotation for major incidents.

🎯 Anforderungen

• 6+ years of experience in incident management, SRE, NOC leadership, or technical operations in a production environment supporting high-availability, high-transaction systems (payments, e-commerce, SaaS, or gaming platforms preferred). • Proven incident management experience — coordinating multi-team response, making real-time escalation decisions, and communicating with executive stakeholders under pressure. • Excellent written and verbal communication skills in English — ability to draft clear, concise executive updates at 3 AM under pressure, facilitate blameless PIRs, present operational metrics to senior leadership, and communicate incident status to customers and partners with clarity and professionalism. • Strong ITIL foundation — understanding of incident, problem, and change management lifecycles with practical experience implementing or operating ITIL-aligned workflows. • Technical depth across the observability stack — ability to read and interpret logs, traces, and metrics in Datadog (or equivalent: Grafana, Splunk, New Relic). Understanding of APM, SLOs, error budgets, burn-rate alerting, and synthetic monitoring. • Hands-on experience with incident tooling: Datadog, PagerDuty or OpsGenie, JIRA or JIRA Service Management, Slack, and Confluence. • Analytical mindset — ability to identify trends, patterns, and recurring issues from incident data and translate them into actionable recommendations for product and engineering teams. • Experience with SLA/SLO-driven operations where MTTD, MTTA, and MTTR are measured, reported, and improved. • Experience with or strong interest in AI/ML-assisted operations: anomaly detection, alert correlation, predictive alerting, automated remediation, or self-healing automation. • Comfort with 24x7 shift-based operations as part of a follow-the-sun model with handoff overlaps. Weekend on-call (rotating) for critical severities is required.

🏖️ Vorteile

• We are passionate about fostering a supportive environment for our team, so we prioritize the physical, mental, and emotional well-being of our employees and their families through a comprehensive Benefits Program. • This includes medical, dental, and vision, PTO, and a personalized career roadmap for each employee. • By investing in professional development through training and educational opportunities, we ensure that our team thrives both personally and professionally. • Together, we're not just building a business; we're cultivating a community that values creativity, collaboration, and the transformative power of play.

Jetzt Bewerben

Ähnliche Jobs

🕒 vor 3 Monaten

Juniper Creates

51 - 200

🛍️ eCommerce

🛒 Einzelhandel

☁️ SaaS

Senior Manager, Creative Operations managing Creative Proposals for Juniper's crowdfunding platform. Collaborating with cross-functional teams to drive product launches and ensure quality deliverables.

🇨🇦 Kanada – Remote

💵 $100.000 - $130.000 / Jahr

💰 Pre Seed Round im 2015-12

⏰ Vollzeit

🟠 Senior

⚙️ Operations

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 3 Monaten

Veeva Systems

1001 - 5000

☁️ SaaS

⚕️ Krankenversicherung

💊 Pharmazie

Data Operations Manager at Veeva overseeing HCP Access data product lifecycle and ensuring quality customer deliverables. Involves cross-team collaboration and technical problem-solving for data reliability.

🇨🇦 Kanada – Remote

💵 $90.000 - $130.000 / Jahr

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

⚙️ Operations

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 3 Monaten

Fanatics, Inc.

1001 - 5000

🎮 Gaming

🛒 Einzelhandel

🛍️ eCommerce

Marketing Manager leading digital operations for Fanatics Betting & Gaming. Focusing on martech integration and campaign tracking across multiple platforms.

🇨🇦 Kanada – Remote

💵 $84.000 - $115.000 / Jahr

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

⚙️ Operations

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 3 Monaten

ENERCON

10.000+ Mitarbeiter

⚡ Energie

🔧 Hardware

Coordination of high voltage electrical network operations at ENERCON. Providing technical support for maintenance technicians while improving safety and efficiency remotely.

🇨🇦 Kanada – Remote

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

⚙️ Operations

🗣️🇫🇷 Französisch erforderlich

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 6 Monaten

Procurify

51 - 200

☁️ SaaS

💸 Finanzen

🤝 B2B

Lead product strategy and operations alignment in a remote fintech SaaS company. Influence cross-functional execution and improve organizational efficiency.

🇨🇦 Kanada – Remote

💵 $128.000 - $160.000 / Jahr

💰 Series B im 2019-06

⏰ Vollzeit

🟠 Senior

⚙️ Operations

🗣️🇺🇸🇬🇧 Englisch erforderlich