DevOps – Infrastructure Engineer

🕒 May 1

Strive Gaming

51 - 200 employees

🎲 Gambling

🎮 Gaming

🤝 B2B

Strive Gaming is a leading platform provider in the North American iGaming market, offering a full-service omni-channel wagering solution specifically tailored for the region. They specialize in B2B products, including online and on-premise wagering platforms, real-time player management, and white-label and native apps. Their services include managed compliance, deep third-party integrations, and Class II mobile casino options. Strive Gaming addresses the challenges faced by multi-state and province operators with its modern, multi-tenant, and scalable architecture designed to improve operational efficiency and player engagement. They offer a wide range of casino games and have integrated with the top sports betting providers to ensure swift market entry and growth for their clients.

📋 Description

• Design, build, and maintain our observability platform: metrics, logs, traces, and everything in between
• Get hands-on with infrastructure: deploy services, troubleshoot incidents, and fix things when they break (because they will)
• Instrument applications and services to capture meaningful telemetry data that drives real insights
• Build dashboards and alerting systems that teams actually use, not just noise generators
• Dive into production issues, correlate data across systems, and lead root cause analysis
• Champion observability best practices across engineering teams and help developers instrument their own code
• Automate everything you can: infrastructure provisioning, deployment pipelines, and operational runbooks
• Work closely with SRE and development teams to improve system reliability and performance
• Evaluate and integrate new observability tools and technologies as the landscape evolves

🎯 Requirements

• 3+ years of experience in DevOps, Infrastructure, or SRE roles, with real production battle scars
• Deep hands-on experience with observability tools: Prometheus, Grafana, Datadog, New Relic, Splunk, ELK stack, Jaeger, or similar
• Strong proficiency with cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code (Terraform, Pulumi, CloudFormation)
• Solid scripting and automation skills (Python, Bash, Go, or similar)
• Experience with containerisation and orchestration (Docker, Kubernetes)
• Understanding of distributed systems, microservices architectures, and the unique observability challenges they present
• Familiarity with CI/CD pipelines and GitOps workflows
• Excellent troubleshooting skills: you're the person who doesn't give up until you've found the root cause

🏖️ Benefits

• Competitive salary and equity package
• Flexible working arrangements
• Learning and development budget
• Modern tech stack and the autonomy to make real impact
• A team that values doing things properly over just doing things quickly
