Staff DevOps Engineer

Gaming • Sports • B2C

Fliff Inc. is a social sportsbook platform that transforms sports predictions into a play-for-fun social game. The company provides users with free daily credits to make sports picks and earn rewards without requiring any purchase. Users can win Fliff Coins to progress in leaderboards and earn badges, and Fliff Cash in promotional games, which can be redeemed for prizes. Fliff emphasizes social interaction, offering challenges, loyalty rewards, and referral bonuses while remaining compliant with gaming regulations across the United States, excluding Washington.

11 - 50 employees

🎮 Gaming

⚽ Sports

👥 B2C

💰 Venture Round on 2022-08

Staff DevOps Engineer

November 25

🔔 Pennsylvania – Remote

💵 $180k - $220k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Cassandra

Cloud

Distributed Systems

DynamoDB

Grafana

Kafka

Kubernetes

Microservices

Postgres

Prometheus

RabbitMQ

Redis

Terraform

Apply Now

Fliff Inc

Gaming • Sports • B2C

11 - 50 employees

🎮 Gaming

⚽ Sports

👥 B2C

💰 Venture Round on 2022-08

📋 Description

• Architect, build, and operate highly available, high-performance infrastructure that supports large volumes of real-time traffic, especially during peak sports windows. • Lead the design and development of internal platforms, tooling, and automated workflows that accelerate engineering productivity. • Own and improve observability (metrics, logs, traces), monitoring, and alerting to ensure fast detection and resolution of issues. • Drive incident response, root-cause analysis, and reliability improvements across the engineering organization. • Enhance and maintain our Kubernetes-based platform, including Helm charts, multi-environment pipelines, cluster upgrades, and operational hardening. • Implement and evolve security best practices around IAM, network architecture, secrets management, and infrastructure governance. • Build and maintain CI/CD pipelines that support safe, rapid, and stable deployments. • Partner with engineering leadership on capacity planning, cost optimization, and infrastructure roadmap planning. • Mentor and guide engineers across teams, influencing architectural direction and DevOps best practices. • Champion Infrastructure-as-Code, GitOps workflows, and automation as the default mode of operating.

🎯 Requirements

• 7+ years of SRE/DevOps/Cloud Infrastructure experience supporting production systems at scale. • Proven experience running distributed systems and microservices in high-traffic or high-availability environments. • Deep knowledge of Kubernetes, Helm, and cloud-native architecture (preferably AWS). • Strong proficiency in Go or another backend programming language. • Exposure to PostgreSQL, Redis, DynamoDB, or Cassandra performance tuning in a production environment. • Deep familiarity with event-driven systems, streaming pipelines, or messaging platforms such as Kinesis, Kafka, Pub/Sub, or RabbitMQ. • Advanced hands-on experience with Terraform or a comparable IaC solution. • Strong practical understanding of observability stacks (Datadog, Prometheus, Grafana, OpenTelemetry, etc.). • Experience implementing SLOs, SLIs, error budgets, and reliability-driven engineering practices. • Expertise with CI/CD pipelines and GitOps workflows. • Solid foundation in security fundamentals: IAM, network policies, workload hardening, secrets management. • Demonstrated leadership in technical decision-making, cross-functional influence, and on-call/incident management. • Excellent communication skills and the ability to work effectively across product, engineering, and leadership teams. • Ability to influence org-wide DevOps maturity, champion best practices, and drive standards across multiple teams.

🏖️ Benefits

• Unlimited/ Flexible Time Off: Flexible vacation policy • Health benefits with 100% paid premiums* for medical, dental, and vision plans for employees and dependents, plus an on-demand healthcare concierge. • Pre-tax savings plans for healthcare, with up to a $500 annual employer contribution to the HSA (if enrolled in the HSA medical plan). • Employee-sponsored 401(k) to help reach your financial goals. • Fully remote work environment. • Generous parental leave. • Professional development opportunities in a dynamic, global setting. • Work Remotely. • $500 work-from-home stipend + Equipment & Accessories. • A supportive, collaborative, and knowledge-driven workplace. • An engaging and challenging role with the freedom to innovate and develop effective solutions.

Apply Now

Similar Jobs

Staff DevOps Engineer

November 25

LILT

201 - 500

🤖 Artificial Intelligence

☁️ SaaS

🏢 Enterprise

DevOps Engineer building reliable software systems for AI-driven language translation startup. Collaborating on infrastructure needs, deployments, and enhancing multi-region configurations while working remotely.

🇺🇸 United States – Remote

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Docker

Google Cloud Platform

Jenkins

Kubernetes

Linux

Python

Terraform

Staff Site Reliability Engineer

November 24

SentinelOne

1001 - 5000

🔒 Cybersecurity

🤖 Artificial Intelligence

☁️ SaaS

Site Reliability Engineer responsible for managing incidents and ensuring uptime promise at SentinelOne's cybersecurity team. Collaborate across teams to improve reliability and incident response processes.

🇺🇸 United States – Remote

💵 $148k - $185k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Cloud

Grafana

Kubernetes

Prometheus

Python

Engineering Manager, Site Reliability (SRE)

November 24

SentinelOne

1001 - 5000

🔒 Cybersecurity

🤖 Artificial Intelligence

☁️ SaaS

Manager of SRE team at SentinelOne ensuring product reliability and scalability. Leading operational efforts and cross-team collaboration to improve customer experience in production services.

🇺🇸 United States – Remote

💵 $160k - $200k / year

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

AWS

Cloud

Distributed Systems

Google Cloud Platform

Grafana

Kubernetes

Prometheus

Terraform

Principal DevOps Engineer

November 22

SambaNova Systems

201 - 500

🤖 Artificial Intelligence

🔧 Hardware

🏢 Enterprise

DevOps Engineer managing CI/CD pipelines for SambaNova's AI inference platforms. Collaborating with engineering teams to ensure robust release infrastructure and deployment efficiency.

🇺🇸 United States – Remote

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

AWS

Docker

Jenkins

Kubernetes

Linux

Python

Unix

Staff Site Reliability Engineer, Streaming

November 21

Alpaca

201 - 500

🔌 API

💳 Fintech

₿ Crypto

Site Reliability Engineer ensuring the reliability and performance of systems at Alpaca. Collaborate with teams to implement solutions and improve the infrastructure.

🇺🇸 United States – Remote

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

Distributed Systems

Kafka

Kubernetes

Linux

Prometheus

RabbitMQ