
Gaming ⢠Sports ⢠B2C
Fliff Inc. is a social sportsbook platform that transforms sports predictions into a play-for-fun social game. The company provides users with free daily credits to make sports picks and earn rewards without requiring any purchase. Users can win Fliff Coins to progress in leaderboards and earn badges, and Fliff Cash in promotional games, which can be redeemed for prizes. Fliff emphasizes social interaction, offering challenges, loyalty rewards, and referral bonuses while remaining compliant with gaming regulations across the United States, excluding Washington.
November 25
đ Pennsylvania â Remote
đľ $180k - $220k / year
â° Full Time
đ´ Lead
â DevOps & Site Reliability Engineer (SRE)
AWS
Cassandra
Cloud
Distributed Systems
DynamoDB
Grafana
Kafka
Kubernetes
Microservices
Postgres
Prometheus
RabbitMQ
Redis
Terraform
Go

Gaming ⢠Sports ⢠B2C
Fliff Inc. is a social sportsbook platform that transforms sports predictions into a play-for-fun social game. The company provides users with free daily credits to make sports picks and earn rewards without requiring any purchase. Users can win Fliff Coins to progress in leaderboards and earn badges, and Fliff Cash in promotional games, which can be redeemed for prizes. Fliff emphasizes social interaction, offering challenges, loyalty rewards, and referral bonuses while remaining compliant with gaming regulations across the United States, excluding Washington.
⢠Architect, build, and operate highly available, high-performance infrastructure that supports large volumes of real-time traffic, especially during peak sports windows. ⢠Lead the design and development of internal platforms, tooling, and automated workflows that accelerate engineering productivity. ⢠Own and improve observability (metrics, logs, traces), monitoring, and alerting to ensure fast detection and resolution of issues. ⢠Drive incident response, root-cause analysis, and reliability improvements across the engineering organization. ⢠Enhance and maintain our Kubernetes-based platform, including Helm charts, multi-environment pipelines, cluster upgrades, and operational hardening. ⢠Implement and evolve security best practices around IAM, network architecture, secrets management, and infrastructure governance. ⢠Build and maintain CI/CD pipelines that support safe, rapid, and stable deployments. ⢠Partner with engineering leadership on capacity planning, cost optimization, and infrastructure roadmap planning. ⢠Mentor and guide engineers across teams, influencing architectural direction and DevOps best practices. ⢠Champion Infrastructure-as-Code, GitOps workflows, and automation as the default mode of operating.
⢠7+ years of SRE/DevOps/Cloud Infrastructure experience supporting production systems at scale. ⢠Proven experience running distributed systems and microservices in high-traffic or high-availability environments. ⢠Deep knowledge of Kubernetes, Helm, and cloud-native architecture (preferably AWS). ⢠Strong proficiency in Go or another backend programming language. ⢠Exposure to PostgreSQL, Redis, DynamoDB, or Cassandra performance tuning in a production environment. ⢠Deep familiarity with event-driven systems, streaming pipelines, or messaging platforms such as Kinesis, Kafka, Pub/Sub, or RabbitMQ. ⢠Advanced hands-on experience with Terraform or a comparable IaC solution. ⢠Strong practical understanding of observability stacks (Datadog, Prometheus, Grafana, OpenTelemetry, etc.). ⢠Experience implementing SLOs, SLIs, error budgets, and reliability-driven engineering practices. ⢠Expertise with CI/CD pipelines and GitOps workflows. ⢠Solid foundation in security fundamentals: IAM, network policies, workload hardening, secrets management. ⢠Demonstrated leadership in technical decision-making, cross-functional influence, and on-call/incident management. ⢠Excellent communication skills and the ability to work effectively across product, engineering, and leadership teams. ⢠Ability to influence org-wide DevOps maturity, champion best practices, and drive standards across multiple teams.
⢠Unlimited/ Flexible Time Off: Flexible vacation policy ⢠Health benefits with 100% paid premiums* for medical, dental, and vision plans for employees and dependents, plus an on-demand healthcare concierge. ⢠Pre-tax savings plans for healthcare, with up to a $500 annual employer contribution to the HSA (if enrolled in the HSA medical plan). ⢠Employee-sponsored 401(k) to help reach your financial goals. ⢠Fully remote work environment. ⢠Generous parental leave. ⢠Professional development opportunities in a dynamic, global setting. ⢠Work Remotely. ⢠$500 work-from-home stipend + Equipment & Accessories. ⢠A supportive, collaborative, and knowledge-driven workplace. ⢠An engaging and challenging role with the freedom to innovate and develop effective solutions.
Apply NowNovember 25
DevOps Engineer building reliable software systems for AI-driven language translation startup. Collaborating on infrastructure needs, deployments, and enhancing multi-region configurations while working remotely.
AWS
Docker
Google Cloud Platform
Jenkins
Kubernetes
Linux
Python
Terraform
November 24
Site Reliability Engineer responsible for managing incidents and ensuring uptime promise at SentinelOne's cybersecurity team. Collaborate across teams to improve reliability and incident response processes.
đşđ¸ United States â Remote
đľ $148k - $185k / year
â° Full Time
đ´ Lead
â DevOps & Site Reliability Engineer (SRE)
đŚ H1B Visa Sponsor
Cloud
Grafana
Kubernetes
Prometheus
Python
November 24
Manager of SRE team at SentinelOne ensuring product reliability and scalability. Leading operational efforts and cross-team collaboration to improve customer experience in production services.
đşđ¸ United States â Remote
đľ $160k - $200k / year
â° Full Time
đ Senior
đ´ Lead
â DevOps & Site Reliability Engineer (SRE)
đŚ H1B Visa Sponsor
AWS
Cloud
Distributed Systems
Google Cloud Platform
Grafana
Kubernetes
Prometheus
Terraform
November 22
DevOps Engineer managing CI/CD pipelines for SambaNova's AI inference platforms. Collaborating with engineering teams to ensure robust release infrastructure and deployment efficiency.
đşđ¸ United States â Remote
â° Full Time
đ´ Lead
â DevOps & Site Reliability Engineer (SRE)
đŚ H1B Visa Sponsor
AWS
Docker
Jenkins
Kubernetes
Linux
Python
Unix
November 21
Site Reliability Engineer ensuring the reliability and performance of systems at Alpaca. Collaborate with teams to implement solutions and improve the infrastructure.
Distributed Systems
Kafka
Kubernetes
Linux
Prometheus
RabbitMQ
Go