Senior Infrastructure Engineer

Job not on LinkedIn

🔥 0 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Somnia

Somnia

11 - 50 employees

🥽 AR/VR

🧘 Wellness

🤖 Artificial Intelligence

AR/VR • Wellness • Artificial Intelligence

Somnia is a company that seems to explore the realms of imagination and subconscious experiences, as suggested by its thought-provoking questions about what one sees when they close their eyes. Their focus may be on introspective experiences or technologies related to enhancing or understanding dreams and vision.

📋 Description

• Define and maintain SLOs, SLIs, and error budgets, plus the observability—metrics, logs, traces and alerts—that catches regressions before users do. • Build repeatable, self-service infrastructure through infrastructure-as-code, CI/CD and golden paths so teams can provision, deploy and recover without reinventing the wheel. • Own rollouts end-to-end—progressive delivery, canaries, safe migrations and clean rollbacks. • Operate the systems behind Somnia's nodes, validators, RPC and indexing, tuning for performance and cost across regions. • Lead incident response and on-call, run blameless postmortems, and continuously harden the platform. • Partner with product and protocol teams to design and operate production-ready services. You'll rotate between embedding with engineering teams and building the shared platform, tooling and operational standards that underpin the wider organisation.

🎯 Requirements

• Strong experience operating production infrastructure at scale (cloud and/or bare metal), with deep Linux fundamentals. • Experience with infrastructure-as-code such as Terraform or Pulumi, alongside configuration management. • Experience running containers and orchestration platforms (Docker, Kubernetes) in production. • Strong programming skills, ideally in Go and/or TypeScript, for building automation and internal tooling. • Experience with observability stacks (Prometheus, Grafana, OpenTelemetry or equivalents). • Experience operating and monitoring distributed systems, including capacity planning and performance tuning. • Comfortable operating in high-stakes production environments and responding to incidents. • Genuine interest in crypto and on-chain systems. • Experience operating blockchain node infrastructure (validators, RPC, archive nodes) for an L1/L2. • Experience with high-performance networking, low-latency systems or load balancing at scale. • Multi-region and geo-distributed deployments with failover strategies. • Security and key management (HSMs, secrets management, hardening). • EVM tooling and the wider Web3 infrastructure ecosystem.

🏖️ Benefits

• Competitive compensation with token incentives

Apply Now

Similar Jobs

🔥 11 hours ago

Mirantis

501 - 1000

🏢 Enterprise

☁️ SaaS

Senior AI Infrastructure & Platform Operations Engineer at Mirantis overseeing AI infrastructure. Providing technical leadership and troubleshooting across complex production environments.

Cloud

Distributed Systems

Grafana

Kubernetes

Linux

Prometheus

🔥 11 hours ago

Mirantis

501 - 1000

🏢 Enterprise

☁️ SaaS

Senior Engineer leading AI Infrastructure & Platform Operations at Mirantis. Supporting large-scale AI infrastructure environments and providing technical leadership for reliability and operations teams.

Cloud

Distributed Systems

Kubernetes

Linux

🕒 Yesterday

Mirantis

501 - 1000

🏢 Enterprise

☁️ SaaS

Senior AI Infrastructure Engineer at Mirantis managing large-scale AI infrastructures powered by NVIDIA GPUs and Kubernetes. Leading technical operations and incident management with a focus on platform reliability and automation.

Cloud

Distributed Systems

Grafana

Kubernetes

Linux

Prometheus

🕒 June 25

NIR-YU

201 - 500

🎯 Recruiter

👥 HR Tech

🏢 Enterprise

Senior Unity engineer developing the client-side infrastructure for a VR training platform. Focused on architecture and optimization in a flexible, remote setup.

Unity

🕒 June 2

Thrill

11 - 50

🎮 Gaming

🥽 AR/VR

Data Warehouse and Infrastructure Engineer tuning ClickHouse queries and managing data infrastructure at Thrill Labs. Responsible for maintaining data models and dashboards, ensuring data quality and performance.

Ansible

Docker

Kafka

Kubernetes

Linux

Shell Scripting

SQL

Terraform

Zookeeper