
Web 3 • Compliance • SaaS
Lumos is a company that appears to be encountering operational issues, as indicated by an account suspension notice. The notice suggests that the company may provide online services or content that is currently unavailable due to a restriction, which could relate to hosting or compliance problems.
October 29
🇺🇸 United States – Remote
💵 $160k - $190k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
AWS
Azure
Cloud
Distributed Systems
Docker
Google Cloud Platform
Grafana
Jenkins
Kubernetes
Microservices
Prometheus
Python
Terraform
Vault

• Develop and maintain infrastructure as code (IaC) to ensure repeatability and scalability.
• Implement monitoring, logging, and alerting systems to proactively detect and resolve issues.
• Ensure high availability and disaster recovery mechanisms are in place.
• Design, implement, and optimize CI/CD pipelines to accelerate software delivery.
• Automate deployment and rollback processes for minimal downtime.
• Enforce best practices for build, test, and release automation.
• Optimize cloud resource utilization to balance performance and cost.
• Implement auto-scaling, load balancing, and caching strategies for improved efficiency.
• Continuously evaluate and recommend cost-effective infrastructure solutions.
• Work closely with engineering teams to align DevOps strategies with business goals.
• Promote a culture of continuous improvement through automation and innovation.
• Provide mentorship and training to other engineers on DevOps best practices.
• Proficiency in cloud platforms (AWS, GCP, or Azure) and containerization (Docker, Kubernetes).
• Strong experience with Infrastructure as Code (Terraform, CloudFormation, Pulumi, etc.).
• Deep knowledge of CI/CD tools (Jenkins, GitHub Actions, GitLab CI/CD, ArgoCD, etc.).
• Knowledge of build optimization tools and frameworks (Poetry, Pants/Bazel, Docker, etc.).
• Experience with observability tools (Datadog, Prometheus, Grafana, ELK Stack, etc.) and defining SLOs.
• Strong bias towards software development, with no fear of writing or contributing code to large codebases.
• Comfortable reading, debugging, and optimizing large codebases (Python experience preferred).
• Ability to diagnose and resolve infrastructure and performance issues.
• Strong debugging skills across distributed systems and microservices architectures.
• Understanding of security principles such as IAM, encryption, network security, and secrets management.
• Familiarity with compliance frameworks and security tooling (e.g., Wiz, Vault, Falco, AWS Security Hub).
• Ability to work cross-functionally with engineering, security, and product teams.
• Strong written and verbal communication skills to document and share knowledge effectively.
• Experience in an agile development environment and ability to contribute to DevOps culture.
• 💯 Remote work culture (+/-4 hours Pacific Time)
• ⛑ Medical, Vision, & Dental coverage covered by Lumos
• 🛩 Company and team bonding trips throughout the year fully covered by Lumos
• 💻 Optimal WFH setup to set you up for success
• 🌴 Unlimited PTO, with minimum time off to make sure you are rested and able to be at your best
• 👶🏽 Up to 16 weeks for expecting parents
• 💰 Wellness stipend to keep you awesome and healthy
• 🏦 401k matching plan
October 29
DevOps Engineer managing AWS infrastructure and collaborating with engineering teams on Web 3.0 technology. Build and automate solutions for an advanced web form platform.
AWS
Cloud
EC2
Java
JavaScript
Microservices
Node.js
Python
SDLC
Spring
Spring Boot
October 29
Site Reliability Engineer ensuring high uptime and performance for cloud systems at Hydra Host. Collaborating with teams to integrate monitoring and QA tools for reliability and observability.
🇺🇸 United States – Remote
💵 $140k - $200k / year
💰 $10M Seed Round on 2022-04
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
Cloud
Grafana
Kubernetes
Prometheus
Python
Go
October 28
Senior Site Reliability Engineer ensuring daily operations and incident handling for large scale GPU platforms at NVIDIA. Contributing to feature design and cluster validation for optimal performance and resilience.
🇺🇸 United States – Remote
💵 $168k - $333.5k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
Kubernetes
Linux
Python
October 28
201 - 500
Senior Site Reliability Engineer for platform infrastructure in a growing travel tech company. Enhancing automated, self-service tools for engineers while ensuring performance and reliability.
🇺🇸 United States – Remote
💵 $150k - $350k / year
💰 $96M Venture Round on 2022-11
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
Cloud
Distributed Systems
DNS
Google Cloud Platform
Kubernetes
NoSQL
Python
SQL
Terraform
October 28
201 - 500
Senior Site Reliability Engineer at Hopper's Platform Infrastructure team. Building and operating cloud foundation for products used by millions of travelers worldwide.
🇺🇸 United States – Remote
💰 $96M Venture Round on 2022-11
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
Cloud
Distributed Systems
DNS
Google Cloud Platform
Kubernetes
NoSQL
Python
SQL
Terraform