DevOps Engineer

October 29

Apply Now
Logo of Lumos

Lumos

Web 3 • Compliance • SaaS

Lumos is a company that appears to be encountering operational issues, as indicated by an account suspension notice. The notice suggests that the company may provide online services or content that is currently unavailable due to a restriction, which could relate to hosting or compliance problems.

51 - 200 employees

Founded 2020

🌐 Web 3

📋 Compliance

☁️ SaaS

📋 Description

• Develop and maintain infrastructure as code (IaC) to ensure repeatability and scalability. • Implement monitoring, logging, and alerting systems to proactively detect and resolve issues. • Ensure high availability and disaster recovery mechanisms are in place. • Design, implement, and optimize CI/CD pipelines to accelerate software delivery. • Automate deployment and rollback processes for minimal downtime. • Enforce best practices for build, test, and release automation. • Optimize cloud resource utilization to balance performance and cost. • Implement auto-scaling, load balancing, and caching strategies for improved efficiency. • Continuously evaluate and recommend cost-effective infrastructure solutions. • Work closely with engineering teams to align DevOps strategies with business goals. • Promote a culture of continuous improvement through automation and innovation. • Provide mentorship and training to other engineers on DevOps best practices.

🎯 Requirements

• Proficiency in cloud platforms (AWS, GCP, or Azure) and containerization (Docker, Kubernetes). • Strong experience with Infrastructure as Code (Terraform, CloudFormation, Pulumi, etc.). • Deep knowledge of CI/CD tools (Jenkins, GitHub Actions, GitLab CI/CD, ArgoCD, etc.). • Knowledge with build optimization tools/frameworks (poetry, pants/bazel, docker, etc.). • Experience with observability tools (Datadog, Prometheus, Grafana, ELK Stack, etc.) and defining SLOs. • Strong bias towards software development with no fear of writing/contributing code to large codebases. • Comfortable reading, debugging and optimizing large codebases (Python experience preferred). • Ability to diagnose and resolve infrastructure and performance issues. • Strong debugging skills across distributed systems and microservices architectures. • Understanding of security principles such as IAM, encryption, network security, and secrets management. • Familiarity with compliance frameworks and security tooling (e.g., Wiz, Vault, Falco, AWS Security Hub). • Ability to work cross-functionally with engineering, security, and product teams. • Strong written and verbal communication skills to document and share knowledge effectively. • Experience in an agile development environment and ability to contribute to DevOps culture.

🏖️ Benefits

• 💯 Remote work culture (+/-4 hours Pacific Time) • ⛑ Medical, Vision, & Dental coverage covered by Lumos • 🛩 Company and team bonding trips throughout the year fully covered by Lumos • 💻 Optimal WFH setup to set you up for success • 🌴 Unlimited PTO, with • minimum time off to make sure you are rested and able to be at your best • 👶🏽 Up to 16 weeks for expecting parents • 💰 Wellness stipend to keep you awesome and healthy • 🏦 401k matching plan

Apply Now

Similar Jobs

October 29

DevOps Engineer managing AWS infrastructure and collaborating with engineering teams on Web 3.0 technology. Build and automate solutions for an advanced web form platform.

AWS

Cloud

EC2

Java

JavaScript

Microservices

Node.js

Python

SDLC

Spring

Spring Boot

SpringBoot

October 29

Site Reliability Engineer ensuring high uptime and performance for cloud systems at Hydra Host. Collaborating with teams to integrate monitoring and QA tools for reliability and observability.

Cloud

Grafana

Kubernetes

Prometheus

Python

Go

October 28

Senior Site Reliability Engineer ensuring daily operations and incident handling for large scale GPU platforms at NVIDIA. Contributing to feature design and cluster validation for optimal performance and resilience.

Kubernetes

Linux

Python

October 28

Hopper

201 - 500

Senior Site Reliability Engineer for platform infrastructure in a growing travel tech company. Enhancing automated, self-service tools for engineers while ensuring performance and reliability.

Cloud

Distributed Systems

DNS

Google Cloud Platform

Kubernetes

NoSQL

Python

SQL

Terraform

October 28

Hopper

201 - 500

Senior Site Reliability Engineer at Hopper's Platform Infrastructure team. Building and operating cloud foundation for products used by millions of travelers worldwide.

Cloud

Distributed Systems

DNS

Google Cloud Platform

Kubernetes

NoSQL

Python

SQL

Terraform

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com