
Telecommunications • Enterprise • SaaS
Aalyria is a company dedicated to creating, organizing, and managing the world's most advanced networks to enable ubiquitous connectivity at the speed of discovery. It utilizes atmospheric laser communications technology and a software platform originally developed by Alphabet. Aalyria's platform orchestrates networks across land, sea, air, space, and beyond. Key technological components include Tightbeam, a free space optics technology, and Spacetime, a software platform for network orchestration. Aalyria is backed by significant investors and has engaged in various high-profile projects, including working with NASA and developing 5G/6G networking platforms.
51 - 200 employees
November 11
United States – Remote
$160k - $200k / year
Full Time
Lead
DevOps & Site Reliability Engineer (SRE)

• Design, build, and own the technical roadmap for Aalyria's centralized observability platform, integrating and scaling tools for metrics (Prometheus), logging (Loki), and distributed tracing (Tempo/OpenTelemetry)
• Define, implement, and manage a robust framework of Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets for our core products, ensuring we are launch-ready
• Establish and evangelize observability best practices, providing standards, documentation, and tooling (e.g., OpenTelemetry libraries) to empower our Go and Java application teams to instrument their services effectively
• Partner with core software engineers to provide the tools and insights needed to debug performance, optimize computational pipelines (including CPU/GPU workloads), and ensure the reliability of large-scale distributed systems
• Automate the deployment, scaling, and management of the entire observability stack using Infrastructure as Code (Terraform) and GitOps principles (ArgoCD)
• Partner closely with the core infrastructure team to ensure deep visibility into our Kubernetes clusters and underlying GCP and AWS environments
• Develop and lead the company's monitoring, alerting, and incident response strategy, driving a culture of proactive reliability and blameless post-mortems

• 7+ years of experience in an SRE or platform engineering role
• Deep, hands-on expertise building, scaling, and managing observability platforms (e.g., Prometheus, Grafana, Loki/ELK, OpenTelemetry, Tempo/Jaeger, Honeycomb, etc.)
• Strong production-level experience with Google Cloud Platform (GCP) and Kubernetes
• Proven mastery of Infrastructure as Code (IaC) with Terraform and GitOps principles (e.g., ArgoCD)
• Proficiency in a systems programming language, with a strong preference for Go and Python for debugging and writing tooling
• Demonstrable experience defining, implementing, and managing SLOs, SLIs, and error budgets for production services

• Competitive salary
• Comprehensive benefits (401(k), dental, vision, health, life insurance)
• Paid time off
• Equity options
• Flexible working arrangements including hybrid remote/in-office schedules
November 6
Galaxy VP, Site Reliability Engineer in charge of AWS and containerized infrastructure. Focusing on automation, reliability, and cloud best practices.
United States – Remote
Full Time
Lead
DevOps & Site Reliability Engineer (SRE)
H1B Visa Sponsor
November 5
AWS DevOps Engineer designing cloud-native applications for SAP S/4HANA processes. Optimizing AWS cost and performance in a fully remote work environment.
United States – Remote
Full Time
Senior
Lead
DevOps & Site Reliability Engineer (SRE)
November 5
DevSecOps Engineer leading customer onboarding to the Game Warden platform for national security. Working in a collaborative environment to enhance secure deployments for government and defense.
United States – Remote
$135k - $160k / year
Full Time
Senior
Lead
DevOps & Site Reliability Engineer (SRE)
October 31
AI Cloud Engineer at Raytheon Technologies leading design and optimization of scalable AI solutions on cloud platforms. Collaborating with teams to drive innovation and support mission objectives.
United States – Remote
$124k - $250k / year
Full Time
Senior
Lead
DevOps & Site Reliability Engineer (SRE)
October 29
Director of DevOps and Product Security at DDN leading operational excellence across the Infinia platform. Ensuring security and compliance while driving automation and scalability for AI workloads.
United States – Remote
$10M Funding Round on 2011-06
Full Time
Lead
DevOps & Site Reliability Engineer (SRE)
H1B Visa Sponsor