
Crypto • Finance • Blockchain
Galaxy is a digital asset and blockchain leader helping institutions, startups, and qualified individuals shape a changing economy through innovative crypto solutions. Galaxy provides a wide array of services, including asset management, trading, lending, custodial technology, and blockchain infrastructure solutions. With a focus on both traditional finance integration and digital asset expertise, Galaxy is committed to advancing the adoption and functionality of cryptocurrencies and blockchain technologies across the globe.
November 6
🇺🇸 United States – Remote
⏰ Full Time
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor

Crypto • Finance • Blockchain
Galaxy is a digital asset and blockchain leader helping institutions, startups, and qualified individuals shape a changing economy through innovative crypto solutions. Galaxy provides a wide array of services, including asset management, trading, lending, custodial technology, and blockchain infrastructure solutions. With a focus on both traditional finance integration and digital asset expertise, Galaxy is committed to advancing the adoption and functionality of cryptocurrencies and blockchain technologies across the globe.
• Architect, deploy, and maintain robust, scalable, secure AWS-based infrastructure. • Drive adoption and optimization of EKS and Kubernetes for containerized workloads. • Support migration initiatives, moving workloads from legacy VMs to containers in AWS. • Implement and fine-tune SLOs, SLAs, and error budgets to balance innovation and stability. • Collaborate on best practices with Security and Engineering teams for workload reliability. • Build Infrastructure as Code (IaC) with Terraform; maintain compliant, repeatable environments. • Enhance CI/CD pipelines for efficient, secure, and reliable cloud delivery. • Develop and refine automated solutions for autoscaling, failover, and disaster recovery. • Design and implement metrics, logging, and tracing tools (Datadog, OpenTelemetry). • Set up robust monitoring and alerting to proactively detect and address failures. • Lead incident analysis and post-mortems; drive improvements in operational playbooks. • Serve as a subject matter expert for AWS, EKS, and cloud-native tooling within the SRE team. • Optimize AWS resources, cost management, and resiliency best practices. • Ensure secure key management and regulatory compliance for decentralized workloads.
• 8+ years in SRE, DevOps, or Infrastructure Engineering (IC capacity preferred). • Deep hands-on expertise in AWS, Kubernetes/EKS, and containerization. • Extensive IaC experience (Terraform) and cloud-native automation. • Proven track record migrating VM-based workloads to containers in AWS at scale. • Strong experience with observability stacks (Datadog, Prometheus, Grafana, OpenTelemetry). • Excellent analytical, problem-solving, and incident management abilities. • Clear communicator who thrives in team environments, collaborating cross-functionally.
• Galaxy respects diversity and seeks to provide equal employment opportunities to all employees and job applicants for employment. • We will endeavor to make a reasonable accommodation to the known limitations of a qualified applicant with a disability.
Apply NowNovember 5
AWS DevOps Engineer designing cloud-native applications for SAP S/4HANA processes. Optimizing AWS cost/performance in fully remote work environment.
AWS
Cloud
DynamoDB
Kafka
November 5
DevSecOps Engineer leading customer onboarding to the Game Warden platform for national security. Working in a collaborative environment to enhance secure deployments for government and defense.
🇺🇸 United States – Remote
💵 $135k - $160k / year
⏰ Full Time
🟠 Senior
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)
AWS
Azure
Cloud
Google Cloud Platform
Kubernetes
Python
Terraform
Go
October 31
AI Cloud Engineer at Raytheon Technologies leading design and optimization of scalable AI solutions on cloud platforms. Collaborating with teams to drive innovation and support mission objectives.
🇺🇸 United States – Remote
💵 $124k - $250k / year
⏰ Full Time
🟠 Senior
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)
AWS
Azure
Cloud
Docker
Google Cloud Platform
Java
Kubernetes
Python
October 29
Director of DevOps and Product Security at DDN leading operational excellence across Infinia platform. Ensuring security and compliance while driving automation and scalability for AI workloads.
🇺🇸 United States – Remote
💰 $10M Funding Round on 2011-06
⏰ Full Time
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
Ansible
AWS
Azure
Cloud
Google Cloud Platform
Jenkins
Terraform
October 29
DevOps Engineer focusing on software development efficiency and reliability at Creyos. Joining a diverse team to innovate healthtech solutions with automated processes.
AWS
Cloud
Python
Ruby
Ruby on Rails
Terraform