Senior DevOps Engineer

Job not on LinkedIn

October 22

Apply Now
Logo of MetaMask

MetaMask

Crypto • Web 3 • Fintech

MetaMask is a leading self-custodial crypto wallet that serves as a gateway to blockchain applications. Available as a browser extension and mobile app, it enables users to track and manage their web3 assets securely and conveniently. With features such as buying, selling, staking, and bridging crypto tokens, MetaMask empowers over 100 million users worldwide to control their digital assets and interact on the decentralized web with privacy and security.

51 - 200 employees

Founded 2016

₿ Crypto

🌐 Web 3

💳 Fintech

📋 Description

• Architect, build, and maintain AWS cloud infrastructure supporting Linea Mainnet/L2 nodes and supporting services. • Design and optimize Kubernetes clusters for scaling, resiliency, and healthy node operations, leveraging Karpenter autoscaling and spot instance capabilities. • Implement infrastructure-as-code using Terraform for reproducible, secure, and audit-ready deployments. • Own monitoring, alerting, and observability pipelines (Grafana, Prometheus, Loki) for end-to-end production health and actionable insights. • Drive automation of deployment and operational workflows, enabling zero-downtime upgrades and rapid rollbacks. • Participate in incident response, root cause analysis, and postmortem reporting for production blockchain services. • Collaborate closely with engineering teams to identify opportunities for reliability and performance improvements. • Document infrastructure, operational protocols, and DevOps best practices, ensuring knowledge sharing and team alignment. • Stay current with new tools and cloud advancements relevant to blockchain, DevOps, and Kubernetes ecosystems.

🎯 Requirements

• Senior/Staff experience with AWS cloud services and advanced Kubernetes ops, in production-grade environments. • Proven experience in infrastructure-as-code, particularly Terraform, and delivering best practices for security and scalability. • Proficiency in monitoring/observability tools: Grafana, Prometheus, Loki. • Hands-on with Kubernetes autoscaling (Karpenter or Cluster Autoscaler) and EC2 spot instance cost optimization. • Strong scripting/programming skills (Bash, Python, Go, or similar). • Strong troubleshooting, communication, and documentation abilities. • Blockchain/Layer 2 protocol, cryptography, and Web3 infrastructure experience preferred but not required. • Experience working in agile, high-performance teams within top technology environments.

🏖️ Benefits

• Health insurance • 401(k) matching • Flexible work hours • Paid time off • Remote work options

Apply Now

Similar Jobs

October 21

Site Reliability Engineer responsible for building and maintaining libraries and infrastructure at Circle. Collaborating with teams to enhance software shipping experience and support rapid development.

AWS

Azure

Cloud

Google Cloud Platform

Java

Kubernetes

Microservices

SQL

Go

October 17

Senior DevOps Engineer specializing in cloud technologies at Cyderes. Responsible for maintaining system stability and leading initiatives for improvement.

Ansible

AWS

Azure

Chef

Cloud

Cyber Security

Docker

Google Cloud Platform

Grafana

Jenkins

Kubernetes

MySQL

Postgres

Prometheus

Puppet

SaltStack

SDLC

Spinnaker

SQL

Terraform

VMware

October 16

Site Reliability Engineer joining Tecsys responsible for optimizing and maintaining performance in mission-critical SaaS environments. Collaborating with teams to drive automation and incident management.

Ansible

AWS

Cloud

EC2

Java

Jenkins

Kubernetes

Python

Terraform

October 15

Senior DevOps Engineer building risk management solutions for a cybersecurity platform. Collaborating with development teams to optimize CI/CD and cloud infrastructure.

Ansible

Cloud

Docker

Firewalls

JavaScript

Jenkins

Kubernetes

Linux

Python

Terraform

October 15

DevOps Developer responsible for backend development and DevOps practices on AWS platform. Collaborating with teams to maintain cloud-native services and infrastructure in a remote-first environment.

AWS

Cloud

Distributed Systems

Docker

EC2

Kubernetes

Linux

Microservices

Postgres

Python

RabbitMQ

Redis

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com