
1001 - 5000 employees
Founded 2011
₿ Crypto
💸 Finance
💳 Fintech
Crypto • Finance • Fintech
Kraken Digital Asset Exchange is a cryptocurrency platform that facilitates the buying and selling of over 200 cryptocurrencies, including Bitcoin, Ethereum, and many others. Founded in 2011, Kraken provides a comprehensive suite of features for both beginner and advanced traders, such as advanced trading interfaces and margin trading. The platform emphasizes industry-leading security, deep liquidity, and 24/7 customer support, making it a trusted choice for users worldwide. Kraken caters to individual investors as well as institutional clients, offering services like OTC trading and custody. The company is committed to transparency with its proof of reserves and mission-driven values. Kraken operates globally, supporting clients in over 190 countries, with a quarterly trading volume exceeding $207 billion. However, users are advised of the high risk of crypto investments and the lack of regulation in some jurisdictions.
🔥 23 hours ago
🇺🇸 United States – Remote
💵 $96k - $192k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
Improve your chances of getting an interview by checking your resume score before you apply.

1001 - 5000 employees
Founded 2011
₿ Crypto
💸 Finance
💳 Fintech
Crypto • Finance • Fintech
Kraken Digital Asset Exchange is a cryptocurrency platform that facilitates the buying and selling of over 200 cryptocurrencies, including Bitcoin, Ethereum, and many others. Founded in 2011, Kraken provides a comprehensive suite of features for both beginner and advanced traders, such as advanced trading interfaces and margin trading. The platform emphasizes industry-leading security, deep liquidity, and 24/7 customer support, making it a trusted choice for users worldwide. Kraken caters to individual investors as well as institutional clients, offering services like OTC trading and custody. The company is committed to transparency with its proof of reserves and mission-driven values. Kraken operates globally, supporting clients in over 190 countries, with a quarterly trading volume exceeding $207 billion. However, users are advised of the high risk of crypto investments and the lack of regulation in some jurisdictions.
• Design, build, and operate the infrastructure layer supporting AI agent workflows in production • Ensure reliability, scalability, and observability of agentic systems across internal and external products • Design and develop platform services, APIs, SDKs, and self-service capabilities that allow engineering teams to easily consume AI infrastructure and agent platform services • Manage and maintain the compute, orchestration, and serving infrastructure powering model inference and agent execution • Implement robust monitoring, alerting, and incident response procedures tailored to AI/ML workloads • Utilize Infrastructure as Code (IaC) tools such as Terraform to provision and manage cloud (AWS) infrastructure components • Build and maintain CI/CD pipelines that support rapid, reliable deployment of AI services and agent workflows • Define and implement guardrails, failure handling, and recovery patterns specific to agentic and LLM-powered systems • Collaborate with AI and Data Engineering teams to translate experimental agent prototypes into hardened production systems • Manage containerized workloads using Kubernetes, ensuring efficient deployment, scaling, and orchestration of AI services • Implement access controls and security best practices across AI infrastructure environments • Document architecture, runbooks, and best practices to support knowledge sharing across the team.
• 5+ years of experience as a Site Reliability Engineer, Infrastructure Engineer, Platform Engineer, or similar role in a production environment • Hands-on experience supporting ML infrastructure, model serving, or MLOps workflows in production • Experience building developer platforms, internal tooling, APIs, or SDKs consumed by engineering teams at scale • Strong understanding of platform engineering principles, including developer experience, self-service infrastructure, and API-driven platform design • Proficiency with Infrastructure as Code tools, particularly Terraform • Experience with containerization and orchestration, particularly Kubernetes and Docker • Solid understanding of cloud infrastructure, preferably AWS • Strong scripting skills (bash/shell) and proficiency in at least one programming language (Python preferred) • Experience designing and operating observability, monitoring, and alerting systems • Experience implementing incident response procedures and participating in on-call rotations • Strong collaboration skills working across data, AI, and engineering teams • High ownership mindset in a fast-moving, high-stakes production environment.
• Offers Equity • Offers Bonus • Wellness allowance • Health insurance (medical, dental, vision) • 401(k)
Apply Now🕒 Yesterday
Lead DevOps Engineer at Prominent Edge working on varied technology stacks and automating infrastructure. Delivering scalable solutions and ensuring security and performance in environments.
Ansible
AWS
Azure
Chef
Cloud
EC2
Google Cloud Platform
Groovy
JavaScript
Jenkins
Kubernetes
Linux
Logstash
Puppet
Python
Go
🕒 Yesterday
Designing and deploying AI-driven software features for data center networks at Cisco. Collaborating with teams to innovate and enhance client experiences in a fast-paced environment.
🇺🇸 United States – Remote
💵 $149.1k - $218.9k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
Ansible
Cloud
Switching
Terraform
🕒 Yesterday
Software Architect leading architectural direction on DevOps/AI/LLM technologies for ReliaSoft's cloud and desktop products. Collaborating with teams to enhance product capabilities and modernize systems.
🇺🇸 United States – Remote
💵 $100k - $130k / year
⏰ Full Time
🟠 Senior
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)
Cloud
🕒 Yesterday
DevOps Engineer at Mind Computing responsible for AWS infrastructure and automation. Implementing cloud architecture and CI/CD pipelines for project with Department of Veterans Affairs.
🇺🇸 United States – Remote
💵 $105k - $115k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
AWS
Cloud
EC2
Python
Terraform
🕒 Yesterday
Associate Site Reliability Engineer maintaining service reliability and scalability through agile teamwork at Red Hat. Collaborating and resolving customer issues while contributing to code and quality assurance.
🇺🇸 United States – Remote
💵 $92.1k - $147.5k / year
💰 Corporate Round on 1999-03
⏰ Full Time
🟢 Junior
🟡 Mid-level
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
AWS
Azure
Cloud
Google Cloud Platform
Kubernetes
Linux
Microservices
Python
Go