Senior Site Reliability Engineer

🕒 March 31

Akamai Technologies

5001 - 10000 employees

🔒 Cybersecurity

💰 Post-IPO Equity on 2001-07

Cloud Computing ‱ Cybersecurity ‱ Content Delivery

Akamai Technologies is a leading cloud services provider that specializes in delivering security, cloud computing, and content delivery solutions. It offers a range of services such as API security, DDoS protection, and performance optimization for web applications, ensuring secure and reliable user experiences. With a robust global infrastructure, Akamai empowers businesses to streamline their digital presence while safeguarding against various cyber threats and enhancing application performance.

📋 Description

‱ Owning reliability workstreams for Akamai's serverless inference platform
‱ Building automation and tooling, contributing to architecture and operational decisions
‱ Managing observability for AI workloads, including telemetry, dashboards, alerts, SLO/SLI tracking
‱ Writing automation and tooling to reduce operational toil and improve incident response
‱ Integrating AI workloads into incident management processes
‱ Collaborating with product engineering teams to improve reliability and ensure operational readiness for product releases
‱ Contributing to capacity planning, autoscaling configuration, and workload scheduling for AI compute infrastructure

🎯 Requirements

‱ 5+ years of experience in SRE, infrastructure engineering, or platform engineering, working with large-scale distributed systems
‱ Extensive experience with Kubernetes and containerization at scale
‱ Experience defining SLOs and working with observability tools such as Prometheus, Grafana, and distributed tracing
‱ Coding ability in Python or Go for automation and tooling, with experience in CI/CD pipelines, deployment safety, and infrastructure-as-code
‱ Interest in or experience with AI/ML infrastructure, model serving, or GPU workloads
‱ Ability to take ownership of problems and drive them to resolution independently

đŸ–ïž Benefits

‱ Health insurance
‱ 401K savings plan
‱ Company holidays
‱ Vacation (in the form of PTO)
‱ Sick time
‱ Family friendly benefits including parental leave
‱ Employee assistance program including a focus on mental and financial wellness
