Site Reliability Engineer – II

🕒 March 31

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Akamai Technologies

Akamai Technologies

5001 - 10000 employees

🔒 Cybersecurity

💰 Post-IPO Equity on 2001-07

Cloud Computing • Cybersecurity • Content Delivery

Akamai Technologies is a leading cloud services provider that specializes in delivering security, cloud computing, and content delivery solutions. It offers a range of services such as API security, DDoS protection, and performance optimization for web applications, ensuring secure and reliable user experiences. With a robust global infrastructure, Akamai empowers businesses to streamline their digital presence while safeguarding against various cyber threats and enhancing application performance.

📋 Description

• Building and maintaining dashboards, alerts, and monitoring for inference workloads using Akamai's existing observability platform • Writing automation and tooling in Python or Go to reduce operational toil and improve system reliability • Building and improving runbooks for inference-specific operational procedures, integrating into Akamai's existing incident management processes • Contributing to SLO tracking and reporting, identifying trends and areas for improvement • Supporting CI/CD pipeline maintenance, deployment safety checks, and rollback procedures • Collaborating with product engineering teams to troubleshoot complex problems across the stack • Participating in on-call rotations, responding to production incidents, and conducting blameless post-mortems

🎯 Requirements

• 2+ years of experience in Site Reliability Engineering • Bachelor's Degree or its equivalent experience • Coding ability in at least one programming language (Python or Go) • Experience with Linux systems administration and ability to troubleshoot complex infrastructure issues • Familiarity with Kubernetes and containerization concepts • Experience with monitoring and observability tools (Prometheus, Grafana, or similar) • Exposure to CI/CD pipelines and infrastructure-as-code tools (Terraform, SaltStack, or equivalent) • Willingness to learn and grow, with genuine curiosity about AI infrastructure and distributed systems

🏖️ Benefits

• Healthcare • 401K savings plan • Company holidays • Vacation (in the form of PTO) • Sick time • Family friendly benefits including parental leave • Employee assistance program including a focus on mental and financial wellness

Apply Now

Similar Jobs

🕒 March 31

Tessera Labs

11 - 50

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

Cloud Infrastructure/DevOps Engineer responsible for building multi-cloud infrastructure for AI systems. Collaborating with various teams and automating workflows for efficiency.

AWS

Azure

Cloud

Google Cloud Platform

Kubernetes

Oracle

Python

Terraform

🕒 March 31

Ivanti

1001 - 5000

🏢 Enterprise

🔐 Security

☁️ SaaS

Site Reliability Engineer managing cloud-based SaaS applications for Ivanti. Collaborating with global teams to enhance reliability and automation in a dynamic environment.

Ansible

Apache

AWS

Azure

Cloud

ElasticSearch

Java

Jenkins

Kafka

Linux

MongoDB

NGINX

Postgres

Python

Redis

Splunk

SQL

Go

.NET

🕒 March 31

Technical Lead role assisting with DevOps strategy and overseeing microservices deployment architecture for development teams. Providing hands-on support and expertise for automation and Continuous Delivery pipeline.

Cloud

Docker

J2EE

Java

Jenkins

Kubernetes

Linux

Microservices

NoSQL

Shell Scripting

Spring

Spring Boot

SpringBoot

🕒 March 31

Vimeo

1001 - 5000

📱 Media

☁️ SaaS

🏢 Enterprise

Backend Software Developer / DevOps at Duckietown, focusing on robotics education and backend development. Responsibilities include ownership of backend infrastructure and collaboration on complex software systems.

Docker

Postgres

Python

SQL

🕒 March 31

Veeva Systems

1001 - 5000

☁️ SaaS

⚕️ Healthcare Insurance

💊 Pharmaceuticals

Release Engineer at Veeva Systems, a SaaS leader, building cloud applications. Collaborating with teams to lead product releases and ensure deployment readiness.

Jenkins

Linux

SDLC

Unix