Senior Site Reliability Engineer, Compute Platform Services Team

🔥 0 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Akamai Technologies

Akamai Technologies

5001 - 10000 employees

🔒 Cybersecurity

💰 Post-IPO Equity on 2001-07

Cloud Computing • Cybersecurity • Content Delivery

Akamai Technologies is a leading cloud services provider that specializes in delivering security, cloud computing, and content delivery solutions. It offers a range of services such as API security, DDoS protection, and performance optimization for web applications, ensuring secure and reliable user experiences. With a robust global infrastructure, Akamai empowers businesses to streamline their digital presence while safeguarding against various cyber threats and enhancing application performance.

📋 Description

• Collaborating with our support, operations and engineering teams, investigate and troubleshoot complex problems. • Developing processes, plans, and infrastructure to deploy new software components and updates safely and efficiently at scale. • Participating in on-call rotations, guiding restoration and repair of service-impacting issues. • Improving our system monitoring and analysis platform to speed error detection and remediation, enhancing performance and reliability.

🎯 Requirements

• Have 7+ years of relevant experience and a Bachelors degree in Computer Science or related field • Possess expert level experience in a Systems engineering or DevOps or Software engineering role, working with large scale distributed systems. • Can troubleshoot any kind of systemic issues and develop large scale automations. • Demonstrate proficiency in Python or Golang and hands-on experience with SaltStack, Ansible, and Terraform for infrastructure automation. • Demonstrate expertise with observability or monitoring tools like Prometheus, Grafana, ELK/OpenSearch, Datadog, and Splunk. • Gain experience with any cloud platform, such as AWS, GCP, Azure, or an equivalent alternative.

🏖️ Benefits

• We support your health, well-being, finances, and life beyond work. See our benefits. • FlexBase adapts to your job's needs. • Our commitment to providing employees with an exceptional workplace experience. It’s not about telling employees where to work; it’s about supporting employees to do their best work.

Apply Now

Similar Jobs

🔥 7 hours ago

BETSOL

501 - 1000

🏢 Enterprise

☁️ SaaS

Senior Cloud Engineer at BETSOL building and operating cloud portal workloads across Azure and GCP. Focused on DevOps and DevSecOps with AI-first development practices.

Ansible

Azure

Cloud

Google Cloud Platform

Grafana

JavaScript

Jenkins

Kubernetes

Prometheus

Python

Terraform

TypeScript

Vault

🔥 8 hours ago

Cisco

10,000+ employees

🔧 Hardware

🔐 Security

🏢 Enterprise

Site Reliability Engineer focusing on automation and compliance workflows in Meraki's infrastructure. Building technical solutions to enhance security and efficiency for Cisco Meraki.

Ansible

AWS

Cloud

Docker

Grafana

Kubernetes

Linux

Python

Terraform

Go

🔥 8 hours ago

Cisco

10,000+ employees

🔧 Hardware

🔐 Security

🏢 Enterprise

Senior Site Reliability Engineer maintaining and improving Kubernetes-based platform for Meraki cloud. Collaborating with cross-functional teams to enhance reliability and compliance standards.

AWS

Azure

Cloud

Distributed Systems

Grafana

Kubernetes

Linux

Prometheus

Python

Terraform

Go

🕒 Yesterday

Cisco

10,000+ employees

🔧 Hardware

🔐 Security

🏢 Enterprise

Site Reliability Engineer responsible for Kubernetes platform support at Cisco. Collaborating with senior engineers to enhance reliability, scalability, and operational efficiency.

AWS

Azure

Cloud

Kubernetes

Linux

Python

Terraform

Go

🕒 Yesterday

Cisco

10,000+ employees

🔧 Hardware

🔐 Security

🏢 Enterprise

Site Reliability Engineer designing and operating production-grade Kubernetes platforms at Cisco Meraki. Collaborating with teams to improve system reliability and ensure developer-friendly environments.

Cloud

Grafana

Jenkins

Kubernetes

Linux

Prometheus

Python

Terraform

Go