Site Reliability Engineer

Job not on LinkedIn

November 26

Apply Now
Logo of ScalePad

ScalePad

SaaS • Compliance • Security

ScalePad is a company that provides a comprehensive platform for Managed Service Providers (MSPs) to enhance client engagement and operational efficiency. With products like Lifecycle Manager, Backup Radar, and Cognition360, ScalePad offers solutions that streamline compliance, backup monitoring, and client communications. Their platform integrates seamlessly with third-party tools to offer a cohesive ecosystem that empowers MSPs to deliver superior client experiences and strategic insights. ScalePad is committed to innovation and excellence, helping MSPs transform and scale their offerings with automation and data-driven insights.

201 - 500 employees

☁️ SaaS

📋 Compliance

🔐 Security

💰 Private Equity Round on 2021-07

📋 Description

• Ensure the reliability, scalability, and efficiency of our infrastructure and development platforms • Support developer experience, automate operational tasks, and optimize system performance • Monitor and optimize system performance using observability tools like Prometheus and Grafana • Participate in the 24/7 on-call rotation, responding to and resolving system outages • Document incident responses and contribute to post-mortem analysis to improve system resilience

🎯 Requirements

• Strong proficiency in system operations, observability, and infrastructure monitoring • Full understanding of AWS offerings, including core compute, networking, storage, IAM • Experience with Infrastructure as Code (IaC) tools such as Terraform • Proficiency in scripting and automation using Python, Bash, or equivalent languages • Base knowledge of Java, Go, and Python is a strong plus • Knowledge of CI/CD pipelines and best practices for continuous integration and delivery • Experience with containerization and orchestration technologies such as Kubernetes and Docker • Familiarity with Agile methodologies and DevOps culture

🏖️ Benefits

• 100% medical and dental coverage fully employer-paid • RRSP matching after one year of employment • Monthly stipend to help offset the costs of the hybrid experience • Annual budget for professional development • Unlimited flex-time policy

Apply Now

Similar Jobs

November 25

DevOps Engineer designing, implementing, and maintaining CI/CD systems for a broadband service provider. Collaborating closely with teams to enhance deployment pipelines and improve infrastructure delivery.

Ansible

AWS

Cloud

Google Cloud Platform

Groovy

Jenkins

Kubernetes

Linux

OpenShift

Python

Terraform

November 22

FedRAMP Site Reliability Engineer ensuring federal compliance and high operational standards for Confluent Cloud systems. Collaborating with federal agencies to innovate and maintain systems for real-time data processing.

Cloud

Distributed Systems

Grafana

Java

Kubernetes

Prometheus

Python

Terraform

Go

November 21

Senior Site Reliability Engineer responsible for maintaining production systems' reliability at Zensurance. Collaborate with engineering teams, automate tasks and improve incident management processes.

AWS

Cloud

Distributed Systems

Grafana

Kubernetes

Prometheus

Splunk

Terraform

November 20

Akinox

51 - 200

Team Lead DevOps guiding and evolving the Infrastructure team for a health tech company. Improving deployment automation and collaborating across teams.

🗣️🇫🇷 French Required

Azure

Cloud

Kubernetes

November 17

Deployment Engineer at Versaterm collaborating with law enforcement for technical data interfaces. Ensuring smooth deployment processes for public safety technology with strong customer focus.

Cloud

ETL

SOAP

SQL

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com