Senior Site Reliability Engineer, SRE

October 22

Apply Now
Logo of Talent 360 ME

Talent 360 ME

HR Tech • B2B • Enterprise

Talent 360 ME is your dedicated HR management partner, specializing in providing outsourced HR services to startups and SMEs. Their team of specialized HR consultants helps businesses focus on growth by managing recruitment, onboarding, payroll, performance, and compliance, while ensuring access to advanced HR technologies and resources. Talent 360 aims to streamline HR processes, mitigate compliance risks, and enhance overall productivity within organizations, enabling founders and CEOs to concentrate on core business objectives.

51 - 200 employees

Founded 2018

👥 HR Tech

🤝 B2B

🏢 Enterprise

📋 Description

• Provide scalable, reliable, durable, and secure global database services for our clients’ cloud infrastructure hosted on AWS or GCP • Identify significant projects that improve reliability, cost savings, and/or revenue • Identify changes in product architecture with a data-driven approach • Influence the product roadmap for improved resiliency and reliability • Proactively work on efficiency and capacity planning • Identify parts of the system that do not scale and drive long-term resolution • Identify Service Level Indicators (SLIs) • Lead initiatives and problem definition, design, and planning • Perform and run blameless RCAs on incidents and outages • Maintain awareness and actively influence stage group plans

🎯 Requirements

• 5+ years of related experience • Performs application-specific production support, incident management, problem management, RCAs, and service restoration as needed • Collaborating with engineering and development teams to evaluate and identify optimal cloud solutions • Plan and achieve high availability, performance, and availability of the product service • Development/coding experience and skills for writing custom automation solutions • Strong understanding of web hosting infrastructure and high availability architecture • Demonstrated knowledge of fundamental cloud security (e.g., Identity and Access Management, ACL, firewalls) • Deep understanding of AWS cloud services and how to leverage them • Strong Experience in Infrastructure as Code (IaC) technologies like Terraform • Familiarity with Kubernetes-specific platform components

🏖️ Benefits

• Professional development • Flexible work arrangements

Apply Now

Similar Jobs

October 21

Senior Site Reliability Engineer managing multi-region cloud infrastructure for a B2B Fin-Tech company. Focused on reliability, performance, and scalability through automation and incident management.

Cloud

DNS

Kubernetes

Microservices

MS SQL Server

Oracle

Python

SQL

Terraform

.NET

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com