Site Reliability Engineering Manager

September 18

SS&C Technologies

Banking • Healthcare • Fintech

SS&C Technologies is a global leader in financial services and healthcare technology, recognized as the world’s largest independent hedge fund and private equity administrator, as well as the largest mutual fund transfer agency. The company offers a comprehensive range of solutions including asset management, banking, healthcare, insurance, and wealth management, leveraging proprietary software and extensive expertise to meet the operational needs of clients across multiple industries.

10,000+ employees

Founded 1986

📋 Description

• Lead, mentor, and grow a high-performing team of SREs
• Foster a culture of ownership, continuous learning, and operational excellence
• Drive career development and performance management for team members
• Own the availability, latency, performance, and capacity of critical services
• Define and monitor SLAs, SLOs, and SLIs to ensure service reliability
• Lead incident response and postmortem processes, driving root cause analysis and long-term fixes
• Champion automation to reduce toil and improve system reliability
• Oversee the development and maintenance of internal observability, tools and platforms
• Collaborate with engineering and DevOps teams to embed reliability into the software development lifecycle
• Partner with product, engineering, DevOps and Customer Support teams to align on priorities and roadmaps
• Contribute to the strategic direction of infrastructure and reliability initiatives
• Advocate for best practices in observability, CI/CD, and infrastructure as code

🎯 Requirements

• Proven experience managing or leading SRE, DevOps, or infrastructure teams
• Strong background in systems engineering, cloud platforms (AWS, Azure), and container orchestration (Kubernetes)
• Proficiency in monitoring, alerting, and incident management tools (Prometheus, Grafana, PagerDuty)
• Solid understanding of networking, distributed systems, and performance tuning
• Excellent communication, leadership, and stakeholder management skills
• Preferred: Experience in a high-scale, high-availability SaaS environment
• Preferred: Familiarity with security and compliance in cloud-native environments

🏖️ Benefits

• Professional Development Reimbursement, including access to SS&C University
• Competitive holiday scheme
• Competitive benefits designed to support the well-being of our staff
• Diversity & Inclusion initiatives
• Hands-On, Team-Customised Training throughout your career
