DevOps Team Lead

October 30


Runware

Artificial Intelligence • API • Media

Runware is a flexible generative AI platform that specializes in high-quality media creation for images and videos through an affordable and fast API. It is capable of advanced tasks such as image generation, video inference, upscaling, and background removal, making it an essential tool for developers seeking to enhance their projects with AI technology. Powered by a custom Sonic Inference Engine™ and renewable energy, Runware ensures quick and efficient media generation without requiring complex infrastructure or machine learning expertise.

11 - 50 employees

Founded 2023

📋 Description

- Lead the design and operation of Runware’s infrastructure and orchestration systems
- Build automation and tooling to streamline model deployments, scaling, and hardware utilisation across distributed nodes
- Drive observability, alerting, and reliability practices to detect and resolve issues quickly and proactively
- Collaborate with engineers to optimise throughput, latency, and platform performance at every layer of the stack
- Develop and maintain infrastructure as code and deployment automation to ensure consistency and reproducibility across environments
- Establish and continuously evolve incident management, post-mortems, and reliability reviews as core engineering practices
- Mentor and coach engineers to think operationally, designing systems that fail gracefully and scale predictably
- Champion forward-looking improvements to our orchestration layer, hardware management, and overall infrastructure efficiency

🎯 Requirements

- Have experience operating production systems on bare metal or hybrid environments such as HPC or GPU clusters, optimised for performance and low latency
- Are comfortable writing automation and systems tooling in Python, Go, or similar languages
- Understand container runtimes like Docker and containerd, and have built or worked with orchestration systems beyond Kubernetes
- Are fluent in observability and debugging practices across distributed systems, using logs, metrics, traces, and profiling to drive insight and reliability
- Care deeply about reliability, efficiency, and engineering quality, and know how to embed those values into team culture and everyday practice
- Thrive in fast-moving, evolving environments where impact is measured by how much better systems and teams perform over time

🏖️ Benefits

We’re a remote-first collective, meeting in person twice a year to plan, brainstorm, celebrate wins, and enjoy some face-to-face time. We have core hours for collaborative work and calls, but outside of those your calendar is yours. Work the hours that let you perform at your peak while also building a healthy life.

Our release cycles are fast and intense, but they’re followed by real downtime. After big pushes we expect the team to unplug, recharge, and come back ready and stronger than ever for the next leap.

- **Generous paid time off** – vacation, sick days, public holidays
- **Meaningful stock options** – share in the upside you create
- **Remote-first setup** – work from home anywhere we can employ you
- **Flexible hours** – own your schedule outside core collaboration blocks
- **Family leave** – paid maternity, paternity, and caregiver time
- **Company retreats** – twice-yearly gatherings in inspiring locations


