Senior Cloud Operations Engineer

🔥 0 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Progress

Progress

1001 - 5000 employees

Founded 1981

🤖 Artificial Intelligence

💰 Post-IPO Equity on 1995-01

Software • Artificial Intelligence • Infrastructure Management

Progress is a software company that offers a wide range of products and services aimed at improving digital experiences, data management, and infrastructure management. The company provides AI-powered applications across its platforms, enabling businesses to develop and manage these applications efficiently. Progress offers solutions such as the Progress Data Platform for data, AI, and analytics projects, and Sitefinity for digital content and experience management. Additionally, Progress provides infrastructure management products to help streamline network and application management, as well as data connectivity tools under the DataDirect brand. The acquisition of ShareFile has expanded Progress's capabilities in secure file transfer and collaboration. With a strong focus on artificial intelligence, Progress works to deliver innovative solutions for its customers and partners across various industries.

📋 Description

• Own and operate the cloud infrastructure that powers Progress Agentic RAG across multiple cloud providers and regions. • Design and run production-grade, multi-cloud platforms using Infrastructure as Code and GitOps principles. • Lead our GitOps-driven infrastructure workflows, ensuring reliable, secure, and auditable changes. • Operate and scale Kubernetes environments globally, supporting highly available, secure, and scalable workloads. • Enable platform and infrastructure delivery through modern CI/CD and automation practices. • Design and maintain secure, zero-trust networking and identity models across cloud and on‑prem environments. • Build and evolve monitoring, incident response, and operational readiness for a 24/7 production platform. • Collaborate with engineering, security, and product teams to continuously improve reliability and developer experience. • Mentor engineers and help define infrastructure standards, documentation, and best practices.

🎯 Requirements

• Strong experience designing and operating cloud or platform infrastructure at scale, including high availability, security, and disaster recovery. • Deep hands-on expertise with Terraform and Infrastructure as Code. • Proven experience running Kubernetes in production (GKE, EKS, AKS), including scaling, security, and observability. • Experience with GitOps-based workflows and tools such as ArgoCD or similar. • Solid hands-on experience with AWS and/or GCP and their core networking, compute, storage, and identity services. • Experience building and maintaining CI/CD pipelines, ideally with GitHub Actions. • Good understanding of cloud networking and identity, including zero-trust concepts and workload identity. • Ability to automate operational tasks using Python/Go. • Proven experience architecting on-premises infrastructure and managing Kubernetes on bare metal. • Hands-on expertise in hybrid release management and artifact distribution for restricted environments. • Strong communication and collaboration skills in cross-functional, distributed teams. • Comfortable owning complex systems end to end and making decisions in production environments. • A proactive, automation-first mindset with a focus on operational excellence. • Willingness to mentor others and contribute to shared standards and documentation. • Comfortable working in a remote-first, global environment.

🏖️ Benefits

• Generous remuneration package • Employee Stock Purchase Plan Enrollment • Vacation, Family, and Health 23 vacation days annually • Birthday day off • Community service time off • International Women's Day - March 8 is an official holiday for all employees • Life and Medical Insurance

Apply Now

Similar Jobs

🕒 3 days ago

Airalo

51 - 200

📡 Telecommunications

Senior Site Reliability Engineer building and maintaining reliable systems for Airalo's eSIM platform. Collaborating with software engineers and leading SRE principles for innovative solutions.

AWS

Java

Kubernetes

Prometheus

Python

SDLC

Terraform

Go

🕒 5 days ago

Devoteam

5001 - 10000

🤖 Artificial Intelligence

🔒 Cybersecurity

Cloud Engineer (AWS) focusing on DevOps for a European consulting firm with remote work from Spain. Responsible for maintaining real-time data flows and cloud application connectivity.

🗣️🇪🇸 Spanish Required

Apache

AWS

Grafana

Kafka

🕒 June 19

Devoteam

5001 - 10000

🤖 Artificial Intelligence

🔒 Cybersecurity

Google Workspace Deployment Engineer at Devoteam planning and executing Google Workspace deployments for clients. Responsible for management and user adoption strategies in cloud environments.

Cloud

🕒 June 19

QAD

1001 - 5000

🏢 Enterprise

☁️ SaaS

Senior Site Reliability Engineer at Redzone, ensuring reliability and performance of mission-critical services. Evolving SRE practices while driving automation and operational excellence within the team.

Distributed Systems

🕒 June 18

Tempo Software

201 - 500

☁️ SaaS

🏢 Enterprise

⚡ Productivity

Site Reliability Engineer at Tempo working on infrastructure to support various global engineering products. Collaborating with teams and ensuring high availability and performance standards.

Ansible

AWS

Cloud

Docker

Java

Kotlin

Kubernetes

Linux

Terraform