Senior Cloud Engineer – Azure/OpenShift

🕒 May 13

Thinkahead Consultant Psychologist Pty Ltd

1 - 10 employees

Thinkahead is a privately owned psychology firm working across both clinical private practice and corporate consulting.

📋 Description

• Lead the support and operation of cloud infrastructure, platform services, identity, networking, security controls, and operational tooling across customer environments.
• Architect and lead the deployment of moderately complex cloud solutions.
• Understand the performance, scaling, and functional characteristics of software technologies.
• Understand open-source and cloud use cases and recommend the standard design patterns (best practices) commonly used in such solutions.
• Own complex incidents, escalations, and problem investigations; perform advanced troubleshooting, coordination, and service restoration, and follow through to a durable resolution.
• Plan and execute complex changes and recurring operational activities, including provisioning, access changes, maintenance events, backup and recovery validation, patching coordination, and platform hygiene.
• Serve as a senior escalation point within the on-call rotation for major incidents, high-impact issues, and customer-approved after-hours change activity.
• Follow and reinforce established ITSM processes for incident, request, change, problem, escalation, documentation, and customer-facing status communication.
• Develop and maintain runbooks, SOPs, standards, knowledge articles, and technical documentation that improve consistency and service quality.
• Mentor other Cloud Engineers, review work for quality and completeness, and provide technical guidance on operational best practices.
• Drive monitoring, alerting, logging, tagging, policy, compliance, and cost-visibility improvements that strengthen managed cloud operations.
• Use scripting, automation, and AI to reduce repetitive effort, improve consistency, and scale service delivery.
• General familiarity with DevOps/SRE tooling is required but is not the primary emphasis of the role.
• Participate in customer meetings, service reviews, and advisory discussions; translate technical issues, risks, and improvement opportunities into clear business-facing communication.
• Operate and support Red Hat OpenShift (Kubernetes) clusters in production, including cluster health, upgrades, scaling, and lifecycle management.
• Manage OpenShift access and security controls, including RBAC, SCCs, NetworkPolicies, secrets management, and certificate/ingress considerations.
• Troubleshoot platform and workload issues across Kubernetes/OpenShift constructs (nodes, operators, routes/ingress, services, deployments, pods, persistent volumes) and coordinate remediation with application, network, and security teams.
• Implement and validate platform backup, restore, and disaster recovery procedures (e.g., etcd, cluster resources, and persistent data) in accordance with customer requirements.
• Support platform automation and standardization efforts using infrastructure as code and GitOps practices (e.g., Terraform, Ansible, Helm, Argo CD) to improve repeatability and reduce operational risk.
• Define and improve observability for cloud and OpenShift platforms (metrics, logs, traces), tune alerting to reduce noise, and contribute to availability, performance, and capacity planning.
• Other job duties as assigned.

🎯 Requirements

• 5+ years in customer-facing IT infrastructure, cloud operations, systems administration, or managed services support, including work in production environments.
• Strong operational expertise in at least one major cloud platform, with the ability to lead complex support and administration activities in Azure.
• Experience with other clouds such as GCP, AWS, and OCI is strongly preferred.
• 3+ years of experience supporting a production OpenShift environment (on-premises, ROSA, ARO, etc.).
• Experience leading complex incidents, escalations, change execution, and problem investigations in production environments.
• Experience with Windows and/or Linux server operations, networking fundamentals, identity and access management, monitoring, governance, and operational documentation.
• Experience in a managed services, consulting, or multi-customer support environment, ideally supporting complex enterprise customers (preferred).
• Strong working knowledge of PowerShell, Python, Bash, infrastructure as code, automation, CI/CD, or related platform tooling used to improve cloud operations (preferred).
• Relevant advanced cloud, operations, or platform certifications (preferred).

🏖️ Benefits

• Medical, Dental, and Vision Insurance
• 401(k)
• Paid company holidays
• Paid time off
• Paid parental and caregiver leave
• Plus more! See https://www.aheadbenefits.com/ for additional details.
