Customer Reliability Engineer – Infrastructure

🔥 1 hour ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Astronomer

Astronomer

201 - 500 employees

Founded 2018

☁️ SaaS

🤖 Artificial Intelligence

💰 $213M Series C on 2022-03

Data Fabric • SaaS • Artificial Intelligence

Astronomer is a company focused on enhancing data workflows with its platform, Astro, which is built on Apache Airflow®. It offers a fully-managed platform to streamline and elevate data orchestration, from AI-driven models to data observability and ETL processes. Astronomer provides solutions for scalable, secure, and reliable data management, catering primarily to industries such as financial services, gaming, healthcare, manufacturing, retail, and eCommerce. With extensive integrations and a focus on governance and infrastructure management, Astronomer helps organizations accelerate productivity and optimize business decisions.

📋 Description

• Provide solutions to customers to make them successful using our products. • Troubleshoot customer environments and engage in active triaging with customers • Participate in on-call rotation for weekend coverage • Provide feedback to the product development teams on customer needs and pain points. • Build out our monitoring and alerting systems. • Build and maintain automation to ensure daily operational tasks are handled as efficiently as possible. • Help direct the architecture of the products and contribute where possible. • Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide “white glove” guidance on the path to production. • Participate remotely within a fully distributed team. • Enhance and enrich customer documentation • Work with the latest technology and multi-cloud implementations

🎯 Requirements

• 5 years of experience, preferably with large, complex cloud infrastructures operating at scale • 3 years of experience with Kubernetes • Experience managing a Production distributed system with at least one major cloud provider (one or all: AWS, GCP, Azure) • Strong Linux experience • Knowledge of how to operate and monitor issues for distributed systems • Previous experience in handling customers issues (internal or external) • Strong communication skills • DevOps or CI/CD experience • Python scripting • Good troubleshooting Skills

🏖️ Benefits

• equity component • comprehensive benefits package

Apply Now

Similar Jobs

🔥 1 hour ago

ALTEN Technology USA

501 - 1000

🚀 Aerospace

⚡ Energy

Senior Electronic Component Reliability Engineer providing reliability support across Electronic Control Units for clients in aerospace, medical, automotive, and more.

🔥 3 hours ago

Jitsu

51 - 200

🚗 Transport

🔌 API

🤝 B2B

Senior DevOps/Platform Engineer evolving infrastructure for Jitsu’s logistics platform, improving automation, configuration, and security for a global engineering team.

🇺🇸 United States – Remote

💵 $120k - $147k / year

💰 $7.8M Venture Round - Jitsu on 2022-10

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🔥 5 hours ago

Cryoport Systems

1001 - 5000

🧬 Biotechnology

💊 Pharmaceuticals

🚗 Transport

Senior DevOps Engineer at Cryoport Systems designing, implementing, and maintaining sustainable cloud infrastructure. Collaborating with teams to ensure scalability, reliability, and compliance while optimizing workflows.

🔥 5 hours ago

MSD

10,000+ employees

🧬 Biotechnology

💊 Pharmaceuticals

⚕️ Healthcare Insurance

Senior Reliability Engineer leading the evolution of reliability practices for critical digital solutions at a global health care leader. Collaborating with engineering teams for system reliability and resilience.

🔥 5 hours ago

Knock

1 - 10

🔌 API

☁️ SaaS

🏢 Enterprise

DevOps Engineer at Knock, responsible for platform scalability and reliability. Building, scaling, and maintaining core services as a remote-first team.