Site Reliability Engineer, Core Streaming

🕒 April 12

Yelp

1001 - 5000 employees

Founded 2004

Yelp is a platform that connects consumers with local businesses, allowing users to discover and review a wide variety of services including restaurants, home services, and automotive services. It aims to help consumers find trusted recommendations for goods, services, and experiences in their local area, while offering business owners tools to manage customer interactions and promote their offerings.

📋 Description

• Design, deploy, and maintain large-scale Kafka event streaming infrastructure across hybrid and multi-cloud environments.
• Collaborate with engineers to enable new features, ensure data pipeline reliability, and advise on best practices for real-time data processing.
• Execute and automate Kafka cluster upgrades, migrations, and major version rollouts with minimal impact to critical services.
• Build or enhance self-service capabilities and automation for cluster operations, scaling, and incident recovery.
• Troubleshoot complex issues affecting data flow, performance, or stability, and drive root cause analyses.
• Participate in on-call rotations.

🎯 Requirements

• Strong hands-on experience designing and implementing large-scale Kafka event streaming capabilities in production, across hybrid or multi-cloud and Linux environments, including upgrades and migrations between platforms or versions.
• In-depth knowledge of event streaming/data-in-motion design principles, architecture, and operational nuances.
• Programming proficiency in Java, Python, or similar modern languages for tooling, integration, and automation.
• Familiarity with Kafka Client APIs (Producer, Consumer, Streams), as well as sizing and capacity planning for high-throughput clusters.
• Experience designing and optimizing real-time data streaming solutions with technologies like Apache Flink.
• Knowledge of automating infrastructure and operational tasks (configuration management, IaC, scripting, or related).
• Problem-solving mindset with an eagerness to learn, take initiative, and advocate for infrastructure best practices in a fast-paced environment.
• A Bachelor’s degree or equivalent work experience is required.

🏖️ Benefits

• There may be flexibility with the range included in this posting should a candidate be leveled higher or lower than the posted range.
• This opportunity has the option to be fully remote in all locations across the US.
