Senior Site Reliability Engineer, Observability

🕒 December 25, 2025

Chainlink Labs

201 - 500 employees

Founded 2017

💸 Finance

💳 Fintech

🌐 Web 3

Chainlink Labs is a leading player in the field of decentralized finance (DeFi) and blockchain technology. The company is pioneering the use of decentralized systems to facilitate onchain transactions for financial institutions and marketplaces. By collaborating with financial market infrastructures, asset managers, and top DeFi protocols, Chainlink Labs is driving the transition to a tokenized asset economy and aims to become the global standard for onchain finance. With expertise in cryptography and a robust track record in security, Chainlink Labs provides a platform that powers a global system of onchain finance.

📋 Description

• Build and orchestrate a modern OTel-based observability platform.
• Support multiple telemetry types, such as metrics, logs, and traces.
• Define and support modern observability governance and address observability problems at scale.
• Ensure reliability, security, and performance exceed our defined SLAs.
• Work with engineers from across the company to help troubleshoot issues, deploy new products and services, and increase velocity while decreasing cognitive load.
• Lead the design and deployment of monitoring/observability services that detect issues and alert the team when action is needed.
• Ingest, aggregate, transform, and utilize data from a multitude of sources in our real-time data pipeline.
• Oversee the availability, performance, and supportability of our observability infrastructure.
• Create processes around alert response operations and support the team to ensure the reliable delivery of oracle data.
• Make recommendations to ensure sufficient metrics are collected to create alerts with every new feature release.
• Champion reliability and security by taking the time to do your work right the first time.

🎯 Requirements

• 7+ years of relevant professional experience. You have probably worked on a DevOps, infrastructure, SRE, and/or platform team before.
• Ability to develop software outside the scope of typical infrastructure requirements and configurations.
• Experience programming in C, C++, Java, Python, Go, Perl, or Ruby.
• Expert knowledge of all aspects of designing, developing, and managing large real-time systems.
• Experience with monitoring and logging. You know how to export metrics using Prometheus, have built a Grafana dashboard or two, and have experience with a centralized logging solution such as the ELK Stack, Splunk, or the Grafana stack.
• Experience with distributed systems and container orchestration. You have maintained or even built Kubernetes clusters before and feel comfortable deploying completely new services on them.
• Strong communication skills. You can give and receive constructive feedback, and you do not shy away from planning meetings and code reviews.

🏖️ Benefits

• Health insurance
• 401(k) matching
• Flexible work hours
• Paid time off
• Remote work options

