Senior Site Reliability Engineer, Platform Engineering

2 days ago

Vultr

Cloud Computing • Artificial Intelligence

Vultr is a cloud infrastructure provider offering a wide range of services including compute instances, storage, managed databases, and GPU clusters. The company focuses on providing high-performance and accessible cloud solutions, leveraging both AMD and NVIDIA technologies to power applications in artificial intelligence, high-performance computing, and general workloads. Vultr offers services that are designed to be simpler and more cost-effective than major competitors like AWS, GCP, and Azure, with global data center locations to support diverse deployment needs.

51 - 200 employees

Founded 2014

🤖 Artificial Intelligence

📋 Description

• Configuration Management: Write and maintain configuration code in Puppet.
• Infrastructure-as-Code: Help define our existing infrastructure as code so it’s easy to rebuild.
• Toil Automation: Take manual processes and automate them, making the way we manage our services more efficient.
• Grafana Dashboarding: Get hands-on with data, producing visual representations of our systems and services.
• Collaboration: Work closely with other engineering teams to align development efforts with reliability, scalability, and business objectives.
• Documentation: Produce high-quality documentation for the systems and services the team is responsible for.

🎯 Requirements

• Experience writing configuration code for Puppet, Chef, or Salt.
• Experience automating manual processes in Ansible or Python.
• Experience troubleshooting Kubernetes deployments.
• Experience with time-series databases such as Graphite, Mimir, and Loki.
• Experience operating observability pipelines using OpenTelemetry and Kafka.
• Experience with monitoring and alerting tools such as Grafana and Icinga.
• Proven problem-solving skills with the ability to address complex technical challenges.
• Effective communication and collaboration abilities to work cross-functionally with teams and stakeholders.
• A commitment to continuous learning and fostering a culture of technical excellence.

🏖️ Benefits

• 100% company-paid insurance premiums for employee medical, dental, and vision plans.
• 401(k) plan that matches 100% up to 4%, with immediate vesting.
• Professional Development Reimbursement of $2,500 each year.
• 11 Holidays + Paid Time Off Accrual + Rollover Plan.
• Commitment matters to Vultr! Increased PTO at 3-year and 10-year anniversaries + 1 month paid sabbatical every 5 years + Anniversary Bonus each year.
• $500 stipend for remote office setup in first year + $400 each following year.
• Internet reimbursement up to $75 per month.
• Gym membership reimbursement up to $50 per month.
• Company-paid Wellable subscription.
