Database Reliability Engineer

🔥 8 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of CrowdStrike

CrowdStrike

5001 - 10000 employees

Founded 2011

🔒 Cybersecurity

☁️ SaaS

🤖 Artificial Intelligence

Cybersecurity • SaaS • Artificial Intelligence

CrowdStrike is a cybersecurity company that provides cloud-based security services to stop breaches. It is recognized as a leader in endpoint protection, identity and cloud security, and managed detection and response. CrowdStrike's platform, Falcon, integrates artificial intelligence to offer real-time visibility, detection, and protection against sophisticated cyber threats. The company is lauded for its effectiveness in securing networks and data, making it a trusted partner for businesses worldwide.

📋 Description

• Maintain a deep understanding of the data components - including Cassandra, ElasticSearch/OpenSearch, Kafka, Zookeeper, MySQL and PostgreSQL, and use that understanding to operate and automate properly configured clusters. • Operate and manage databases in Kubernetes (K8s) environments, including cluster orchestration, container image management, and workload deployment. • Design and implement Kubernetes-native solutions for data services, including StatefulSets, persistent volumes, and resource management for stateful applications. • Develop infrastructure services to support the CrowdStrike engineering team's pursuit of a full devops model. • Work closely with Engineering and Customer Support to troubleshoot time-sensitive production issues, regardless of when they happen. • Keep petabytes of critical business data safe, secure, and available.

🎯 Requirements

• Configuration management (Chef) • Scripting in Python and bash • Experience with large scale datastores using technologies like Cassandra, ElasticSearch, Kafka, Zookeeper, and MySQL. • Hands-on experience operating and managing databases in Kubernetes environments, including container orchestration, StatefulSets, persistent storage, and resource optimization. • Proficiency with Kubernetes tools and concepts: kubectl, Helm, YAML manifests, networking, storage classes, and cluster administration. • Experience with large-scale, business-critical Linux environments • Experience operating within the cloud, preferably Amazon Web Services, GCP and OCI • Proven ability to work effectively with both local and remote teams • Track record of making great decisions, particularly when it matters most • Excellent communication skills, both verbal and written • A combination of confidence and independence with the prudence to know when to ask for help from the rest of the team • Bachelor's degree in an applicable field, such as CS, CIS or Engineering

🏖️ Benefits

• Market leader in compensation and equity awards • Comprehensive physical and mental wellness programs • Competitive vacation and holidays for recharge • Paid parental and adoption leaves • Professional development opportunities for all employees regardless of level or role • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections • Vibrant office culture with world class amenities • Great Place to Work Certified™ across the globe

Apply Now

Similar Jobs

🕒 6 days ago

Omilia - Conversational Intelligence

201 - 500

🤖 Artificial Intelligence

🛍️ eCommerce

Senior Site Reliability Engineer maintaining production clusters and developing observability solutions. Collaborate with teams to ensure platform reliability and performance using automation and monitoring tools.

Ansible

AWS

Cloud

Docker

Grafana

Kubernetes

Linux

MySQL

NoSQL

Postgres

Prometheus

Python

RDBMS

Redis

TCP/IP

Terraform

VoIP

Go

🕒 June 1

Red Hat

10,000+ employees

🏢 Enterprise

Customer Site Reliability Engineer managing critical services and driving reliability and customer satisfaction at Red Hat. Engaging with cross-functional teams and enhancing system resilience.

🗣️🇯🇵 Japanese Required

Ansible

AWS

Azure

Cloud

Distributed Systems

Google Cloud Platform

Kubernetes

Linux

OpenShift

Prometheus

TCP/IP

Terraform

Go

🕒 May 8

Megaport

201 - 500

📡 Telecommunications

Senior Platform Engineer at Megaport, focusing on DevOps and SRE practices across their systems. Responsible for reliability and stakeholder engagement in a collaborative tech environment.

AWS

Cassandra

Cloud

Kubernetes

Linux

Postgres

Python

Terraform

Go

🕒 April 28

Sigma Prime

11 - 50

🌐 Web 3

₿ Crypto

🔒 Cybersecurity

Devops Engineer building decentralized network infrastructure with Sigma Prime. Assist developers and create testnets while maintaining production instances of Ethereum software.

Ansible

DNS

Firewalls

Kubernetes

Linux

Terraform

🕒 April 10

Axon

1001 - 5000

🔐 Security

🤖 Artificial Intelligence

📚 Education

Site Reliability Engineer delivering solutions for real-time problems in cloud-native services at Axon. Collaborating with engineering teams to ensure system stability and performance.

AWS

Azure

Cloud

Java

Kubernetes

Linux

Python

Terraform

Go