Site Reliability Engineer

🕒 17 days ago

🇺🇸 United States – Remote

⏰ Full-time

🟡 Mid-level

🟠 Senior

⛑ DevOps and Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 English required

OXIO

51 - 200 employees

📡 Telecommunications

☁️ SaaS

💳 Fintech

OXIO is a cloud-native telecom-as-a-service platform that lets companies build, manage, and customize their own mobile networks without the typical complexities of telecommunications. With solutions for mobile virtual network operators (MVNOs), retailers, fintech companies, and more, OXIO enables users to integrate real-time connectivity into their existing business processes and to benefit from AI-driven insights for improved customer retention. The platform's API-first approach encourages rapid innovation and lets companies launch unique plans and services across multiple networks worldwide.

Description

• Design and implement a platform on the cloud to support OXIO backend services
• Automate technical operations: deployments, scaling, recovery, etc.
• Monitor and maintain mission-critical production infrastructure to ensure maximum uptime
• Participate in an on-call rotation and a culture of continuous improvement through blameless postmortems
• Enable the Engineering/Telecom/Data Engineering teams by providing them with the tools to operate the services they build

🎯 Requirements

• Understanding of Linux/Unix systems (most systems are Linux-based).
• Familiarity with Linux/Unix system internals like process management, filesystems, memory management, and networking.
• Proficiency in at least one programming language (Python, Go, or Ruby) and strong skills in scripting (Bash, Perl).
• Experience with infrastructure provisioning tools such as Terraform, CloudFormation, or Ansible.
• Familiarity with containerization (Docker) and orchestration tools (Kubernetes).
• Familiarity with monitoring tools like Prometheus, Grafana, or Datadog.
• Knowledge of setting up alerts, analyzing logs, and creating dashboards for observability.
• Familiarity with incident management practices (e.g., runbooks, postmortems).
• Experience being part of an on-call rotation and handling incidents.
• Experience setting up and maintaining Continuous Integration/Continuous Delivery pipelines (Jenkins, GitLab CI, CircleCI, etc.).
• Hands-on experience with cloud providers (AWS, Google Cloud, Azure).
• Knowledge of virtualization technologies (VMware, KVM) and cloud-native architecture.
• Understanding of TCP/IP, DNS, HTTP/HTTPS, load balancing, and firewalls.
• Strong understanding of deployment strategies (canary releases, blue-green deployments, etc.).
• Familiarity with high availability and an understanding of failover mechanisms.
• Familiarity with IAM (Identity and Access Management) and zero trust principles.
• Experience working with distributed systems (e.g., Kafka, Cassandra, Elasticsearch).
• Experience building custom monitoring tools or writing complex automation scripts.
• Functional knowledge of database management (SQL and NoSQL).
• Familiarity with distributed tracing (Jaeger, OpenTelemetry) and advanced log aggregation strategies (ELK stack, Splunk).
• Familiarity with performance profiling tools and optimizing application performance under heavy load.
• Familiarity with load testing and identifying bottlenecks.
• Familiarity with configuration management using SaltStack for maintaining server configurations.

🏖️ Benefits

• N/A

Similar jobs

🕒 18 days ago

Cority

201 - 500

☁️ SaaS

📋 Compliance

Intermediate Site Reliability Engineer supporting reliability, performance, and scalability of cloud-hosted services. Collaborate with engineering teams and contribute to incident response processes.

🇺🇸 United States – Remote

⏰ Full-time

🟡 Mid-level

🟠 Senior

⛑ DevOps and Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 English required

🕒 18 days ago

General Motors

10,000+ employees

🚗 Transportation

⚡ Energy

🏢 Enterprise

Design Release Engineer focusing on semiconductor product development and engineering processes at GM. Involves collaboration with teams to uphold strategic vision and core values of GM.

🇺🇸 United States – Remote

💵 $124,702 - $161,100 / year

💰 €500,000,000 grant in 2024-07

⏰ Full-time

🟡 Mid-level

🟠 Senior

⛑ DevOps and Site Reliability Engineer (SRE)

🦅 H1B visa sponsor

🗣️🇺🇸🇬🇧 English required

🕒 18 days ago

Order.co

51 - 200

☁️ SaaS

💳 Fintech

🤝 B2B

Senior Site Reliability Engineer at Order.co to ensure reliable and scalable software systems. Collaborate with the Platform team while maintaining operational efficiency and infrastructure excellence.

🇺🇸 United States – Remote

💵 $175,000 - $200,000 / year

💰 €30,000,000 Series B - Order in 2022-01

⏰ Full-time

🟠 Senior

⛑ DevOps and Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 English required

🕒 18 days ago

VetsEZ

201 - 500

🤝 B2B

☁️ SaaS

🏛️ Government

DevSecOps Engineer supporting secure software delivery and cloud infrastructure operations for federal government healthcare projects. Collaborating with teams to improve deployment reliability and efficiency.

🇺🇸 United States – Remote

⏰ Full-time

🟡 Mid-level

🟠 Senior

⛑ DevOps and Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 English required

🕒 18 days ago

VetsEZ

201 - 500

🤝 B2B

☁️ SaaS

🏛️ Government

DevSecOps Engineer for federal healthcare technology initiative, collaborating on secure software delivery and automation. Focusing on CI/CD, cloud infrastructure, and deployment efficiency.

🇺🇸 United States – Remote

⏰ Full-time

🟡 Mid-level

🟠 Senior

⛑ DevOps and Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 English required