Stelle veröffentlichen Partner

Remote-Jobs suchen

InfiniteChoice

Website LinkedIn Alle Stellen

11 - 50 Mitarbeiter

Gegründet 2015

🛍️ eCommerce

🤖 Künstliche Intelligenz

🤝 B2B

eCommerce • Artificial Intelligence • B2B

InfiniteChoice ist ein Plattformunternehmen, das Start-ups und wachstumsstarke Verbraucherunternehmen durch die Kombination von Kapital, operativer Expertise und geistigem Eigentum aufbaut und skaliert. Das Unternehmen legt besonderen Wert auf automatisierungsorientierte Ausführung und ein KI-geführtes Ökosystem, um die Zeit für das Erreichen der Skalierung von Unternehmen mit klarem Product-Market-Fit zu verkürzen und konzentriert sich dabei auf den Start und die Optimierung von E-Commerce-Marken und Kundenplattformen. Mit Unterstützung von Private Equity und unter der Leitung erfahrener Betreiber setzt InfiniteChoice strategisches Kapital, Technologie und operatives Talent ein, um profitables, margenstarkes Wachstum in seinem Portfolio voranzutreiben.

Principal Site Reliability Engineer, SRE

Stelle nicht auf LinkedIn

🕒 vor 3 Monaten

🏄 California, Texas – Remote

💵 €180.000 - €210.000 / Jahr

⏰ Vollzeit

🔴 Experte

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

Cloud

Distributed Systems

Google Cloud Platform

Microservices

Jetzt Bewerben

Ähnliche Remote-Jobs finden

📊 Überprüfen Sie Ihre Lebenslauf-Bewertung für diese Stelle

Verbessern Sie Ihre Chancen auf ein Vorstellungsgespräch, indem Sie Ihre Lebenslauf-Bewertung vor der Bewerbung überprüfen.

InfiniteChoice

Website LinkedIn Alle Stellen

11 - 50 Mitarbeiter

Gegründet 2015

🛍️ eCommerce

🤖 Künstliche Intelligenz

🤝 B2B

eCommerce • Artificial Intelligence • B2B

Beschreibung

• Build SRE practices from scratch - define SLIs, SLOs, error budgets, and reliability metrics • Establish incident response procedures, on-call rotations, and post-mortem processes • Create reliability engineering standards and best practices across all engineering teams • Develop disaster recovery and business continuity strategies • Design and implement capacity planning and performance optimization frameworks • Drive architecture decisions for comprehensive application and infrastructure monitoring solutions • Design and develop custom SRE tools for automated monitoring, alerting, and remediation • Build observability platforms that provide deep insights into system performance and user experience • Create automation frameworks for deployment, scaling, and incident response • Architect logging, metrics, and tracing systems for distributed microservices environments • Leverage Google Cloud Platform services to build resilient, scalable infrastructure • Implement cloud-native monitoring using Stackdriver, Cloud Monitoring, and Cloud Logging • Design auto-scaling and self-healing systems using GKE, Cloud Functions, and managed services

🎯 Anforderungen

• 12+ years of experience in Site Reliability Engineering or Infrastructure Engineering • 5+ years in lead SRE roles building and scaling SRE teams and processes • Proven track record designing and implementing monitoring and observability solutions at scale • Deep understanding of distributed systems, microservices architectures, and cloud-native patterns • Experience with infrastructure as code, configuration management, and deployment automation • Hands-on experience with Google Cloud Platform is required • Expertise with GCP monitoring and observability stack (Cloud Monitoring, Cloud Logging, Cloud Trace) • Experience with GKE, Compute Engine, Cloud Functions, and other core GCP services • Bachelor's degree in Computer Science, Engineering, or equivalent professional experience • Industry certifications (Google Cloud Professional, SRE or related certifications preferred)

🏖️ Vorteile

• Ground-floor opportunity to build SRE practices and culture from scratch • Full autonomy to define processes, select technologies, and establish best practices • Direct impact on platform reliability serving millions of users • Opportunity to create lasting engineering culture and operational excellence • Remote-first culture with in-person meeting in Dallas, TX on need basis • Collaborative environment with smart, passionate engineers and cross-functional teams • Access to cutting-edge technologies and AI-driven development tools • Competitive compensation, equity participation, and comprehensive benefits

Jetzt Bewerben

Ähnliche Jobs

Principal Software Engineer – Site Reliability

🕒 vor 4 Monaten

Upstart

1001 - 5000

Website LinkedIn Alle Stellen

Principal Software Engineer on the SRE team at Upstart, advocating for reliability and scalability. Leading cross-functional collaboration and shaping technical roadmaps for SRE initiatives.

🇺🇸 Vereinigte Staaten – Remote

💵 $195.300 - $270.400 / Jahr

⏰ Vollzeit

🔴 Experte

⛑ DevOps- und Site Reliability Engineer (SRE)

🦅 H1B-Visum-Sponsor

🗣️🇺🇸🇬🇧 Englisch erforderlich

JavaScript

Prometheus

Python

Terraform

TypeScript

Bewerben

Stelle Ansehen

Staff Site Reliability Engineer

🕒 vor 5 Monaten

PathAI

501 - 1000

🤖 Künstliche Intelligenz

⚕️ Krankenversicherung

🧬 Biotechnologie

Website LinkedIn Alle Stellen

Staff Site Reliability Engineer designing and operating a hybrid cloud environment at PathAI. Focused on implementing SRE best practices and enhancing infrastructure reliability.

🇺🇸 Vereinigte Staaten – Remote

💵 $165.750 - $224.450 / Jahr

💰 €165.000.000 Series C im 2021-05

⏰ Vollzeit

🔴 Experte

⛑ DevOps- und Site Reliability Engineer (SRE)

🦅 H1B-Visum-Sponsor

🗣️🇺🇸🇬🇧 Englisch erforderlich

Ansible

AWS

Cloud

Grafana

Prometheus

Python

Terraform

Bewerben

Stelle Ansehen

SRE / DevOps Manager

🕒 vor 5 Monaten

Upshop

51 - 200

☁️ SaaS

🛒 Einzelhandel

🛍️ eCommerce

Website LinkedIn Alle Stellen

SRE / DevOps Manager at Upshop leading reliability and operations engineering team. Responsible for scalability, security, and performance of infrastructure.

🇺🇸 Vereinigte Staaten – Remote

⏰ Vollzeit

🟠 Senior

🔴 Experte

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

AWS

Azure

Cloud

Docker

Google Cloud Platform

Grafana

Kubernetes

MongoDB

Prometheus

Python

Shell Scripting

Terraform

Bewerben

Stelle Ansehen

Staff Site Reliability Engineer

🕒 vor 7 Monaten

FloSports

201 - 500

Website LinkedIn Alle Stellen

Staff SRE at FloSports improving developer enablement and migrating infrastructure to AWS. Leading technical architecture and critical tooling development with a focus on reliability and automation.

🇺🇸 Vereinigte Staaten – Remote

⏰ Vollzeit

🔴 Experte

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

AWS

Google Cloud Platform

JavaScript

Kubernetes

Node.js

Terraform

Bewerben