Senior Platform Engineer

🕒 vor 5 Tagen

🗣️🇺🇸🇬🇧 Englisch erforderlich

Jetzt Bewerben
Ähnliche Remote-Jobs finden

📊 Überprüfen Sie Ihre Lebenslauf-Bewertung für diese Stelle

Verbessern Sie Ihre Chancen auf ein Vorstellungsgespräch, indem Sie Ihre Lebenslauf-Bewertung vor der Bewerbung überprüfen.

Logo of Flexential

Flexential

501 - 1000 Mitarbeiter

Gegründet 2000

🤝 B2B

📡 Telekommunikation

🏢 Unternehmen

B2B • Telecommunications • Enterprise

Flexential ist ein Anbieter von speziell entwickelten Rechenzentrumsservices wie Colocation, Interconnection, Cloud und Datenschutz, die über die FlexAnywhere®-Plattform bereitgestellt werden. Das Unternehmen betreibt über 40 hochvernetzte Rechenzentren in 18 US-Märkten und bietet Hochleistungsstromversorgung, flüssigkeitsbasierte Kühlung, Carrier- und Cloud-Interconnection sowie verwaltete und professionelle Dienstleistungen zur Unterstützung von Hybrid-IT, KI/ML-Workloads, Disaster-Recovery und Compliance-Anforderungen für Unternehmenskunden.

Beschreibung

• Design, develop and operationally manage automated, resilient, high availability, self-healing, secure platforms with native-AI capabilities for IT needs, serving both internal as well as customer business capabilities • Develop , and manage the Observability OpenTelemetry Central Backend Stack: Grafana Enterprise, Mimir, Loki, Tempo, and Alertmanager on Kubernetes/RKE2 via Helm and GitLab CI -CD . • Build and manage iaC and CI-CD for automated provisioning and deployment, including Terraform modules for Infra/ VM/storage provisioning, Ansible AWX playbooks for OS/ App bootstrap, ArgoCD and Helm for Kubernetes configuration . • Develop and manage OpenTelemetry Prometheus scrape profile library including SNMP exporters, REST API exporters, and cloud provider exporters (CloudWatch, Azure Monitor, GCP) for multiple device classes. • Develop AIOps capabilities on platforms for e.g Observability use-cases : anomaly detection integrations, event correlation rules in Alertmanager , and synthetic monitoring patterns to reduce alert noise. • Configure and maintain Zabbix auto-discovery: network range scanning, device classification, and Prometheus service discovery integration. • Build and harden Edge Stack deployments (Prometheus + OTel collector) per data center site using GitOps templates. • Integrate Alertmanager with ServiceNow: webhook routing, ticket enrichment, auto-close logic, and escalation policy configuration. • Maintain platform security: Conjur /CyberArk secret injection at runtime, mTLS between stack components, RBAC in Grafana Enterprise. • Author and maintain Grafana dashboards in JSON/GitLab — facility overview, network health, RED metrics, application telemetry. • Mentor mid-level engineers, lead code reviews, and establish engineering standards for the team. • Represent platform engineering in cross-functional architecture reviews and executive-level program updates. • Perform other duties as required and assigned

🎯 Anforderungen

• DevOps / Automation - 5+ years in a production environment • Kubernetes (RKE2/k3s), Helm chart deployment, system services, Docker/ container • LGTM Stack Development and Configuration - 4 + years : Grafana, Mimir, Loki, Tempo configuration, tuning, dash- boarding and production operation s ; Prometheus required • Senior-level Python / Scripting frameworks - 5+ years, Automation scripts, exporter development, GitLab pipeline scripting, REST API integrations • GitOps / CI/CD - 5+ years, GitLab CI/CD pipeline authoring; Terraform and Ansible as primary IaC tools; ArgoCD or Flux preferred • AIOps / Observability Engineering - 2+ years , Alertmanager rule authoring, anomaly detection integration, event correlation, noise reduction techniques • Working infrastructure (Linux/VM) management knowledge - 5+ years, Linux administration, VMware vCenter/ VCF experience , Netapp storage management , network fundamentals (SNMP, TCP/IP) • Secrets Management - 2+ years , CyberArk/ Conjur , HashiCorp Vault, or equivalent — runtime secret injection patterns • Minimal travel may be required

🏖️ Vorteile

• Medical, Telehealth, Dental and Vision • 401(k) • Health Savings Accounts (HSA) and Flexible Spending Accounts (FSA) • Life and AD&D • Short Term and Long-Term disability • Flex Paid Time Off (PTO) • Leave of Absence • Employee Assistance Program • Wellness Program • Rewards and Recognition Program

Jetzt Bewerben

Ähnliche Jobs

🕒 vor 6 Tagen

Defense Unicorns

51 - 200

🔒 Cybersecurity

Platform Engineer at Defense Unicorns focusing on UDS deployments across AWS and Azure. Responsibilities include customer training and support along with system architecture development.

🇺🇸 Vereinigte Staaten – Remote

💵 $123.250 - $166.750 / Jahr

💰 Seed Round im 2022-10

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

🏗️ Plattformingenieur

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 6 Tagen

Kerr Dental

1001 - 5000

⚕️ Krankenversicherung

🔬 Wissenschaft

🧘 Wellness

Executive Director leading AI platform engineering for Novartis, driving AI transformation through data and advanced analytics. Leading engineering strategy, delivery, and operational excellence for agentic AI platform.

🇺🇸 Vereinigte Staaten – Remote

💵 $225.400 - $418.600 / Jahr

💰 Debt Financing im 2005-12

⏰ Vollzeit

🟠 Senior

🏗️ Plattformingenieur

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 6 Tagen

Defense Unicorns

51 - 200

🔒 Cybersecurity

Platform Engineer developing and maintaining the platform for Navy Certificate to Ship team at Defense Unicorns. Involves innovating solutions and collaborating across various tech domains.

🇺🇸 Vereinigte Staaten – Remote

💵 $148.750 - $201.250 / Jahr

💰 Seed Round im 2022-10

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

🏗️ Plattformingenieur

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 6 Tagen

Quantiphi

1001 - 5000

🤖 Künstliche Intelligenz

🏢 Unternehmen

📚 Bildung

Architect - Platform Engineer at Quantiphi designing and optimizing infrastructure for GenAI workloads. Collaborating with architects and teams to deliver AI solutions in a remote environment.

🇺🇸 Vereinigte Staaten – Remote

💰 Series A im 2019-12

⏰ Vollzeit

🟠 Senior

🔴 Experte

🏗️ Plattformingenieur

🦅 H1B-Visum-Sponsor

info

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 6 Tagen

Novartis

10.000+ Mitarbeiter

💊 Pharmazie

🧬 Biotechnologie

⚕️ Krankenversicherung

Executive Director leading AI Platform Engineering at Novartis. Driving AI transformation and enabling data-driven decision-making leveraging advanced analytics for business growth.

🗣️🇺🇸🇬🇧 Englisch erforderlich