Observability & Operations Engineer

🕒 3 months ago

🌵 Arizona – Remote

💵 $131,709 - $161,344 / year

⏰ Full-time

🟠 Senior

🔴 Expert

⚙️ Operations

🗣️🇺🇸🇬🇧 English required

Fullbay

51 - 200 employees

☁️ SaaS

🏢 Enterprise

🚗 Transport

💰 Venture Round in 2019-05

Fullbay is a complete software solution designed specifically for diesel repair shops and other heavy-duty vehicle repair operations. The company offers a suite of tools that simplify shop management, including estimates and invoices, service order workflow, inventory management, and customer communication. Fullbay integrates with accounting systems for seamless bookkeeping and supports two-way messaging between customers and shops. It also provides specialized repair software for fleet maintenance, mobile operations, and specific industries such as agricultural and emergency vehicles. With features such as reporting, MOTOR integration for repairs, and secure cloud-based data, Fullbay aims to maximize shop efficiency and increase profitability while contributing to road safety.

Description

• Design and implement a comprehensive observability strategy (logging, metrics, tracing, alerting) across all AWS environments, leveraging AI-powered tools to detect anomalies and surface insights automatically
• Build and manage monitoring platforms such as Datadog, Grafana, Prometheus, and AWS CloudWatch, actively exploring AI-native features within these tools to reduce alert fatigue and improve signal quality
• Use AI coding assistants (e.g. GitHub Copilot, Claude) to accelerate development of dashboards, runbooks, and automation scripts
• Own the incident management lifecycle (on-call rotations, post-mortems, root cause analysis) and apply AI-assisted log analysis to speed up diagnosis and resolution
• Instrument Java, Kotlin, and Node.js-based cloud-native applications to emit structured logs, distributed traces, and metrics; identify opportunities to use ML-based anomaly detection in place of static thresholds
• Build repeatable, code-first observability pipelines that treat dashboards, alerts, and runbooks as first-class software: versioned, tested, and deployed through Harness
• Leverage AWS PaaS services (Lambda, API Gateway, ECS, RDS, SQS, SNS, and others) to build scalable, automated operational tooling
• Collaborate with development teams to embed observability and AI-assisted quality checks into CI/CD pipelines via Harness
• Own the FinOps function for our AWS environment: tracking cloud spend, building cost dashboards, identifying waste, and using AI-powered cost analysis tools to surface optimization opportunities and drive accountability across engineering teams
• Monitor AWS infrastructure for performance, availability, and cost, partnering with finance and engineering to enforce spend governance
• Develop and maintain Infrastructure as Code using Terraform, using AI pair programming to improve quality and consistency
• Contribute to architectural decisions with a focus on resilience, automation, and reducing toil through intelligent systems
• Adhere to all confidentiality and compliance regulations
• Perform other duties as assigned

🎯 Requirements

• 7-10 years of experience in Software Engineering, Cloud Operations, or Site Reliability Engineering
• 5+ years of hands-on experience with AWS infrastructure and AWS PaaS services; certifications are a plus
• Demonstrated experience building repeatable, code-first pipelines and treating operational configuration as first-class software
• Experience working with polyglot environments including Java, Kotlin, and Node.js
• Demonstrated experience using AI tools (coding assistants, AI-powered observability platforms, or similar) in a professional setting; we’re an AI-first company and expect this to be part of how you work, not something you’re just exploring
