Senior Manager, Engineering – Observability Platform

🕒 6 days ago

☕ Washington – Remote

💵 $205,000 - $275,000 / year

⏰ Full-Time

🟠 Senior

🖥 Software Engineer

🦅 Sponsors H1B Visa

🗣️🇺🇸🇬🇧 English required

Smartsheet

1001 - 5000 employees

Founded in 2005

☁️ SaaS

⚡ Productivity

🤝 B2B

Smartsheet is a platform designed for managing projects, automating workflows, and building solutions at scale. It offers a broad range of capabilities, including automation, team collaboration, dashboards and reporting, and integrations, which help companies streamline their operations. The platform supports a variety of use cases, such as project management, IT portfolio management, and marketing management, and serves industries including government, finance, and healthcare. Smartsheet also emphasizes data security and protection to safeguard user privacy. In addition, it offers professional services such as consulting, training, and implementation support to help customers get the most out of the platform.

Description

• Lead a team of engineers focused on observability platform engineering, driving the build-out of a unified observability stack used by all engineering teams at Smartsheet.
• Own and evolve the platform's technical roadmap, consolidating multiple tooling platforms and AI observability tooling into a coherent, scalable capability.
• Define platform standards, contribute to architectural direction, and ensure the team operates with engineering rigor and strong operational habits.
• Build and scale the team, hiring senior engineers and establishing effective global practices across distributed stakeholders.
• Lead design and delivery of centralized observability infrastructure covering metrics pipelines, distributed tracing, alerting frameworks, and log analytics across Smartsheet services.
• Drive SLO/SLA definition and tooling for platform-wide reliability visibility, partnering closely with infrastructure, platform engineering, and on-call teams.
• Own governance including instrumentation standards, cost optimization, and rollout of advanced capabilities such as APM, RUM, and custom dashboards.
• Lead architecture, scaling, and operational practices for log analytics across high-throughput production workloads.
• Establish shared observability libraries, agents, and SDKs that reduce instrumentation burden for application engineering teams.
• Build and maintain AI/ML observability integrations in partnership with the AI Platform team.
• Partner with the Data & AI Platform team to integrate MLflow tracing, Inference Tables, and LLM-as-judge evaluation pipelines into the observability stack.
• Develop dashboards and alerting for agentic AI workloads, including latency, token consumption, error rates, and evaluation metric drift.
• Contribute to the AI governance and cost observability program, providing telemetry for model usage, cost attribution, and compliance reporting.
• Serve as the primary engineering partner for platform consumers across Data & AI, Commerce, Infrastructure, and Security teams, ensuring observability needs are met across workstreams.
• Lead complex, cross-functional observability projects with high ambiguity, managing delivery risk, communicating clearly to senior stakeholders, and building alignment across teams.
• Work with delivery partners to coordinate instrumentation across platform modernization and migration workstreams.
• Contribute to quarterly and annual platform goals, reporting on key reliability and observability metrics to engineering leadership.
• Communicate platform status, risks, and roadmap progress to audiences at the Engineering leadership level and above in a clear, executive-ready format.
• Embed on-call culture and incident management discipline into the team, ensuring clear runbooks, fast MTTR, and post-incident learning loops.
• Drive cost governance for observability tooling, including spend optimization and efficient resource management.
• Champion AI-assisted engineering practices within the team, applying tooling and automation to reduce toil and accelerate delivery.

🎯 Requirements

• 10+ years of software or platform engineering experience, with strong fundamentals in distributed systems, infrastructure, and backend services.
• 3 years of engineering management experience, including direct team building, performance management, and cross-functional delivery ownership.
• Deep hands-on expertise with observability tooling: Datadog (APM, metrics, logs, alerting), OpenSearch or Elasticsearch, distributed tracing (OpenTelemetry or equivalent), and SLO/SLA management at scale.
• Proven experience operating observability platforms for high-availability, high-throughput production environments.
• Experience building and scaling engineering teams with a distributed or international focus.
• Strong execution track record on complex, cross-functional infrastructure programs with high ambiguity.
• Clear, direct communication (written and verbal) with both technical and non-technical audiences, including leadership and executive stakeholders.
• Proactive risk identification and status communication without prompting.
• Experience managing vendors, external delivery partners, and third-party integrations in a platform context.

🏖️ Benefits

• Employer-subsidized medical/vision and dental coverage for full-time employees
• 401k match to help you save for your future (50% of your contribution up to the first 6% of your eligible pay)
• Monthly stipend to support your work and productivity
• Flexible Time Away Program, plus Sick Time Off
• US employees are automatically covered under Smartsheet-sponsored life insurance and short-term and long-term disability plans
• US employees receive 12 paid holidays per year
• Up to 24 weeks of Parental Leave
• Personal paid Volunteer Day to support our community
• Opportunities for professional growth and development, including access to Udemy online courses
• Company-funded perks, including a counseling membership, local retail discounts, and your own personal Smartsheet account
• Teleworking options from any registered location in the U.S. (role specific)
