Senior Software Engineer II – Applied AI and Evaluations

🕒 Abril 13

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $175.000 - $245.000 / ano

⏰ Tempo Integral

🟠 Sênior

🧑‍💻 Engenheiro Full-stack

🦅 Patrocina Visto H1B

info

🗣️🇺🇸🇬🇧 Inglês obrigatório

Candidatar-se
Encontrar Vagas Remotas Similares

📊 Verifique sua pontuação de currículo para esta vaga

Melhore suas chances de conseguir uma entrevista verificando sua pontuação de currículo antes de se candidatar.

Logo of Smartsheet

Smartsheet

1001 - 5000 funcionários

Fundada em 2005

☁️ SaaS

⚡ Produtividade

🤝 B2B

SaaS • Productivity • B2B

Smartsheet é uma plataforma projetada para gerenciar projetos, automatizar fluxos de trabalho e construir soluções em larga escala. Ela oferece uma ampla gama de recursos, incluindo automação, colaboração em equipe, painéis e relatórios, e integrações, permitindo que as empresas otimizem suas operações. A plataforma atende a diversos casos de uso, como gestão de projetos, gestão de portfólio de TI, gestão de marketing e mais, servindo a várias indústrias, incluindo governo, finanças e saúde. Smartsheet também enfatiza a segurança e proteção de dados, garantindo a privacidade dos dados dos usuários. Além disso, oferece serviços profissionais como consultoria, treinamento e suporte à implementação para maximizar as capacidades da plataforma.

Descrição

• Own agent quality end-to-end: diagnosis, improvement, and validation across SmartAssist's orchestrator and subagents • Identify failure modes across quality dimensions factual accuracy, completeness, tone, actionability, and latency and prioritize what to fix • Drive quality improvements through prompt engineering, context engineering, and RAG retrieval tuning • Extend and mature our evaluation framework: scorers, golden datasets, regression gates, and online evaluation for production traffic • Close the feedback loop ensure that every change has a measurable, attributable quality signal • Collaborate with our Agent Architecture lead to distinguish quality problems that require prompt/context solutions from those that require structural fixes • Establish repeatable methodology that scales beyond any single agent or subagent

🎯 Requisitos

• 8+ years of software engineering experience, with at least 2 years working directly with LLMs in production • Deep, hands-on experience with prompt engineering and context engineering, you understand how model behavior changes with framing, structure, and input design • Strong working knowledge of RAG architectures: chunking strategies, embedding models, retrieval evaluation, and failure diagnosis • Experience building or extending LLM evaluation frameworks, you have designed scorers, worked with golden datasets, and thought carefully about what good looks like • Fluency in agent system design, you don't need to own the architecture, but you can engage as a peer on architectural tradeoffs that affect quality • Strong Python skills; comfortable working in data-heavy environments (Databricks, Delta tables, or equivalent) • Ability to communicate complex quality findings (written and verbal) to both technical and non-technical stakeholders, you can explain what’s broke, why it matters, and what needs to happen next without losing the room • Strong cross-functional judgment, you know when to escalate, when to resolve independently, and how to build credibility across engineering, product, and AI platform teams • A bias for clarity in ambiguous situations, when failure modes are murky and trade-offs are real, you bring structure and a clear point of view rather than waiting for consensus • Legally eligible to work in the U.S. on an ongoing basis • BS or MS in Computer Science, a related field, or equivalent industry experience.

🏖️ Benefícios

• Employer subsidized medical/vision and dental coverage for full-time employees • 401k Match to help you save for your future (50% of your contribution up to the first 6% of your eligible pay) • Monthly stipend to support your work and productivity • Flexible Time Away Program, plus Sick Time Off • US employees are automatically covered under Smartsheet-sponsored life insurance, short-term, and long-term disability plans • US employees receive 12 paid holidays per year • Up to 24 weeks of Parental Leave • Personal paid Volunteer Day to support our community • Opportunities for professional growth and development including access to Udemy online courses • Company Funded Perks, including a counseling membership, local retail discounts, and your own personal Smartsheet account • Teleworking options from any registered location in the U.S. (role specific)

Candidatar-se

Vagas Similares

🕒 Abril 13

Switzerland Global Enterprise

51 - 200

🤝 B2B

🛍️ Comércio Eletrônico

Lead Engineer for HVDC Control Systems at GE Vernova. Overseeing design, development, and validation of control-related software.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟠 Sênior

🧑‍💻 Engenheiro Full-stack

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Abril 13

Reset

1 - 10

💳 Fintech

💸 Finanças

Full Stack Engineer developing real-time systems for financial health at Reset. Collaborate closely with partners and ensure fast, reliable integrations for users' access to income.

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $150.000 - $180.000 / ano

💰 $2.300.000 Pre Seed Round - Reset Financial Technologies em 2024-02

⏰ Tempo Integral

🟡 Pleno

🟠 Sênior

🧑‍💻 Engenheiro Full-stack

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Abril 13

U.S. Department of Labor

10.000+ funcionários

🏛️ Governo

📋 Conformidade

Design and develop integration architecture between components for ASI's web-based applications. Collaborate with teams to ensure successful project delivery in an agile environment.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟠 Sênior

🧑‍💻 Engenheiro Full-stack

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Abril 13

GE Vernova

10.000+ funcionários

⚡ Energia

🚀 Aeroespacial

🤖 Inteligência Artificial

Lead Engineer for design and development of HVDC Control Systems. Working in a dynamic team and maintaining application software for control and protection systems.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟠 Sênior

🧑‍💻 Engenheiro Full-stack

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Abril 13

EncorEstate Plans

1 - 10

☁️ SaaS

🤝 B2B

💸 Finanças

Senior Software Engineer designing and implementing product features for AI-assisted estate planning service. Collaborating with CTO and team to drive impactful solutions in a fast-paced environment.

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $120.000 - $160.000 / ano

⏰ Tempo Integral

🟠 Sênior

🧑‍💻 Engenheiro Full-stack

🗣️🇺🇸🇬🇧 Inglês obrigatório