Member of Engineering – Reinforcement Learning

🕒 April 28

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of poolside

poolside

51 - 200 employees

Founded 2023

🤖 Artificial Intelligence

🏢 Enterprise

Artificial Intelligence • Enterprise

poolside is a frontier AI lab and enterprise platform that builds and deploys foundation models, multi-agent systems, and developer-facing tools focused on automating complex software work. The company specializes in on-prem and VPC deployments, security-first integrations, governance, and connectors to enterprise data sources so organizations can run agents and models inside their own boundaries. Poolside embeds research and engineering with customers to deliver outcome ownership, risk controls, and measurable business impact while advancing toward AGI by starting in high-consequence software environments.

📋 Description

• Research and experiment on ways to improve reasoning and code generation for LLMs. Own the full experiment life cycle from idea to experimentation and integration • Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation. Translate research ideas into clean, reusable codebases that other researchers can build on • Design, analyze, and iterate on data generation and training of LLMs • Implement and iterate on RL training pipelines that scale reliably across domains • Diagnose training instabilities and failures, debug RL runs and propose mitigation methods • Write high-quality, reproducible and maintainable code

🎯 Requirements

• Experience with Large Language Models (LLM), including: • Understanding of the Transformer architecture and scaling laws • Mid-training and post-training techniques • Experience training reasoning and/or agentic models • Hands-on use of LLMs, with a sense of their capabilities and limitations • Reinforcement Learning experience • Solid grasp of Reinforcement Learning concepts and familiarity with modern algorithms • Experience developing distributed, large-scale RL pipelines from data creation to evaluations • Research experience • Scientific publications in any of the following topics: Reinforcement Learning, LLMs and reasoning models • Ability to discuss the latest research with sufficient level of detail • Is reasonably opinionated • Engineering skills • Strong machine learning, algorithm skills and engineering background • Experience with distributed training • Excellent programming skills in Python • Familiarity with a deep learning framework (Pytorch or JAX)

🏖️ Benefits

• Fully remote work & flexible hours • 37 days/year of vacation & holidays • Health insurance allowance for you and dependents • Company-provided equipment • Wellbeing, always-be-learning and home office allowances • Frequent team get togethers • Great diverse & inclusive people-first culture

Apply Now

Similar Jobs

🕒 April 24

Linear

11 - 50

Product marketer creating content that demonstrates use cases for Linear's AI agent. Collaborating with teams to help customers discover the product's potential.

🕒 April 15

Infovista

501 - 1000

🤝 B2B

📡 Telecommunications

Engineering Operations Manager overseeing SDLC process and governance at Infovista. Leading cross-functional teams to ensure disciplined and compliant engineering practices.

SDLC

🕒 March 31

Aquiva Labs

201 - 500

🤝 B2B

☁️ SaaS

🏢 Enterprise

Mulesoft Developer designing and implementing integrations between Salesforce and other enterprise systems. Requires expertise in MuleSoft Anypoint Platform with Agile team collaboration.

Cloud

SOAP

🕒 March 28

Weaviate

51 - 200

🤖 Artificial Intelligence

🤝 B2B

Manage Weaviate's social media channels, creating engaging content for a tech-savvy audience. Collaborate with teams to drive marketing strategies in an AI-focused startup.

🕒 March 28

SPACE44

11 - 50

🤝 B2B

☁️ SaaS

👥 HR Tech

Freelance Engineer/Developer at SPACE44 building scalable applications and cloud environments for diverse client projects. Join an expert community of engineers across various technical domains.

Airflow

Angular

AWS

Azure

Cloud

Docker

ETL

Google Cloud Platform

Java

JavaScript

Kafka

Kubernetes

MySQL

Node.js

PHP

Postgres

Python

React

Spark

SQL

Terraform

TypeScript

Vue.js

Go