Applied Research Scientist

🕒 February 26

Sully.ai

11 - 50 employees

Founded 2023

🤖 Artificial Intelligence

⚕️ Healthcare Insurance

☁️ SaaS

Sully.ai provides AI agents designed specifically for healthcare organizations. Its agents support healthcare functions such as clinical procedures, patient appointment management, medical billing, and patient care coordination. The agents are built to be 10 times faster and 20 times cheaper than traditional healthcare employees while ensuring safety, security, and compliance with industry standards such as HIPAA. Healthcare providers across the country have adopted Sully.ai to improve patient care and operational workflows.

📋 Description

• Build and scale automated evaluation pipelines (LLM-as-judge + human review) with clinical-grade benchmarks.
• Audit existing evaluation approaches for clinical and agentic tasks.
• Define initial benchmarks and build early automated pipelines.
• Partner with engineering to land the first set of CI gates for accuracy, factuality, and safety.
• Deliver a repeatable evaluation framework with automated pipelines in production.
• Demonstrate measurable improvements in robustness, hallucination reduction, or safety.
• Publish or present internal research findings that directly shape product reliability.

🎯 Requirements

• Proven experience designing agentic processes and LLM evaluation/benchmarking frameworks.
• Strong Python and ML background (PyTorch/TensorFlow, Hugging Face, LangChain/LlamaIndex).
• Demonstrated ability to design rigorous experiments and translate findings into production.
• Track record of published research or deep applied work in LLMs and agent evaluation.
• Strong communication and technical writing skills to articulate complex findings clearly.

🏖️ Benefits

• Speed matters - we operate with urgency, autonomy, and ownership
• You’ll work on real, first-of-their-kind problems at the edge of AI and medicine
• Your work helps doctors reclaim their time - and patients get better, faster care
