Software Engineer, Benchmarking

Job not on LinkedIn

🔥 18 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Epoch AI

Epoch AI

11 - 50 employees

Founded 2022

🤖 Artificial Intelligence

🔬 Science

🤝 B2B

Artificial Intelligence • Science • B2B

Epoch AI is a research and data organization that tracks and analyzes trends in artificial intelligence, maintaining open databases of AI models, data centers, hardware performance, chip sales, and benchmarking results. It publishes papers, reports, newsletters, podcasts, and data insights, and offers custom research and advisory services to policymakers, institutions, and companies to inform decisions about AI development, infrastructure, and governance.

📋 Description

• Implement benchmarks: Implement AI benchmarks within our evaluation infrastructure (primarily using the Inspect library) to expand the suite of capabilities we track. Develop our existing suite of benchmarks so we can quickly and painlessly evaluate new model releases. • Develop new benchmarks: Contribute to the development of brand new benchmarks. You will have the opportunity to pitch and prototype your own ideas in addition to helping out with existing projects. • Collaborate: Work closely with researchers, analysts, and other engineers at Epoch AI to ensure evaluation data and outputs are accurate, insightful, and effectively integrated into our research products and publications.

🎯 Requirements

• Solid engineering skills: A strong software engineering background with more than two years of professional experience building and maintaining complex systems. You are expected to regularly contribute high-quality, robust, and maintainable code and be comfortable diving deep into existing codebases and infrastructure. • Ideas and creativity: Candidates should be able to generate their own ideas for new benchmarks, experiments, novel things to try, and other projects. • Mission-driven: You’re motivated by Epoch AI’s mission to provide rigorous, independent insight into key trends in AI. You want to deliver public, trustworthy evaluations of AI capabilities on challenging benchmarks, empowering researchers, policymakers, and the wider public to make well-informed decisions about AI. • AI domain expertise or cybersecurity experience are strong pluses but not required.

🏖️ Benefits

• Fully remote environment, including flexible work hours and schedules for most roles. • Competitive global benefits program, including a comprehensive health insurance program—including supplemental benefits specific to a local country, as available and mandated by local law—and life insurance and a pension plan, if applicable in your country. • Generous paid time off (PTO), including no specific limit on PTO with 30 days per year protected, unlimited personal and sick leave, and up to 6 months (combination of paid + unpaid) parental leave for permanent staff. • A flexible and generous expense policy for you to spend on equipment and a large range of productivity tools or learning/development opportunities you might find valuable, subject to regulations and manager approval. • Paid work trips, including 3 staff retreats per year and relevant conferences. • Access to our very well-equipped offices in Berkeley, California, including paid meals, snacks, gym, and more. All staff, independently of where they are based, have access to the office for at least 20 days each year.

Apply Now

Similar Jobs

🔥 3 hours ago

Clover Health

501 - 1000

Software Engineer enhancing engineering productivity at Clover Health. Requires expertise in backend infrastructure and systems engineering for healthcare innovation.

🔥 8 hours ago

Rockstar

1 - 10

🎯 Recruiter

👥 HR Tech

🤖 Artificial Intelligence

Founding Product Engineer building product workflows for an innovative AI company in accounting automation. Collaborating across product, design, and engineering to create durable product solutions.

🔥 21 hours ago

MelodyArc

11 - 50

🤖 Artificial Intelligence

☁️ SaaS

🤝 B2B

Senior Software Development Engineer at MelodyArc, designing scalable backend services for orchestration. Leading technical projects and partnering with CTO for quality improvements.

🕒 Yesterday

BJAK

51 - 200

🛍️ eCommerce

🏪 Marketplace

iOS Software Engineer developing mobile applications for BJAK's AI-powered insurance and financial products. Collaborating within a global engineering team to enhance user experiences.

🕒 2 days ago

NBCUniversal

10,000+ employees

📱 Media

WordPress Software Engineer part of the backend team for NBCUniversal's news operations. Supporting and developing digital websites and applications using WordPress and AWS.