Director, AI Alignment and Interpretability

🕒 vor 11 Tagen

🇺🇸 Vereinigte Staaten – Remote

💵 $195.000 - $290.000 / Jahr

⏰ Vollzeit

🔴 Experte

🤖 Künstliche Intelligenz

🦅 H1B-Visum-Sponsor

info

🗣️🇺🇸🇬🇧 Englisch erforderlich

Jetzt Bewerben
Ähnliche Remote-Jobs finden

📊 Überprüfen Sie Ihre Lebenslauf-Bewertung für diese Stelle

Verbessern Sie Ihre Chancen auf ein Vorstellungsgespräch, indem Sie Ihre Lebenslauf-Bewertung vor der Bewerbung überprüfen.

Logo of CrowdStrike

CrowdStrike

5001 - 10000 Mitarbeiter

Gegründet 2011

🔒 Cybersecurity

☁️ SaaS

🤖 Künstliche Intelligenz

Cybersecurity • SaaS • Artificial Intelligence

CrowdStrike ist ein Cybersecurity-Unternehmen, das cloudbasierte Sicherheitsdienste bereitstellt, um Sicherheitsverletzungen zu stoppen. Es gilt als führend in den Bereichen Endpoint Protection, Identity- und Cloud-Security sowie Managed Detection and Response (MDR). Die Plattform von CrowdStrike, Falcon, integriert künstliche Intelligenz (AI), um Echtzeit-Transparenz, Erkennung und Schutz vor hochentwickelten Cyberbedrohungen zu bieten. Für seine Effektivität beim Schutz von Netzwerken und Daten wird das Unternehmen hoch geschätzt und ist ein vertrauenswürdiger Partner für Unternehmen weltweit.

Beschreibung

• Own the alignment and interpretability research agenda for security-domain AI • Set priorities, personally lead the hardest open problems, and develop methods that explain model behavior mechanistically: not just what models do, but why, and what that implies at the edges of their training distribution • Build and apply techniques for detecting offensive-misuse signal in model internals, including probing for latent representations of vulnerability knowledge, circuit analysis to understand how security-relevant capabilities are encoded, and activation analysis to surface risk that behavioral testing alone would miss • Work closely with the adversarial evaluation team to close the loop between what they find in testing and what you find in the weights • Develop alignment methodology for security-domain AI and own the evaluation framework that makes it measurable • Contribute original research through publications and external engagement • Recruit, develop, and retain a lean team of research scientists

🎯 Anforderungen

• MS or PhD in machine learning, computer science, or a related field, with research depth in interpretability, AI alignment, or a closely adjacent area • 8+ years in ML research or engineering, with direct experience doing interpretability or alignment research on large language models • Hands-on expertise with mechanistic interpretability methods (probing classifiers, circuit analysis, activation patching, causal tracing, feature visualization) applied to real models • Experience designing and running alignment evaluations: behavioral testing, capability elicitation, red-lining, or similar methodologies rigorous enough to support meaningful safety claims • Track record of leading and growing researchers while remaining an active technical contributor yourself

🏖️ Vorteile

• Market leader in compensation and equity awards • Comprehensive physical and mental wellness programs • Competitive vacation and holidays for recharge • Paid parental and adoption leaves • Professional development opportunities for all employees regardless of level or role • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections • Vibrant office culture with world class amenities • Great Place to Work Certified™ across the globe

Jetzt Bewerben

Ähnliche Jobs

🕒 vor 12 Tagen

XPRIZE

51 - 200

🤝 Non-Profit

🔬 Wissenschaft

🌍 Soziale Wirkung

Technical Prize Director leading global competition for energy-efficient AI solutions at XPRIZE. Overseeing operations, partnerships, and technical validation while driving innovation in AI infrastructure.

🇺🇸 Vereinigte Staaten – Remote

💵 $170.000 - $200.000 / Jahr

💰 €1.800.000 Grant im 2018-10

⏰ Vollzeit

🔴 Experte

🤖 Künstliche Intelligenz

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 12 Tagen

Toast

1001 - 5000

☁️ SaaS

🤝 B2B

Transformation leader driving AI adoption and workflow redesign within Toast's marketing organization. Instrumental in leading change management, governance, and enabling programs.

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 12 Tagen

3Cloud

501 - 1000

☁️ SaaS

🤖 Künstliche Intelligenz

🏢 Unternehmen

Principal Architect leading Azure programs and architectures at 3Cloud. Guiding technical strategy and mentoring teams while ensuring alignment with client goals.

🇺🇸 Vereinigte Staaten – Remote

💵 $176.200 - $264.300 / Jahr

⏰ Vollzeit

🔴 Experte

🤖 Künstliche Intelligenz

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 12 Tagen

Instacart

1001 - 5000

🛍️ eCommerce

🚗 Transport

🛒 Einzelhandel

AI Engagement Manager orchestrating complex multi-stakeholder AI engagements for Instacart. Managing partner relationships and engagement economics in a remote-first environment.

🇺🇸 Vereinigte Staaten – Remote

💵 $178.000 - $226.000 / Jahr

💰 €232.000.000 Venture Round im 2021-11

⏰ Vollzeit

🟠 Senior

🔴 Experte

🤖 Künstliche Intelligenz

🦅 H1B-Visum-Sponsor

info

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 13 Tagen

ActivTrak

51 - 200

☁️ SaaS

⚡ Produktivität

🏢 Unternehmen

Director of AI Transformation managing strategy, execution, and team leadership at ActivTrak. Overseeing AI adoption, governance, and cross-functional collaboration for innovative solutions.

🗣️🇺🇸🇬🇧 Englisch erforderlich