GenAI Analyst


November 16


ActiveFence

ActiveFence is a Trust and Safety provider for online platforms, protecting platforms and their users from malicious behavior and content. Trust and Safety teams of all sizes rely on ActiveFence to keep their users safe from the widest spectrum of online harms, unwanted content, and malicious behavior, including child safety and exploitation, disinformation, hate speech, terror, nudity, fraud, and more. We offer a full stack of capabilities with our deep intelligence research, AI-driven harmful content detection, and online content moderation platform. Protecting over three billion users globally every day in over 100 languages, ActiveFence lets people interact and thrive online.

201 - 500 employees

💰 $400M Series B on 2021-07

📋 Description

• Writing adversarial prompts to identify weaknesses in AI models.
• Managing and analyzing datasets for high-quality outputs.
• Summarizing findings into structured reports or data templates.

🎯 Requirements

• Proven experience with Generative AI models is essential, though direct technical experience is not a prerequisite.
• Understanding of risk taxonomies (e.g., harm categories, policy tiers).
• Command of English at a near-native level.
• Attention to detail and organizational skills.
• Ability to manage multiple tasks simultaneously and meet deadlines.
• Familiarity with various model types (Text-to-Text, Text-to-Image) is desirable.
• Experience with prompt injection, jailbreaks, and red-teaming techniques.
• Prior work in model evaluation, prompt engineering, or safety analysis.
• Regional expertise or cultural fluency in specific geopolitical areas.
