About the role: As a Red Teaming Fellow, you will help evaluate the safety, security, and reliability of advanced AI systems. Fellows work alongside researchers, engineers, and subject-matter experts to identify vulnerabilities, uncover failure modes, and generate insights that help organizations deploy AI systems more safely.

This is a hands-on fellowship at the intersection of AI, security, and adversarial testing. Fellows will contribute to real-world evaluations of frontier models and AI-enabled systems, helping design and execute tests that probe how models behave under challenging or unexpected conditions.

In this role, you will:

Conduct adversarial testing of AI systems across safety, security, and misuse scenarios
Design and execute red teaming exercises against models, agents, and AI-enabled applications
Develop prompts, attack strategies, and test cases to identify vulnerabilities and failure modes
Analyze model behavior and document findings in clear, actionable reports
Support the development of evaluation frameworks, taxonomies, and testing methodologies
Research emerging AI threats, attack techniques, and risk trends
Collaborate with engineers, researchers, and subject-matter experts to investigate emerging risks and threat vectors

Develop front-end dashboards and other visualizations

Qualifications:

Pursuing or recently completed a degree in Computer Science, Cybersecurity, Engineering, Data Science, Mathematics, Political Science, International Relations, or a related field
Strong interest in AI safety, cybersecurity, red teaming, or adversarial testing
Familiarity with AI abuse including jailbreaking, system prompt extraction, indirect prompt injection, data exfiltration, etc.
Strong analytical, research, problem-solving, and communication skills
Ability to think creatively from an adversarial perspective
Ability to work independently and collaboratively in a remote environment
Experience with Python, Bash, large language models, AI systems, cybersecurity, or research methodologies preferred
Proficiency in a language other than English preferred

Benefits:

Flexible start / end dates
Remote work (based in the continental U.S.)
Flexible schedule, up to 20 hours per week (negotiable)
Hourly pay commensurate with experience and qualifications

$25 per hour for undergraduate students
$32.50 per hour for graduate students

Red Teaming Fellowship

Description

Stack