
Compensation
Salary undisclosed
Description
In This Role, You Will
- Develop and run adversarial test suites—both manual and scripted—for LLMs and image / video models.
- Craft multilingual prompts, jailbreaks, and escalation chains targeting policy edge cases.
- Analyze outputs, triage failures, and write concise vulnerability reports.
- Contribute to internal tooling (e.g., prompt libraries, scenario generators, dashboards).
We’re Looking for Someone Who
- Has 2-4 years of experience in red-teaming, security research, trust & safety, or related fields.
- Is comfortable scripting basic tests (Python, Bash, or similar) and working in Jupyter or prompt-engineering tools.
- Communicates clearly in English and at least one additional language (ideally a major non-English language relevant to global threat landscapes).
- Thinks like an adversary, documents findings crisply, and iterates quickly.
Requirements
- Bachelor’s degree—or equivalent experience—in CS, data science, linguistics, international studies, or security.
- Basic proficiency with Python and command-line tools.
- Demonstrated interest in AI safety, adversarial ML, or abuse detection.
- Strong writing skills for short vulnerability reports and long-form analyses.
- Ability to rapidly context switch across domains, modalities, and abuse areas.
- Enthusiasm for working in a fast-paced, ambiguous space.
Nice to Have
- Full professional proficiency in Arabic, Chinese, Farsi, Portuguese, Russian, or Spanish, as well as English.
- Prior work in content moderation, disinformation analysis, or cyber-threat intelligence.
- Experience with prompt-automation frameworks (e.g., Promptfoo, LangChain, Garak).
- Familiarity with vector search or LLM fine-tuning workflows.
- Formal training or certification in red-teaming or penetration testing.
Compensation & Benefits
- Salary range: $70K–$90K depending on experience.
- Opportunity for spot bonuses and an annual performance-based bonus.
- Fully remote (U.S.-based) with flexible hours.
- Comprehensive health, dental, and vision coverage.
- Generous PTO and paid holidays.
- 401(k) plan.
- Professional-development stipend for courses, conferences, or language study.
- We reward excellence with growth—team members who excel have clear paths for promotion and skill development.
Stack
Data Science, Python, LLMs, Vector Databases, LangChain, Machine Learning, Fine-tuning
- Posted: May 20, 2025
- Last seen: Jun 25, 2026
- First seen: Jun 25, 2026
- Status: active