
Compensation
Salary undisclosed
Description
RESPONSIBILITIES:
- Scale synthetic coding data to trillions of tokens with large-scale Docker verification.
- Distill the intelligence of flagship models into flash models through synthetic data generation.
- Optimize mid-training data mixtures to boost the ceiling for RL.
- Engineer long-context data recipes.
- Develop robust and diverse evaluation for mid-training checkpoints.
BASIC QUALIFICATIONS:
- Expertise in ML and large model scaling, with familiarity across all kinds of scaling laws.
- Strong ability to design ML experiments.
- Familiarity with state-of-the-art techniques for curating AI training data for text, image, audio, and video modalities.
- Strong engineering abilities in Spark, Ray, and other frameworks for large-scale data processing.
COMPENSATION AND BENEFITS:
$180,000 - $440,000 USD
Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.
Stack
Spark, Machine Learning, Docker
- Posted: Oct 31, 2025
- Last seen: Jun 25, 2026
- First seen: Jun 25, 2026
- Status: active