Machine Learning Scientist I/II, Multi-Modal Scientific Reasonings
On-site
Fresh
Physical Sciences AI
Compensation
$176,000-$304,000Description
Your Impact at LILA
We’re hiring a Machine Learning Scientist to advance multi‑modal reasoning with vision‑language models (VLMs) on real-world scientific data including, but not limited to: figures and plots, microscopy data from diverse sources. You’ll design and build state‑of‑the‑art methods to advance the state of Scientific Superintelligence.
What You'll Be Building
- Lead research on multi‑modal reasoning systems that interpret scientific data (images, plots, text, etc) using state‑of‑the‑art and custom VLMs.
- Design training, adaptation and test-time methods and strategies (e.g., instruction tuning, supervised learning, RLHF, RAG) for scientific understanding tasks.
- Build datasets and benchmarks from real scientific artifacts (e.g., microscopy, spectra, protocols) to understand model performance.
- Develop perception modules (e.g, OCR, table/structure recognition, plot parsing) for multi-modal data modalities.
- Collaborate with domain scientists and engineers to scale research into production ready systems for scientific superintelligence.
What You’ll Need to Succeed
- Advanced degree in a relevant field (CS/AI, Applied Math/Stats, EE) or a physical‑sciences discipline (Materials, Chemistry, Physics) with strong ML focus; or equivalent research/industry experience.
- Track record in multi‑modal ML or VLMs demonstrated via shipped systems, publications, or open‑source.
- Understanding of scientific QA/benchmarks and custom evaluation design.
- Experience with multi-modal fine-tuning, document parsing & understanding, dataset curation and benchmarking.
- Strong engineering skills centered on modern machine learning frameworks (e.g., PyTorch, Huggingface).
- Clear communication and collaboration in cross‑functional settings.
Bonus Points For
- Experience with scientific data modalities in real-world laboratories such as microscopy images.
- Publications in top ML/CV/NLP venues or tangible impact in applied industrial research.
- Contributions to open‑source multi‑modal tooling, evaluation suites, or datasets.
Stack
PyTorchHugging FaceMachine LearningFine-tuningRAGNLPReinforcement Learning
- Posted
- Feb 3, 2026
- Last seen
- Jun 25, 2026
- First seen
- Jun 25, 2026
- Status
- active