Data Curator, London

On-site
Isomorphic Labs | London, GB | 3 weeks ago
Tech

Compensation

Salary undisclosed

Description

Your impact 

This is an exciting opportunity to join the data team at IsoLabs, working closely with world-leading AI experts and drug discovery scientists to establish machine-learning-ready datasets that power the discovery of the next generation of medicines. As a data curator, you will be foundational in ensuring data quality at scale and will lead our efforts to represent chemical, biological, and clinical information in the most impactful way for IsoLabs, an AI-driven drug discovery platform.

What you will do 

  • Integrate large-scale biomedical and biochemical datasets and curate them to enhance their quality, creating interoperable data assets that fuel IsoLabs' research efforts.
  • Work in partnership across research teams to create ML-ready datasets.
  • Use your expertise in chemistry and/or biology to maximise the quality and scale of available training data.
  • Contribute to the data team’s efforts to identify and evaluate new data sources and data generation opportunities.
  • Collaborate to devise novel ways to couple machine-learning-based data extraction methods with human domain expertise to build large-scale, high-quality datasets.
  • Communicate your work and raise awareness of opportunities to improve data quality.

Skills and qualifications 

Essential:

  • Proven experience working in industry at a biotech or pharmaceutical company, or working closely with industry at a research institution.
  • PhD in a Life Science or Informatics discipline, or equivalent experience in scientific research.
  • Expertise in data representation, ontologies, and curation of high-quality data assets.
  • Experience working with a broad range of data types used in the drug discovery process (e.g. binding assays, ADMET properties).
  • Deep knowledge of biomedical and biochemical databases and data sources, and of approaches to improving their interoperability for machine learning use cases.
  • Working knowledge of Python and SQL, with experience using cheminformatics and data science toolkits (e.g. RDKit, Pandas/Polars).
  • Strong communicator and proven collaborator with both multi-disciplinary biology/chemistry teams and product/engineering teams.

Nice to have:

  • Familiarity with data engineering concepts and experience running jobs on cloud-based infrastructure.

Stack

Python, Data Science, SQL, pandas, Machine Learning, Data Engineering
Posted: Jun 2, 2026
Last seen: Jun 25, 2026
First seen: Jun 25, 2026
Status: active