Netherlands Cancer Registry (NCR)
Synthetic Dataset
A subset of the NCR is available as a synthetic dataset. The subset covers breast cancer (mammary carcinoma, C50) and includes:
- ICD-O-3 diagnosis
- TNM/stage
- Key outcomes
- First-line treatment
- Death
The synthetic dataset is available in two formats:
- As a single CSV file. This file can be used directly for analyses, or it can be converted into a synthetic dataset in the OMOP Common Data Model (OMOP CDM) by running this script in Docker.
- In OMOP CDM format. In this case, one CSV file is provided for each OMOP CDM table that contains patient information. These files can then be loaded into an existing OMOP CDM environment.
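For a quick look at the per-table files before loading them into a database, something like the following minimal Python sketch may help. It makes several assumptions that are not confirmed by this page: that the files are comma-separated, UTF-8 encoded, start with a header row, and are named after their OMOP CDM table (for example `person.csv`). The function name `load_omop_tables` and the directory name in the usage comment are hypothetical.

```python
import csv
from pathlib import Path


def load_omop_tables(directory):
    """Read every CSV file in `directory` into a list of row dictionaries.

    Returns a dict keyed by the lowercase file stem, e.g. "person" for
    person.csv. Assumes comma-separated files with a header row.
    """
    tables = {}
    for path in sorted(Path(directory).glob("*.csv")):
        # utf-8-sig tolerates an optional byte-order mark at the start of a file
        with path.open(newline="", encoding="utf-8-sig") as handle:
            tables[path.stem.lower()] = list(csv.DictReader(handle))
    return tables


# Usage (hypothetical directory name):
# tables = load_omop_tables("synthetic_omop")
# for name, rows in tables.items():
#     print(name, len(rows))
```

All values are read as strings, so dates and numeric identifiers would still need to be converted before analysis. The single-CSV format can be read the same way with `csv.DictReader` on one file.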
Application procedure
For more information and to apply for a synthetic dataset, please see our website: