Version: 1.0.0 | Published: 8 Oct 2024 | Updated: 229 days ago
Summary
DOI Name:
10.11581/yk6n-b652
Documentation
Associated Media:
Description:
This wholly synthetic dataset is based on real anonymised primary care patient data extracted from the CPRD Aurum database and focuses on cardiovascular disease risk factors.
To preserve patient privacy, researchers will not be able to access the real anonymised patient data extract that was used as the basis for generating the synthetic dataset.
The ground truth data extract was pre-processed, so the synthetic dataset based on it does not reflect the structure of the source CPRD Aurum database.
This synthetic dataset was developed as part of a project funded by the Regulators’ Pioneer Fund, launched by the Department for Business, Energy and Industrial Strategy (BEIS) and managed by Innovate UK.
The methodology used to generate and evaluate this synthetic dataset is outlined in Wang et al. 2019.
Coverage
Spatial:
United Kingdom
Typical Age Range:
0-150
Follow Up:
Unknown
Pathway:
Primary care
Provenance
Origin
Purposes:
Study
Collection Situations:
Other
Temporal
Accrual Periodicity:
Other
Distribution Release Date:
28 June 2020
Start Date:
25 March 2020
Time Lag:
Not applicable
Accessibility
Access
Access Rights:
Access Service:
Access to CPRD data, including UK Primary Care Data, and linked data such as
Hospital Episode Statistics, is subject to protocol approval via CPRD’s Research
Data Governance (RDG) Process. Independent scientific and patient advice is
provided by Expert Review Committees (ERCs) and the Central Advisory Committee
(CAC): https://www.cprd.com/research-applications
Access Request Cost:
Delivery Lead Time:
Not applicable
Jurisdictions:
GB-GBN
Data Controller:
Clinical Practice Research Datalink (CPRD)
Data Processor:
CPRD
Usage
Data Use Limitations:
- General research use
- No linkage
- Research-specific restrictions
- Research use only
Data Use Requirements:
- Geographical restrictions
- Institution-specific restrictions
- Project-specific restrictions
- Time limit on use
- User-specific restriction
Resource Creators:
CPRD
Format and Standards
Vocabulary Encoding Schemes:
SNOMED CT
Languages:
en
Formats:
Tab delimited text
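As a minimal sketch of how a tab-delimited text file of this kind might be read, the Python snippet below uses the standard-library csv module. The column names (patid, age) and the sample rows are hypothetical and are not taken from this record or from the dataset's documentation.

```python
import csv
import io


def read_tab_delimited(handle):
    """Return the rows of a tab-delimited text stream as a list of dicts."""
    # delimiter="\t" tells the reader that fields are separated by tabs.
    reader = csv.DictReader(handle, delimiter="\t")
    return list(reader)


# Hypothetical in-memory sample; the real column names are not given here.
sample = io.StringIO("patid\tage\n1\t54\n2\t67\n")
rows = read_tab_delimited(sample)

# All values are read as strings; convert types explicitly where needed.
print(len(rows), rows[0]["age"])
```

For a file on disk, the same function can be passed a handle opened with open(path, newline="", encoding="utf-8"), where path is whatever location the file was saved to.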
Observations
Statistical Population:
Persons
Population Description:
Patients in the dataset
Population Size:
499344
Measured Property:
COUNT
Observation Date:
28 June 2020