client logo
Version: 17.0.0 | Published: 30 Oct 2023 | Updated: 572 days ago
ind-dataset-logo

Genomics England - Rare Disease

Dataset

Documentation

Description:
Rare Disease data are presented at the level of Rare Disease families (families of probands), Rare Disease pedigrees, and participants. Participants are individuals who have consented to be part of the project with the expectation that a sample of their DNA will be obtained and their genome sequenced. Pedigree members are extended members of the proband’s family, this includes participants as well a small amounts of deidentified data recorded to allow a full picture of the proband’s extended family. This additional information is extracted from the proband’s medical record. All Rare Disease table names are prefixed with “rare_diseases_”. Data at the Level of Rare Disease Families: rare_diseases_family: Data describing the families of rare disease probands participating in the 100,000 Genomes Project. It includes the family group type, the status of the family’s pre-interpretation clinical review and the settings that were chosen for the interpretation pipeline at the clinical review. rare_diseases_pedigree: Data describing the Rare Disease participants, linking pedigrees to probands and their family members. rare_diseases_pedigree_member: Data describing the Rare Disease pedigree members, similar to the data about each individual participant in the participant table (common data view, see section 8.2). It may also include additional data, such as the age of onset of predominant clinical features; data on links to other family members; as well as data collected only for Phenotypes. Data at the Level of Rare Disease Participants. The data presented in these tables provides information on disease progression and pertinent medical history: rare_diseases_participant_disease: Data describing the rare disease participants' disease type/subtype assigned to them upon enrolment, and the date of diagnosis. rare_diseases_participant_phenotype: Data describing the Rare Disease participants’ phenotypes. For each Rare Disease participant in the 100,000 Genomes Project, there are data about whether a phenotypic abnormality as defined by an HPO term is present and what the HPO term is, as well as the age of onset, the severity of manifestation, the spatial pattern in the body and whether it is progressive or not. Please note that these data are only available for a subset of the rare disease participants. rare_diseases_gen_measurement: For Rare Disease participants in the 100,000 Genomes Project, this table contains general measurements relevant to the disease, alongside the date that the measurements were taken on. Please note that these data are only available for a subset of the rare disease participants. rare_diseases_early_childhood_observation: For Rare Disease participants in the 100,000 Genomes Project, this table contains measurements and milestones provided by the GMCs, related to childhood development. Please note that these data are only available for a subset of the rare disease participants. rare_diseases_imaging For Rare Disease
Is Part Of:
100K Primary Data

Coverage

Spatial:
UK
Typical Age Range:
0-150
Follow Up:
OTHER
Physical Sample Availability:
DNA
Pathway:
Linked datasets cover secondary care.

Provenance

Origin

Purposes:
  • CARE
  • STUDY
  • OTHER
Sources:
  • ELECTRONIC SURVEY
  • EPR
  • LIMS
  • OTHER
Collection Situations:
  • CLINIC
  • IN-PATIENTS
  • OUTPATIENTS

Temporal

Accrual Periodicity:
QUARTERLY
Distribution Release Date:
30 March 2023
Start Date:
01 January 2014
End Date:
01 January 2019
Time Lag:
2-6 MONTHS

Accessibility

Access

Access Service:
More information about the Genomics England Research Environment can be found here: https://www.genomicsengland.co.uk/about-genomics-england/research-environment/ https://research-help.genomicsengland.co.uk/display/GERE/1.+The+Genomics+England+Research+Environment Genomics England 100k participants have consented to longitudinal lifetime followup and recontact safely through our clinical network. BRST (Bioinformatics Research Services) are a team of bioinformatics who know the dataset inside out and provide consultancy projects on a case by case basis. Our network of clinical and medical experts can be made available on case by case basis. Researchers have the opportunity to work with our and access the GeCIP network who are a community of world-leading experts in specific cancers and rare diseases.
Access Request Cost:
Fees will be dependent on the type of access that is necessary. Raw data is not eligible for export. Summary-level data may be exported provided that it is approved through the Genomics England Airlock Process
Delivery Lead Time:
2-6 MONTHS
Jurisdictions:
GB-GBN
Data Controller:
GENOMICS ENGLAND
Data Processor:
GENOMICS ENGLAND

Usage

Data Use Limitations:
GENERAL RESEARCH USE
Data Use Requirements:
  • ETHICS APPROVAL REQUIRED
  • PROJECT SPECIFIC RESTRICTIONS
  • PUBLICATION MORATORIUM
Resource Creators:
  • The 100
  • 000 Genomes Project Protocol v3
  • Genomics England. doi:10.6084/m9.figshare.4530893.v3. 2017. Publications that use the Genomics England Database should include an author as Genomics England Research Consortium. Please see the publication policy.

Format and Standards

Vocabulary Encoding Schemes:
  • HPO
  • OPCS4
  • SNOMED CT
  • OTHER
Languages:
en
Formats:
Multiple Formats Available

Enrichment and Linkage

Qualified Relations:
  • HES Accident and Emergency
  • HES Outpatient Care
  • Office for National Statistics - Death details data (ONS)
  • Systemic Anti-Cancer Therapy Data Set (SACT)
  • National Radiotherapy Dataset (RTDS)
  • Diagnostic Imaging Dataset (DID)
  • Patient Reported Outcome Measures (PROMs)
  • Cancer Registration (AV) tables
  • Lung Cancer Data Audit (LUCADA)
  • Mental Health Minimum Data Set (MHMDS)
  • PHE Diagnostic Imaging Dataset (NCRAS_DID)
  • HES Admitted Patient Care
  • Cancer waiting times (CWT)
Derivations:
Not Known

Observations

Statistical Population
Population Description
Population Size
Measured Property
Observation Date
Findings
Cancer Tumour - Number of genomes
17,003
Count
30 March 2023
Findings
Cancer Germline - Number of genomes
32,753
Count
30 March 2023
Findings
Rare Disease - Number of genomes
73,517
Count
30 March 2023
Persons
Rare Disease Participants
72,874
Count
30 March 2023
Persons
Cancer Participants
15,624
Count
30 March 2023