Datasets

ClinVar RDF

ClinVar is a freely available archive describing relationships between medically important variants and phenotypes.

Dataset specifications

Tags
Polymorphism Health/Disease Sequence
Provenance Third party
Registration Submitted
Data provider
  • National Center for Biotechnology Information
Creator
  • National Center for Biotechnology Information
Issued 2025-06-23
Licenses
  • https://www.ncbi.nlm.nih.gov/clinvar/docs/maintenance_use/
Version release_20250623
Download https://rdfportal.org/download/clinvar
SPARQL Endpoint https://rdfportal.org/ncbi/sparql

Dataset statistics

Triples
1991923652
Subjects
362095575
Properties
183
Objects
502025919
Classes
80

SPARQL example queries

Example 1

Run on Endpoint
# Endpoint: https://rdfportal.org/ncbi/sparql
# Description: Obrain clinical annotations from ClinVar
# Parameter: reference: (example: <http://identifiers.org/hco/13#GRCh37>)
#            position: (example: 32889080)
#            db: (example: MedGen)

PREFIX cvo: <http://purl.jp/bio/10/clinvar/>
PREFIX sio: <http://semanticscience.org/resource/>
PREFIX faldo: <http://biohackathon.org/resource/faldo#>

SELECT ?vcv ?interpretation ?id ?position ?reference
FROM <http://rdfportal.org/dataset/clinvar>
WHERE {
    VALUES ?reference { <http://identifiers.org/hco/13#GRCh37> }
    VALUES ?position { 32889080 }
    ?Clinvar a cvo:VariationArchiveType ;
        cvo:accession ?vcv ;
        cvo:variation_id ?id ;
        cvo:classified_record / sio:SIO_000628 / faldo:location / faldo:position ?position ;
        cvo:classified_record / sio:SIO_000628 / faldo:location / faldo:reference ?reference .
    OPTIONAL {
        ?Clinvar cvo:classified_record / cvo:rcv_list / cvo:rcv_accession / cvo:rcv_classifications / cvo:germline_classification / cvo:description / cvo:description ?interpretation .
    }
}
LIMIT 100

Schema diagram

Schema diagram for clinvar
Schema diagram for clinvar