Datasets

Ensembl RDF for human genome GRCh 37

Ensembl is a collection of automatically annotated genome databases for vertebrates and a number of eukaryotes, integrated with all other relevant biological data available. This RDF dataset is generated by the RDF Portal based on files distributed by Ensembl, and contains only data for the human genome assembly GRCh37. For data on the human genome assembly GRCh38, please use the standard Ensembl RDF dataset.

Dataset specifications

Tags
Gene Protein
Provenance Third party
Registration Added by RDF portal
Data provider
  • EMBL-EBI
Creator
  • Shuichi KawashimaDatabase Center for Life Science
  • Shuya IkedaDatabase Center for Life Science
Issued 2022-05-19
Licenses
  • http://ensembl.org/info/about/legal/disclaimer.html
Version 75
Download https://rdfportal.org/download/ensembl_grch37
SPARQL Endpoint https://rdfportal.org/ebi/sparql

Dataset statistics

Triples
26152049
Subjects
5437757
Properties
16
Objects
7911977
Classes
80

SPARQL example queries

Example 1

Run on Endpoint
# Endpoint: https://rdfportal.org/ebi/sparql
# Description: Ensembl gene and chromosome
# Parameter: ensg_taxonomy: (example: taxonomy:9606)

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX terms: <http://rdf.ebi.ac.uk/terms/ensembl/>
PREFIX so: <http://purl.obolibrary.org/obo/so#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX obo: <http://purl.obolibrary.org/obo/>
PREFIX taxonomy: <http://identifiers.org/taxonomy/>

SELECT ?ensg_id ?ensg_label ?ensg_chromosome ?ensg_taxonomy
FROM <http://rdfportal.org/dataset/ensembl>
WHERE {
    VALUES ?ensg_taxonomy { taxonomy:9606 }
    ?EnsemblGene a terms:EnsemblGene ;
        dcterms:identifier ?ensg_id ;
        rdfs:label ?ensg_label ;
        so:part_of ?ensg_chromosome ;
        obo:RO_0002162 ?ensg_taxonomy .
}
LIMIT 100

Schema diagram

Schema diagram for ensembl_grch37
Schema diagram for ensembl_grch37