Ensembl RDF for human genome GRCh 37
Ensembl is a collection of automatically annotated genome databases for vertebrates and a number of eukaryotes, integrated with all other relevant biological data available. This RDF dataset is generated by the RDF Portal based on files distributed by Ensembl, and contains only data for the human genome assembly GRCh37. For data on the human genome assembly GRCh38, please use the standard Ensembl RDF dataset.
Dataset statistics
- Triples
- 26152049
- Subjects
- 5437757
- Properties
- 16
- Objects
- 7911977
- Classes
- 80
SPARQL example queries
# Endpoint: https://rdfportal.org/ebi/sparql
# Description: Ensembl gene and chromosome
# Parameter: ensg_taxonomy: (example: taxonomy:9606)
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX terms: <http://rdf.ebi.ac.uk/terms/ensembl/>
PREFIX so: <http://purl.obolibrary.org/obo/so#>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX obo: <http://purl.obolibrary.org/obo/>
PREFIX taxonomy: <http://identifiers.org/taxonomy/>
SELECT ?ensg_id ?ensg_label ?ensg_chromosome ?ensg_taxonomy
FROM <http://rdfportal.org/dataset/ensembl>
WHERE {
VALUES ?ensg_taxonomy { taxonomy:9606 }
?EnsemblGene a terms:EnsemblGene ;
dcterms:identifier ?ensg_id ;
rdfs:label ?ensg_label ;
so:part_of ?ensg_chromosome ;
obo:RO_0002162 ?ensg_taxonomy .
}
LIMIT 100