Datasets

ChEMBL RDF

ChEMBL is a manually curated database of bioactive molecules with drug-like properties.

Dataset specifications

Tags
Drug/Chemical
Provenance Original
Registration Added by RDF portal
Data provider
  • European Bioinformatics Institute
Creator
  • European Bioinformatics Institute
Issued 2023-03-01
Licenses
  • Creative Commons Attribution-Share Alike 3.0 Unported License
Version 32
Download https://rdfportal.org/download/chembl
SPARQL Endpoint https://rdfportal.org/ebi/sparql

Dataset statistics

Triples
740029339
Subjects
83960714
Properties
125
Objects
185509730
Classes
105

SPARQL example queries

Example 1

Run on Endpoint
# Endpoint: https://rdfportal.org/ebi/sparql
# Description: List the pairs of ChEMBL compound ID and its label

PREFIX cco: <http://rdf.ebi.ac.uk/terms/chembl#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

SELECT DISTINCT ?molecule_chemblid ?molecule_label
FROM <http://rdf.ebi.ac.uk/dataset/chembl>
WHERE {
    VALUES ?Molecule__class { cco:SmallMolecule cco:ProteinMolecule cco:UnknownSubstance cco:Oligosaccharide cco:UnclassifiedSubstance cco:CellTherapy cco:Enzyme cco:Antibody cco:Oligonucleotide cco:UnclassifiedMolecule }
    ?Molecule a ?Molecule__class ;
        cco:chemblId ?molecule_chemblid ;
        rdfs:label ?molecule_label .
}
LIMIT 100

Example 2

Run on Endpoint
# Endpoint: https://rdfportal.org/ebi/sparql
# Description: List compounds targeting Tyrosine-protein kinase ABL (CHEMBL1862) and approved as a drug
# Parameter: drug_mechanism_has_target: (example: chembl_target:CHEMBL1862)
#            molecule_highest_development_phase: (example: 4)

PREFIX cco: <http://rdf.ebi.ac.uk/terms/chembl#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX chembl_target: <http://rdf.ebi.ac.uk/resource/chembl/target/>

SELECT ?molecule_chemblid ?molecule_label ?molecule_highest_development_phase ?molecule_has_mechanism ?drug_mechanism_has_target
FROM <http://rdf.ebi.ac.uk/dataset/chembl>
WHERE {
    VALUES ?drug_mechanism_has_target { chembl_target:CHEMBL1862 }
    VALUES ?molecule_highest_development_phase { 4 }
    VALUES ?drug_mechanism_has_target__class { cco:ADMET cco:ProteinComplexGroup cco:ProteinComplex cco:ProteinSelectivityGroup cco:Phenotype cco:SingleProtein cco:NucleicAcid cco:Organism cco:Metal cco:Macromolecule cco:SmallMoleculeTarget cco:SubCellular cco:OligosaccharideTarget cco:UnknownTarget cco:UnclassifiedTarget cco:CellLineTarget cco:Tissue cco:NonMolecular cco:ChimericProtein cco:ProteinNucleicAcidComplex cco:ProteinFamily cco:ProteinProteinInteraction }
    VALUES ?Molecule__class { cco:SmallMolecule cco:ProteinMolecule cco:UnknownSubstance cco:Oligosaccharide cco:UnclassifiedSubstance cco:CellTherapy cco:Enzyme cco:Antibody cco:Oligonucleotide cco:UnclassifiedMolecule }
    ?Molecule a ?Molecule__class ;
        cco:chemblId ?molecule_chemblid ;
        rdfs:label ?molecule_label ;
        cco:highestDevelopmentPhase ?molecule_highest_development_phase .
    ?molecule_has_mechanism a cco:Mechanism ;
        cco:hasTarget ?drug_mechanism_has_target .
    ?drug_mechanism_has_target a ?drug_mechanism_has_target__class .
    OPTIONAL {
        ?Molecule cco:hasMechanism ?molecule_has_mechanism .
    }
}
LIMIT 100

Schema diagram

Schema diagram for chembl
Schema diagram for chembl