Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/21734
Title: Weighted similarity-based clustering of chemical structures and bioactivity data in early drug discovery
Authors: PERUALILA, Nolen Joy 
TALLOEN, Willem 
SHKEDY, Ziv 
Göhlmann, Hinrich W. H.
VAN MOERBEKE, Marijke 
KASIM, Adetayo 
Corporate Authors: Quantitative Structure Transcription Assay Relationships (QSTAR) Consortium
Issue Date: 2016
Source: Journal of Bioinformatics and Computational Biology, 14 (4)
Abstract: The modern process of discovering candidate molecules in early drug discovery phase includes a wide range of approaches to extract vital information from the intersection of biology and chemistry. A typical strategy in compound selection involves compound clustering based on chemical similarity to obtain representative chemically diverse compounds (not incorporating potency information). In this paper, we propose an integrative clustering approach that makes use of both biological (compound efficacy) and chemical (structural features) data sources for the purpose of discovering a subset of compounds with aligned structural and biological properties. The datasets are integrated at the similarity level by assigning complementary weights to produce a weighted similarity matrix, serving as a generic input in any clustering algorithm. This new analysis work flow is semi-supervised method since, after the determination of clusters, a secondary analysis is performed wherein it finds differentially expressed genes associated to the derived integrated cluster(s) to further explain the compound-induced biological effects inside the cell. In this paper, datasets from two drug development oncology projects are used to illustrate the usefulness of the weighted similarity-based clustering approach to integrate multi-source high-dimensional information to aid drug discovery. Compounds that are structurally and biologically similar to the reference compounds are discovered using this proposed integrative approach.
Keywords: bioactivity; chemical structure; clustering; transcriptomic
Document URI: http://hdl.handle.net/1942/21734
ISSN: 0219-7200
e-ISSN: 1757-6334
DOI: 10.1142/S0219720016500189
ISI #: 000384031700007
Rights: © World Scientific Publishing Europe Ltd.
Category: A1
Type: Journal Contribution
Validations: ecoom 2017
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
perualila-tan2016.pdf
  Restricted Access
Published version518.01 kBAdobe PDFView/Open    Request a copy
Show full item record

SCOPUSTM   
Citations

2
checked on Sep 2, 2020

WEB OF SCIENCETM
Citations

6
checked on Apr 22, 2024

Page view(s)

64
checked on Aug 9, 2022

Download(s)

44
checked on Aug 9, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.