Ridge regression ensemble for toxicity prediction

This source preferred by Marcin Budka

Authors: Budka, M. and Gabrys, B.

http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2359117%232010%23999989998%232084734%23FLP%23&_cdi=59117&_pubType=J&_auth=y&_acct=C000011378&_version=1&_urlVersion=0&_userid=1682380&md5=33b0f16352418b51f72cffbe4d5669e3

Journal: Procedia Computer Science

Volume: 1

Pages: 193-201

ISSN: 1877-0509

DOI: 10.1016/j.procs.2010.04.022

Traditional methods of assessing chemical toxicity of various compounds require tests on animals, which raises ethical concerns and is expensive. Current legislation may lead to a further increase of demand for laboratory animals in the next years. As a result, automatically generated predictions using Quantitative Structure–Activity Relationship (QSAR) modelling approaches appear as an attractive alternative. Due to sparsity of the chemical space, making this kind of predictions is however a difficult task.

In this paper we propose a purely data–driven, rigorous and universal methodology of QSAR modelling, based on ensemble of relatively simple ridge regressors trained in various subspaces of the chemical space, selected using an iterative optimization procedure. The model described has been developed without using any domain knowledge and has been evaluated within the Environmental Toxicity Prediction Challenge CADASTER 2009, which has attracted over 100 participants from 25 countries. The presented approach was chosen as one of the First–Pass Winners, with predictive power non–significantly different to the highest ranked method, developed by the experts in the area of QSAR modelling and toxicology.

This data was imported from DBLP:

Authors: Budka, M. and Gabrys, B.

Editors: Croitoru, M., Ferré, S. and Lukose, D.

http://dx.doi.org/10.1007/978-3-642-14197-3

Journal: ICCS

Volume: 6208

Pages: 193-201

Publisher: Springer

DOI: 10.1016/j.procs.2010.04.022

This data was imported from Scopus:

Authors: Budka, M. and Gabrys, B.

Journal: Procedia Computer Science

Volume: 1

Issue: 1

Pages: 193-201

eISSN: 1877-0509

DOI: 10.1016/j.procs.2010.04.022

Traditional methods of assessing chemical toxicity of various compounds require tests on animals, which raises ethical concerns and is expensive. Current legislation may lead to a further increase of demand for laboratory animals in the next years. As a result, automatically generated predictions using Quantitative Structure-Activity Relationship (QSAR) modelling approaches appear as an attractive alternative. Due to sparsity of the chemical space, making this kind of predictions is however a difficult task. In this paper we propose a purely data-driven, rigorous and universal methodology of QSAR modelling, based on ensemble of relatively simple ridge regressors trained in various subspaces of the chemical space, selected using an iterative optimization procedure. The model described has been developed without using any domain knowledge and has been evaluated within the Environmental Toxicity Prediction Challenge CADASTER 2009, which has attracted over 100 participants from 25 countries. The presented approach was chosen as one of the First-Pass Winners, with predictive power non-significantly different to the highest ranked method, developed by the experts in the area of QSAR modelling and toxicology.

This data was imported from Web of Science (Lite):

Authors: Budka, M., Gabrys, B. and ICCS

Journal: ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS

Volume: 1

Issue: 1

Pages: 193-201

ISSN: 1877-0509

DOI: 10.1016/j.procs.2010.04.022

The data on this page was last updated at 17:31 on November 21, 2017.