Accessibility navigation


Feature selection and ensemble of regression models for predicting the protein macromolecule dissolution profile

Ojha, V. ORCID: https://orcid.org/0000-0002-9256-1192, Jackowski, K., Abraham, A. and Snásel, V. (2014) Feature selection and ensemble of regression models for predicting the protein macromolecule dissolution profile. In: 2014 Sixth World Congress on Nature and Biologically Inspired Computing (NaBIC 2014), 30 July-1 Aug. 2014, Porto, Portugal, pp. 121-126, https://doi.org/10.1109/NaBIC.2014.6921864.

[img]
Preview
Text - Accepted Version
· Please see our End User Agreement before downloading.

208kB

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

To link to this item DOI: 10.1109/NaBIC.2014.6921864

Abstract/Summary

Predicting the dissolution rate of proteins plays a significant role in pharmaceutical/medical applications. The rate of dissolution of Poly Lactic-co-Glycolic Acid (PLGA) micro- and nanoparticles is influenced by several factors. Considering all factors leads to a dataset with three hundred features, making the prediction difficult and inaccurate. Our present study consists of three phases. Firstly, dimensionality reduction techniques are applied in order to simplify the task and eliminate irrelevant and redundant attributes. Subsequently, a heterogeneous pool of several classical regression algorithms is created and evaluated. Regression algorithms in the pool are independently trained to identify the problem at hand. Finally, we test several ensemble methods in order to elevate the accuracy of the prediction. The Evolutionary Weighted Ensemble method proposed in this paper offered the lowest RMSE and significantly outperformed competing classical algorithms and other ensemble techniques.

Item Type:Conference or Workshop Item (Paper)
Refereed:Yes
Divisions:Interdisciplinary Research Centres (IDRCs) > Centre for the Mathematics of Planet Earth (CMPE)
Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science
ID Code:93564

Downloads

Downloads per month over past year

University Staff: Request a correction | Centaur Editors: Update this record

Page navigation