Ojha, V. ORCID: https://orcid.org/0000-0002-9256-1192, Jackowski, K., Abraham, A. and Snásel, V.
(2014)
Feature selection and ensemble of regression models for predicting the protein macromolecule dissolution profile.
In: 2014 Sixth World Congress on Nature and Biologically Inspired Computing (NaBIC 2014), 30 July-1 Aug. 2014, Porto, Portugal, pp. 121-126.
doi: 10.1109/NaBIC.2014.6921864
Abstract/Summary
Predicting the dissolution rate of proteins plays a significant role in pharmaceutical/medical applications. The rate of dissolution of Poly Lactic-co-Glycolic Acid (PLGA) micro- and nanoparticles is influenced by several factors. Considering all factors leads to a dataset with three hundred features, making the prediction difficult and inaccurate. Our present study consists of three phases. Firstly, dimensionality reduction techniques are applied in order to simplify the task and eliminate irrelevant and redundant attributes. Subsequently, a heterogeneous pool of several classical regression algorithms is created and evaluated. Regression algorithms in the pool are independently trained to identify the problem at hand. Finally, we test several ensemble methods in order to elevate the accuracy of the prediction. The Evolutionary Weighted Ensemble method proposed in this paper offered the lowest RMSE and significantly outperformed competing classical algorithms and other ensemble techniques.
Altmetric Badge
Item Type | Conference or Workshop Item (Paper) |
URI | https://reading-clone.eprints-hosting.org/id/eprint/93564 |
Item Type | Conference or Workshop Item |
Refereed | Yes |
Divisions | Interdisciplinary Research Centres (IDRCs) > Centre for the Mathematics of Planet Earth (CMPE) Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science |
Download/View statistics | View download statistics for this item |
Downloads
Downloads per month over past year
University Staff: Request a correction | Centaur Editors: Update this record