Backward elimination model construction for regression and classification using leave-one-out criteria

Hong, X. (ORCID: https://orcid.org/0000-0002-6832-2298) and Mitchell, R. J. (2007) Backward elimination model construction for regression and classification using leave-one-out criteria. International Journal of Systems Science, 38 (2), pp. 101-113. ISSN 0020-7721. DOI: 10.1080/00207720601051463

Abstract/Summary

A fundamental principle in practical nonlinear data modeling is the parsimonious principle of constructing the minimal model that explains the training data well. Leave-one-out (LOO) cross validation is often used to estimate generalization errors when choosing amongst different network architectures (M. Stone, "Cross-validatory choice and assessment of statistical predictions", J. R. Statist. Soc., Ser. B, 36, pp. 117-147, 1974). Based upon the minimization of a LOO criterion, either the mean square of the LOO errors or the LOO misclassification rate respectively, we present two backward elimination algorithms as model post-processing procedures for regression and classification problems. The proposed backward elimination procedures exploit an orthogonalization procedure to ensure orthogonality between the subspace spanned by the pruned model and the deleted regressor. It is then shown that the LOO criteria used in both algorithms can be calculated via analytic recursive formulae, derived in this contribution, without actually splitting the estimation data set, thereby reducing computational expense. Compared with most other model construction methods, the proposed algorithms are advantageous in several respects: (i) there are no tuning parameters to be optimized through an extra validation data set; (ii) the procedure is fully automatic, without the need for an additional stopping criterion; and (iii) model structure selection is based directly on model generalization performance. Illustrative examples on regression and classification demonstrate that the proposed algorithms are viable post-processing methods for pruning a model to gain extra sparsity and improved generalization.
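To make the idea concrete, the following is a minimal sketch, under stated assumptions, of backward elimination driven by a LOO mean-squared-error criterion for a linear-in-parameters regression model. It is not the paper's orthogonal-decomposition recursion: it uses the standard closed-form LOO identity e_loo_i = e_i / (1 - h_ii) (with h_ii the hat-matrix diagonal) and refits the pruned model for each candidate deletion. The helper names loo_mse and backward_eliminate, and the small ridge term, are hypothetical choices for the illustration.

```python
import numpy as np

def loo_mse(Phi, y, ridge=1e-8):
    """LOO mean squared error for a linear-in-parameters model y ~ Phi @ theta.

    Uses the closed-form identity e_loo_i = e_i / (1 - h_ii), where h_ii are
    the diagonal entries of the hat matrix, so no data splitting is needed.
    A small ridge term (an assumption of this sketch) keeps G invertible.
    """
    G = Phi.T @ Phi + ridge * np.eye(Phi.shape[1])
    theta = np.linalg.solve(G, Phi.T @ y)
    resid = y - Phi @ theta
    # Row-wise phi_i^T G^{-1} phi_i gives the hat-matrix diagonal.
    h = np.einsum('ij,ij->i', Phi @ np.linalg.inv(G), Phi)
    return np.mean((resid / (1.0 - h)) ** 2)

def backward_eliminate(Phi, y):
    """Greedy backward elimination using the LOO MSE as the sole criterion.

    Each pass deletes the regressor whose removal gives the largest drop in
    LOO MSE; the loop stops when no single deletion improves the criterion,
    so the LOO criterion itself acts as the automatic stopping rule.
    """
    keep = list(range(Phi.shape[1]))
    best = loo_mse(Phi, y)
    improved = True
    while improved and len(keep) > 1:
        improved = False
        scores = [loo_mse(Phi[:, keep[:j] + keep[j+1:]], y)
                  for j in range(len(keep))]
        j_best = int(np.argmin(scores))
        if scores[j_best] < best:   # prune only if generalization improves
            best = scores[j_best]
            del keep[j_best]
            improved = True
    return keep, best
```

Note that this naive sketch re-solves the least-squares problem for every candidate deletion in every pass; the orthogonalization and recursive formulae derived in the paper exist precisely to avoid that cost when updating the LOO criterion after a regressor is removed.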