Bookmark

Scale adjustments for classifiers in high-dimensional, low sample size settings

Chan, Yao-Ban; Hall, Peter
Biometrika , Volume 96 (2): 469 Oxford University PressJun 1, 2009

Preview Only

Scale adjustments for classifiers in high-dimensional, low sample size settings

Abstract

Abstract Distance-based classifiers are generally considered to be effective at discriminating between populations that differ in location. Indeed, nearest-neighbour methods and the support vector machine are frequently used in very high-dimensional problems involving gene expression data, where it is believed that elevated levels of expression convey much of the information for classification. However, one problem inherent to distance-based classifiers is that scale differences can mask location differences. In consequence, such classifiers can have poor performance if the information for classification accumulates through a large number of relatively small location differences in data components, rather than via large differences. In this paper, we show that a simple adjustment for scale, applicable to a variety of distance-based classifiers, can remedy the problem. For some classifiers, such as those based on the support vector machine or the centroid method, scale corrections are important primarily in the case of small training-sample sizes. However, for other classifiers, including those based on nearest-neighbour and average-distance methods, scale adjustments are helpful more generally. Some key words
Loading next page...
1 Page

Preview Only. This article cannot be rented because we do not currently have permission from the publisher.

 
/lp/oxford-university-press/scale-adjustments-for-classifiers-in-high-dimensional-low-sample-size-AJiLljlJwh
Title
Scale adjustments for classifiers in high-dimensional, low sample size settings
Author(s)
Chan, Yao-Ban; Hall, Peter
Journal
Biometrika , Volume 96 (2): 469 Oxford University Press – Jun 1, 2009
Publisher
Oxford University Press
Copyright
Copyright © 2009 Oxford University Press
ISSN
0006-3444
eISSN
1464-3510
D.O.I.
10.1093/biomet/asp007
Publisher site
Get PDF