Accessibility navigation


Fast adaptive real-time classification for data streams with concept drift

Tennant, M., Stahl, F. ORCID: https://orcid.org/0000-0002-4860-0203 and Gomes, J. (2015) Fast adaptive real-time classification for data streams with concept drift. In: The 8th International Conference on Internet and Distributed Computing Systems, pp. 265-272.

[img]
Preview
Text - Accepted Version
· Please see our End User Agreement before downloading.

404kB

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Official URL: http://dx.doi.org/10.1007/978-3-319-23237-9_23

Abstract/Summary

An important application of Big Data Analytics is the real-time analysis of streaming data. Streaming data imposes unique challenges to data mining algorithms, such as concept drifts, the need to analyse the data on the fly due to unbounded data streams and scalable algorithms due to potentially high throughput of data. Real-time classification algorithms that are adaptive to concept drifts and fast exist, however, most approaches are not naturally parallel and are thus limited in their scalability. This paper presents work on the Micro-Cluster Nearest Neighbour (MC-NN) classifier. MC-NN is based on an adaptive statistical data summary based on Micro-Clusters. MC-NN is very fast and adaptive to concept drift whilst maintaining the parallel properties of the base KNN classifier. Also MC-NN is competitive compared with existing data stream classifiers in terms of accuracy and speed.

Item Type:Conference or Workshop Item (Paper)
Refereed:Yes
Divisions:Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science
ID Code:44252
Uncontrolled Keywords:Data stream classification; Adaptation to concept drift; High velocity data streams
Publisher:Springer International Publishing

Downloads

Downloads per month over past year

University Staff: Request a correction | Centaur Editors: Update this record

Page navigation