Tennant, M., Stahl, F. ORCID: https://orcid.org/0000-0002-4860-0203 and Gomes, J.
(2015)
Fast adaptive real-time classification for data streams with concept drift.
In: The 8th International Conference on Internet and Distributed Computing Systems, pp. 265-272.
Abstract/Summary
An important application of Big Data Analytics is the real-time analysis of streaming data. Streaming data imposes unique challenges to data mining algorithms, such as concept drifts, the need to analyse the data on the fly due to unbounded data streams and scalable algorithms due to potentially high throughput of data. Real-time classification algorithms that are adaptive to concept drifts and fast exist, however, most approaches are not naturally parallel and are thus limited in their scalability. This paper presents work on the Micro-Cluster Nearest Neighbour (MC-NN) classifier. MC-NN is based on an adaptive statistical data summary based on Micro-Clusters. MC-NN is very fast and adaptive to concept drift whilst maintaining the parallel properties of the base KNN classifier. Also MC-NN is competitive compared with existing data stream classifiers in terms of accuracy and speed.
Item Type | Conference or Workshop Item (Paper) |
URI | https://reading-clone.eprints-hosting.org/id/eprint/44252 |
Item Type | Conference or Workshop Item |
Refereed | Yes |
Divisions | Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science |
Uncontrolled Keywords | Data stream classification; Adaptation to concept drift; High velocity data streams |
Publisher | Springer International Publishing |
Download/View statistics | View download statistics for this item |
Downloads
Downloads per month over past year
University Staff: Request a correction | Centaur Editors: Update this record