eRules: A Modular Adaptive Classification Rule Learning Algorithm for Data Streams

Authors: Stahl, F., Medhat Gaber, M. and Martin Salvador, M.

Conference: AI-2012, The Thirty-second SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence

Dates: 11-13 December 2013

Journal: Incorporating Applications and Innovations in Intelligent Systems XX Proceedings of AI-2012, The Thirty-second SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence

Pages: 65-78

ISBN: 978-1-4471-4738-1

DOI: 10.1007/978-1-4471-4739-8_5

https://eprints.bournemouth.ac.uk/23390/

http://link.springer.com/chapter/10.1007/978-1-4471-4739-8_5

Source: Manual

Preferred by: Manuel Salvador

eRules: A Modular Adaptive Classification Rule Learning Algorithm for Data Streams.

Authors: Stahl, F., Medhat Gaber, M. and Salvador, M.M.

Editors: Bramer, M. and Petridis, M.

Conference: AI-2012, The Thirty-second SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence

Pages: 65-78

Publisher: Springer

ISBN: 978-1-4471-4738-1

Abstract:

Advances in hardware and software in the past decade allow to capture, record and process fast data streams at a large scale. The research area of data stream mining has emerged as a consequence from these advances in order to cope with the real time analysis of potentially large and changing data streams. Examples of data streams include Google searches, credit card transactions, telemetric data and data of continuous chemical production processes. In some cases the data can be processed in batches by traditional data mining approaches. However, in some applications it is required to analyse the data in real time as soon as it is being captured. Such cases are for example if the data stream is infinite, fast changing, or simply too large in size to be stored. One of the most important data mining techniques on data streams is classification. This involves training the classifier on the data stream in real time and adapting it to concept drifts. Most data stream classifiers are based on decision trees. However, it is well known in the data mining community that there is no single optimal algorithm. An algorithm may work well on one or several datasets but badly on others. This paper introduces eRules, a new rule based adaptive classifier for data streams, based on an evolving set of Rules. eRules induces a set of rules that is constantly evaluated and adapted to changes in the data stream by adding new and removing old rules. It is different from the more popular decision tree based classifiers as it tends to leave data instances rather unclassified than forcing a classification that could be wrong. The ongoing development of eRules aims to improve its accuracy further through dynamic parameter setting which will also address the problem of changing feature domain values.

https://eprints.bournemouth.ac.uk/23390/

http://link.springer.com/chapter/10.1007/978-1-4471-4739-8_5

Source: BURO EPrints