FuAGGE: A Novel System to Automatically Generate Fuzzy Rule Based Learners
Abstract
Data in real world applications are in most cases linguistic information that are ambiguous and uncertain. Hence, such data should be handled by fuzzy set representation schemes to increase expressiveness and comprehensiveness. Moreover, mining these data requires ways to generate automatically useful information/knowledge through a set of fuzzy rules. This paper proposes a novel system called FuAGGE that stands for Fuzzy Automatic Generator Genetic Expression. The FuAGGE approach uses a grammar based evolutionary technique. The grammar is expressed in the Backus Naur Form (BNF) and represents a fuzzy set covering method. The grammar is mapped into programs that are themselves implementations of fuzzy rule-based learners. Binary strings are used as inputs to the mapper along with the BNF grammar. These binary strings represent possible potential solutions resulting from the initializer component and the building blocks from Weka, a workbench that contains a collection of visualization tools and algorithms for data analysis and predictive modeling. This operation facilitates the induction process and makes induced programs shorter. FuAGGE has been tested on a benchmark of well-known datasets and experimental results prove the efficiency of the proposed method. It is shown through comparison that our method outperforms most recent and similar, manual techniques. The system is able to generate rule-based learners specialized to specific domains, for example medical or biological data. The generated learners will be able to produces efficient rule models. The produced rule models will achieves more accurate classification for the specific used domain.
Full Text:
PDFReferences
X. Wang, D. Nauck, M. Spott, R. Kruse. Intelligent data analysis with fuzzy decision trees, Soft Computing, 11 (5) pp. 439-457, 2007.
H. Shen, J. Yang, S. Wang, X. Liu, Attribute weighted mercer kernel based fuzzy clustering algorithm for general non-spherical datasets. In Soft Computing, 10 (11) pp.1061-1073, 2006.
R. Ab Ghani, A. Salwani, Y. Razali, Comparisons between artificial neural networks and fuzzy logic models in forecasting general examinations results. 2nd International Conference on Computer, Communications and Control Technology, (2015), pp. 253-257.
J. Zyl, Fuzzy set covering as a new paradigm for the induction of fuzzy classification rules. University of Mannheim, PhD thesis, (2007).
C. Wang, J. Liu, T. Hong, S. Tseng, FILSMR: a fuzzy inductive learning strategy for modular rules. Proceedings of the Sixth IEEE International Conference on Fuzzy Systems , (1997), (3) pp.1289-1294.
L. A. Zadeh, Fuzzy sets. Information and Control 8 (3), 1965
D. Cendrowska . PRISM: An Algorithm for Inducing Modular Rules. In International Journal of Machine Learning Studies, pp.349-370, (1987).
C. Wang, C. Tsai, T. Hong, S. Tseng, Fuzzy Inductive Learning Strategies. Applied Intelligence, 18 (2) pp.179-193, 2003.
J. Zyl, I. Cloete, FuzzConRI- A Fuzzy Conjunctive Rule Inducer. Proceedings of the ECML/PKDD04 Workshop on Advances in Inductive Rule Learning, (2004) pp.194-203.
P. Clark, R. Boswell, Rule induction with CN2: Some recent improvements. Proceedings of the Sixth European Working Session on Learning, (1991), pp.151-163.
H. Theron, I. Cloete, BEXA: A covering algorithm for learning propositional concept descriptions. In Machine Learning,1996, 24 (1) pp.5-40, 1996.
J. Zyl, I. Cloete, Fuzzy rule induction in a set covering framework. IEEE Trans. Fuzzy Syst. 14 (1) pp.93-110, 2005.
J. Huhn, E. Hullermeier, FURIA: an algorithm for unordered fuzzy rule induction. Data Mining and Knowledge Discovery, 19 (3) pp. 293-319, 2009.
J. Furnkranz, Pruning Methods for Rule Learning Algorithms. Proceedings of the 4th International Workshop on Inductive Logic Programming, (1994), pp.321-336.
J. N. Swathi, B.B. Rajen, P. Ilango , M. Khalid , B. K. Tripathy, Induction of fuzzy decision trees and its refinement using gradient projected-neuro-fuzzy decision tree. International Journal of Advanced Intelligence Paradigms, 6 (4) pp. 346-369, 2014.
J. Montana, Strongly Typed Genetic Programming. Evolutionary Computation Journal, (3) pp.199-230, 1994.
A. Nohejl, Grammar Based Genetic Programming. MSc, Charles University of Prague, Prague, Czech, (2011).
H. Li, L. Wong, Knowledge Discovering in Corporate Securities Fraud by Using Grammar Based Genetic Programming. Journal of Computer and Communications, (2) pp.148-156, 2014.
I. Dempsey, M. O'Neill, A. Brabazon, Foundations in Grammatical Evolution for Dynamic Environments. Berlin,Germany, Springer, (2009).
M. O'Neill, E. Hemberg, C. Gilligan, E. Bartley, J. McDermott, A. Brabazon, GEVA: Grammatical Evolution. in Java, SIGEVOlution ACM, (2008), (3) pp.17-23.
A. De Silva, F. Noorian, R. Davis, P. Leong, A Hybrid Feature Selection and Generation Algorithm for Electricity Load Prediction using Grammatical Evolution, 12th International Conference on Machine Learning and Applications, Miami USA, (2013) ,pp. 211-217.
P. Gisele, A. Freita, Automating the Design of Data Mining Algorithms. Springer Berlin Heidelberg, (2010).
J. R. Quinlan, Induction of Decision Trees. Machine Learning. Kluwer Academic Publishers, (1986), (1) pp. 81-106,
A. Freitas, Data mining and Knowledge Discovery with evolutionary algorithms. Berlin, Germany, Springer Verlag, (2002).
M. Corn, M. A. Kunc, Designing model and control system using evolutionary algorithm. 8th Vienna International Conferenceon Mathematical Modelling — MATHMOD 2015, (2015), 48 (1) pp.526-531.
M. Dong, R. Kothari, Classifiability Based Pruning of Decision Trees, Neural Networks. Proceedings IJCNN '14, International Joint Conference on Neural Networks (IJCNN), Washington, DC, (2014).
UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/
A. Nohejl, Grammatical Evolution. BSc, Charles University of Prague, Prague, Czech, (2009).
J. N. Swathi, I. Paramasivam, B. B. Rajen, M. Khalid. A Study on the Approximation of Clustered Data to Parameterized Family of Fuzzy Membership Functions for the Induction of Fuzzy Decision Trees.
Cybernetics and Information Technologies, 15( 2) pp.75-96, 2015.
B. B. Rajen, J. N. Swathi, P. Ilango, and M. Khalid, Approximating fuzzy membership functions from clustered raw data. In Proceedings of India Conference (INDICON) Annual IEEE, (2012) pp. 487-492.
R. Mazouni, A. Rahmoun, AGGE: A Novel Method to Automatically Generate Rule Induction Classifiers Using Grammatical Evolution. Studies in Computational Intelligence, Volume 570, Iintelligent Distributed Computing VIII, (2015) pp. 270-288.
This work is licensed under a Creative Commons Attribution 3.0 License.