Wednesday 25 September 2019

A Review of Decision Tree Algorithms for Predictive Analysis in Data Mining

Volume 12 Issue 1 July - September 2017

Research Paper

A Review of Decision Tree Algorithms for Predictive Analysis in Data Mining

Diana Moses*, B. Deepa**, Trilochan Patri***, M. Sowmya****
* Professor, Department of Computer Science and Engineering, St. Peter's Engineering College, Hyderabad, India.
**-**** UG Scholar, Department of Computer Science and Engineering, St. Peter's Engineering College, Hyderabad, India.
Moses, D., Deepa, B., Patri , T., and Sowmya, M. (2017). A Review of Decision Tree Algorithms for Predictive Analysis in Data Mining. i-manager’s Journal on Software Engineering, 12(1), 38-45. https://doi.org/10.26634/jse.12.1.13923

Abstract

There is a wealth of data archived by business organizations. Analysis of this data provides predictive information for taking proactive decisions and making statistical algorithms which are used for improving the knowledge regarding the engineering process and analysis of data. Data mining is a class of algorithms that analyses the relationship between data and identifies futuristic trends from archived data. Decision tree learning will help us to create a predictive model which will map different items consisting in the set of data and its targets in such a way that each element in this dataset is true. There are many strategies to construct the decision trees, but ID3 is one of the simplest and popularly used decision tree algorithms as there is a disadvantage in ID3 algorithm that it gives more importance to the attributes having multiple values while selecting any item affecting the decision tree. Hence in this paper, the objective is to justify that C4.5 algorithm works better than the ID3 algorithm. C4.5 system of Quinlan is one best classification algorithm that deserves a special mention for several reasons. First best reason is that it is used to represent result of research in machine learning that traces back to the ID3 system. For that reason it is taken as the point of reference for the development and analysis of novel proposals. On the other hand the results of the datasets in this paper proves that C4.5 tree-induction algorithm provides good classification, accuracy, and it is the fastest among the compared main memory algorithms for machine learning and data mining.

No comments:

Post a Comment