Hostname: page-component-cd9895bd7-dzt6s Total loading time: 0 Render date: 2024-12-28T05:09:32.535Z Has data issue: false hasContentIssue false

Prediction of warning level in aircraft accidents using data mining techniques

Published online by Cambridge University Press:  27 January 2016

A. B. Arockia Christopher*
Affiliation:
Anna University, Chennai, India
S. Appavu alias Balamurugan*
Affiliation:
KLN College of Information Technology, Sivagangai, India

Abstract

Data mining is a data analysis process which is designed for large amounts of data. It proposes a methodology for evaluating risk and safety and describes the main issues of aircraft accidents. We have a huge amount of knowledge and data collection in aviation companies. This paper focuses on different feature selectwindion techniques applied to the datasets of airline databases to understand and clean the dataset. CFS subset evaluator, consistency subset evaluator, gain ratio feature evaluator, information gain attribute evaluator, OneR attribute evaluator, principal components attribute transformer, ReliefF attribute evaluatoboundar and symmetrical uncertainty attribute evaluator are used in this study in order to reduce the number of initial attributes. The classification algorithms, such as DT, KNN, SVM, NN and NB, are used to predict the warning level of the component as the class attribute. We have explored the use of different classification techniques on aviation components data. For this purpose Weka software tools are used. This study also proves that the principal components attribute with decision tree classifier would perform better than other attributes and techniques on airline data. Accuracy is also very highly improved. This work may be useful for an aviation company to make better predictions. Some safety recommendations are also addressed to airline companies.

Type
Research Article
Copyright
Copyright © Royal Aeronautical Society 2014 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

1. Arauzo-Azofra, A., Benitez, J.M. and Castro, J.C. Consistency measures for feature selection, J Intell Inf Syst, 2008, 30, pp 273292.Google Scholar
2. Asha, G.K, Manjunath, A.S. and Jayaram, M.A. A comparative study of attribute selection using gain ratio and correlation based feature selection, Int J of Info Tech and Knowledge Management, July-December 2012, 2, pp 271277.Google Scholar
3. Ramana, B.V. Babu, M.S.P. and Venkateswarlu, N.B. A critical study of selected classification algorithms for liver disease diagnosis, Int J Database Management Systems, May 2011, 3, (2).Google Scholar
4. Ienco, D., Pensa, R.G. and Meo, R. Context-based Distance Learning for Categorical Data Clustering, 2009, IDA 2009, LNCS 5772, Springer, Berlin, pp 8394.Google Scholar
5. Recent advances in data mining, Eng Apps of Artificial Intelligence (Editorial), 19, 2006, pp 361362.Google Scholar
6. Guyon, I. and Elisseeff, A. An introduction to variable and feature selection, J Machine Learning Research, 2003, 3, pp 11571182.Google Scholar
7. He, H., Graco, W. and Yao, X. Application of genetic algorithm and k -nearest neighbour method in medical fraud detection, simulated evolution and learning, November 1998, Second Asia-Pacific Conference on Simulated Evolution and Learning, SEAL’ 98, Canberra, Australia, Springer, pp 7481.Google Scholar
8. Jiawei, H. and Kamber, M. Data Mining: Concepts and Techniques, 2001, University of Simon Fraser.Google Scholar
9. Wang, H., Khoshgoftaar, T.M. and Napolitano, A. A comparative study of ensemble feature selection techniques for software defect prediction, 2010, Ninth International Conference on Machine Learning and Applications, 12-14 December 2010, Washington, DC, USA, pp 135140.Google Scholar
10. Shyur, H.J. A quantitative model for aviation safety risk assessment, Computers and Ind Eng, 2007.Google Scholar
11. Han, J. and Kamber, M. Data Mining; Concepts and Techniques, 2000, Morgan Kaufmann Publishers.Google Scholar
12. Bineid, M. and Fielding, J.P. Development of a civil aircraft dispatch reliability prediction methodology, Aircraft Eng and Aerospace Tech, 2003, 75, (6), 2003, pp 588594.Google Scholar
13. Murty, M.N. and Devi, V.S. Pattern Recognition: An Algorithmic Approach, Springer, Ch 4, pp 8697.Google Scholar
14. Grimaldi, M., Cunningham, P. and Kokaram, A. An Evaluation of Alternative Feature Selection Strategies and Ensemble Techniques for Classifying Music, Trinity College Dublin, Ireland.Google Scholar
15. Pizzi, N.J. and Pedrycz, W. Effective classifcation using feature selection and fuzzy, Integration Fuzzy Sets and Systems, 2008.Google Scholar
16. Dessureault, S., Sinuhaji, A. and Coleman, P. Data mining mine safety data, Mining Engineering Littleton, 2007, 59, (8), p 64. 7 pgs.Google Scholar
17. Pulatova, S. Covering (rule-based) Algorithms Lecture Notes in Data Mining, 2006, World Scientific Publishing Co, pp 8797.Google Scholar
18. Solomon, S. Nguyen, H. Liebowitz, J. and Agresti, W. Using data mining to improve traffic safety programs, Ind Management and Data Systems, 2006, 106, (5), pp 621643.Google Scholar
19. Suraj, Z. and Delimata, P. Data mining exploration system for feature selection tasks. In International conference on hybrid information technology, 2006, Ichıt’06, User Manuel of Polyanalyst 5, April 2005, IEEE, Computer Society.Google Scholar
20. Mitchell, T. Machine Learning, 1997, McGraw Hill.Google Scholar
21. Altidor, W., Khoshgoftaar, T.M., Van Hulse, J. and Napolitano, A. Ensemble feature ranking methods for data intensive computing applications, Handbook of Data Intensive Computing, Springer Science + Business Media, LLC 2011, pp 349376.Google Scholar
22. Chen, Y., Li, Y., Cheng, X., Guo, L., Lipmaa, H., Yung, M. and Lin, D. Survey and taxonomy of feature selection algorithms in intrusion detection system, 2006, Inscrypt 2006, LNCS 4318 Springer-Verlag, Berlin, Germany, pp153167.Google Scholar
23. Nazeri, Z. and Jianping, Z. Mining aviation data to understand impacts of severe weather on airspace system performance, International Conference on Information Technology, 2002, IEEE.Google Scholar