A Novel Data Mining Study to Spot Anomalies in Organizations: A Human Resources Management Case

A Novel Data Mining Study to Spot Anomalies in Organizations: A Human Resources
Management Case

Volume 2, Issue 4
GOKHAN SILAHTAROGLU , PELIN VARDARLIER

Published online: 10 August 2016

Article Views: 20

Abstract

Organizational behaviour is one of the most important assets of an organization; it may increase business value and profitability. Although it comes to life itself with the sector’s contribution, leadership, environment, market conditions and competition, unemployment etc., there must be some ways to lead or direct an organizational behaviour. However, it is not practical to monitor and control organizational behaviour in detail with bare eyes. Technology-wise, we are not away from building a system to learn, understand and monitor the organizational behaviour and give alarms or signals to top management when the system perceives an extraordinary condition. This is doable with machine learning algorithms which is a model of Artificial Intelligence (AI). In the literature, there are plenty of machine learning algorithms that have been proved that they work or learn very well. Both supervised and unsupervised algorithms may be used to learn organizational behaviour and detect anomalies in the behaviour. In this paper, we propose a novel system to build a data warehouse with he data available in an organization and design it for learning the organizational behavior by machine, i.e. computers. The system we propose may be used to spot daily, weekly or monthly changes in organizational behaviour. These changes may have positive or negative effects on the performance of organizations, so this system may also be used as a decision support system for top management. When necessary feedback is given to the system, it may also develop or learn new features to interpret the effect of the change on the organization’s overall performance. The system may also be used to compare the changes in organizational behaviour every year. Nevertheless, it is a useful tool to track, learn and monitor the interactions of each employee in the system. In the study, human resources management has been used as a sample model to detail the proposed system.

References

Al-Raheem, K. F., & Abdul-Karem, W. (2010). Rolling bearing fault diagnostics using artificial neural networks based on Laplace wavelet analysis. International Journal of Engineering, Science and Technology, 2(6), 278-290.
Angiulli, F., & Pizzuti, C. (2002). Fast outlier detection in high dimensional spaces. Paper presented at PKDD ’02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery, (pp. 15-26). https://dx.doi.org/10.1007/3-540-45681-3_2
Breunig, M. M., Kriegel, H. P., Ng, R. T., & Sander, J. (2000). LOF: Identifying density-based local outliers. ACM Sigmod Record, 29(2), 93-104. https://dx.doi.org/10.1145/335191.335388
Cerroni, W., Moro, G., Pasolini, R., & Ramilli, M. (2015). Decentralized detection of network attacks through P2P data clustering of SNMP data. Computers & Security, 52, 1-16.https://dx.doi.org/10.1016/j.cose.2015.03.006
Coifman, R. R., Lafon, S., Lee, A. B., Maggioni, M., Nadler, B., Warner, F., & Zucker, S. W. (2005). Geometric diffusions as a tool for harmonic analysis and structure definition of data: Diffusion maps. Paper presented at the Proceedings of the National Academy of Sciences of the United States of America, 102(21), 7426-7431. https://dx.doi.org/10.1073/pnas.0500334102
Demetgul, M., Yildiz, K., Taskin, S., Tansel, I. N., & Yazicioglu, O. (2014). Fault diagnosis on material handling system using feature selection and data mining techniques. Measurement, 55, 15-24.https://dx.doi.org/10.1016/j.measurement.2014.04.037
Ester, M., Kriegel, H. P., Sander, J., & Xu, X. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise. Knowledge Discovery and Data Mining, 96(34), 226-231.
Hagan, M. T., Demuth, H. B., & Beale, M. H. (1996). Neural network design (pp. 2-14). Boston, Massachusetts, MA: PWS Pub.
Haykin, S. S., Haykin, S. S., Haykin, S. S., & Haykin, S. S. (2009). Neural networks and learning machines. Upper Saddle River,New Jersey, NJ: Pearson Education.
Hemalatha, C. S., Vaidehi, V., & Lakshmi, R. (2015). Minimal infrequent pattern based approach for mining outliers in data streams. Expert Systems with Applications, 42(4), 1998-2012. https://dx.doi.org/10.1016/j.eswa.2014.09.053
Jannesari, A. (2015). Detection of high-level synchronization anomalies in parallel programs. International Journal of Parallel Programming, 43(4), 656-678.
Knorr, E. M., Ng, R. T., & Tucakov, V. (2000). Distance-based outliers: Algorithms and applications. The International Journal on Very Large Data Bases, 8(3-4), 237-253. https://dx.doi.org/10.1007/s007780050006
Knox, E. M., & Ng, R. T. (1998). Algorithms for mining distancebased outliers in large datasets. Paper presented at Proceedings of the International Conference on Very Large Data Bases (pp. 392-403).
Lafon, S., & Lee, A. B. (2006). Diffusion maps and coarse-graining: A unified framework for dimensionality reduction, graph partitioning, and data set parameterization. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 28(9), 1393-1403. https://dx.doi.org/10.1109/TPAMI.2006.184 PMid:16929727
Palacios, A., Martinez, A., Sanchez, L., & Couso, I. (2015). Sequential pattern mining applied to aeroengine condition monitoring with uncertain health data. Engineering Applications of Artificial Intelligence, 44, 10-24. https://dx.doi.org/10.1016/j.engappai.2015.05.003
Purarjomandlangrudi, A., Ghapanchi, A. H., & Esmalifalak, M. (2014). A data mining approach for fault diagnosis: An application of anomaly detection algorithm. Measurement, 55, 343-352.https://dx.doi.org/10.1016/j.measurement.2014.05.029
Roweis, S. T., & Saul, L. K. (2000). Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500), 2323-2326. https://dx.doi.org/10.1126/science.290.5500.2323 PMid:11125150
Sarbu, C., Zehl, K., & Einax, J. W. (2007). Fuzzy divisive hierarchical clustering of soil data using GustafsonKessel algorithm. Chemometrics and Intelligent Laboratory Systems, 86(1), 121-129.https://dx.doi.org/10.1016/j.chemolab.2006.08.015
Todeschini, R., Ballabio, D., Consonni, V., Sahigara, F., & Filzmoser, P. (2013). Locally centred Mahalanobis distance: A new distance measure with salient features towards outlier detection. Analytica Chimica Acta, 787, 1-9. https://dx.doi.org/10.1016/j.aca.2013.04.034 PMid:23830416
Van der Maaten, L. J. P. (2007). An introduction to dimensionality reduction using matlab.
Vardarlier, P., Vural, Y., & Birgun, S. (2014). Modelling of the strategic recruitment process by axiomatic design principles. Procedia-Social and Behavioral Sciences, 150, 374-383.https://dx.doi.org/10.1016/j.sbspro.2014.09.031
Wang, B., Xiao, G., Yu, H., & Yang, X. (2009). Distance-based outlier detection on uncertain data. Paper presented at Computer and Information Technology, 2009. CIT’09. Ninth IEEE International Conference on (Vol. 1, pp. 293-298). IEEE. https://dx.doi.org/10.1109/cit.2009.107
Xia, H., Fang, B., Gao, M., Ma, H., Tang, Y., & Wen, J. (2015). A novel item anomaly detection approach against shilling attacks in collaborative recommendation systems using the dynamic time interval segmentation technique. Information Sciences, 306, 150-165. https://dx.doi.org/10.1016/j.ins.2015.02.019
Yegnanarayana, B. (2009). Artificial neural networks. Delhi, India: PHI Learning Pvt. Ltd. PMid:19813815
Zhang, Q., & Couloigner, I. (2005). A new and efficient k-medoid algorithm for spatial clustering. In Computational Science and Its Applications ICCSA 2005 (pp. 181-189). Berlin, Germany: Springer Berlin Heidelberg. https://dx.doi.org/10.1007/11424857_20

To Cite this article

Silahtaroglu, G., & Vardarler, P. (2016). A novel data mining study to spot anomalies in organizations: A human resources management case. International Journal of Business and Administrative Studies, 2(4), 89-95.