Deep Data Mining And Hadoop Simulation In Computerized Systems

Document Type: Primary Research Paper

Authors

1 Universiti Tunku Abdul Rahman, Malaysia

2 Fakulti AlamBina, Universiti Malaya

3 Cluster of Education and Social Sciences, Open University Malaysia

Abstract

In this investigation, the main aim was to determine how effective and feasible it would be to exploit idle computational storage. The proposed model was built on the Hadoop Distributed File System (HDFS) architecture and was implemented with CPN Tools, which comprise the CPN ML programming language and Colored Petri Nets. To characterize workstation availability within the model, data were collected in a computer lab over about 40 days. Across three tests (a physical test, a cloud test, and a simulation-based test), the deep data locality approach yielded a significant improvement in Hadoop performance. In particular, the deep data locality technique led to a 34 percent improvement in Hadoop performance. It was therefore concluded that the superiority of the proposed approach arises from its ability to reduce HDFS data movement.
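To illustrate why data locality reduces data movement, the following is a minimal toy sketch in Python. It is not the deep data locality technique evaluated in the paper, nor Hadoop's scheduler API. The function names (`assign_tasks`, `remote_reads`), the node names, and the block layout are all hypothetical and chosen only for illustration. The sketch compares a placement that ignores replica locations with one that prefers nodes already holding a replica of the block, and counts how many tasks would have to fetch their block over the network.

```python
from typing import Dict, List


def assign_tasks(block_locations: Dict[str, List[str]],
                 nodes: List[str],
                 locality_aware: bool) -> Dict[str, str]:
    """Assign one task per block to a node (toy model, hypothetical names)."""
    assignment: Dict[str, str] = {}
    load = {n: 0 for n in nodes}
    for i, (block, holders) in enumerate(sorted(block_locations.items())):
        if locality_aware:
            # Prefer nodes that already store a replica of this block;
            # fall back to all nodes if no holder is available.
            candidates = [n for n in holders if n in load] or nodes
            node = min(candidates, key=lambda n: (load[n], n))
        else:
            # Round-robin placement that ignores where replicas live.
            node = nodes[i % len(nodes)]
        assignment[block] = node
        load[node] += 1
    return assignment


def remote_reads(assignment: Dict[str, str],
                 block_locations: Dict[str, List[str]]) -> int:
    """Count tasks whose node does not hold a replica of its block."""
    return sum(1 for b, n in assignment.items() if n not in block_locations[b])


# Assumed example layout: block id -> nodes holding a replica.
blocks = {"b0": ["n2", "n3"], "b1": ["n1", "n3"], "b2": ["n1", "n2"], "b3": ["n3"]}
nodes = ["n1", "n2", "n3"]

naive = remote_reads(assign_tasks(blocks, nodes, locality_aware=False), blocks)
local = remote_reads(assign_tasks(blocks, nodes, locality_aware=True), blocks)
print(f"remote reads without locality: {naive}, with locality: {local}")
```

In this assumed layout, every round-robin assignment lands on a node without a replica, while the locality-aware placement avoids remote reads entirely. Real clusters sit between these extremes, and the size of the gain depends on replica placement and node availability.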