A novel approach for mining patterns from large uncertain data using MapReduce model

dc.contributor.author Rathan, B. Rini
dc.contributor.author Rani, K. Swarupa
dc.date.accessioned 2022-03-27T05:50:57Z
dc.date.available 2022-03-27T05:50:57Z
dc.date.issued 2017-11-21
dc.description.abstract Frequent pattern mining discovers associations among different items in large sets of data. In many real-world applications, the presence of an object or a characteristic cannot be given exactly all the time. Instead, they can be better expressed in terms of probability and such data is called uncertain data. Mining frequent patterns from uncertain data is challenging due to presence of existential probabilities. With this scenario, researchers are focusing on mining frequent patterns from uncertain data. Leung et al. proposed a few algorithms like UF-Growth, PUF-Growth for pattern mining from uncertain data. These algorithms mine patterns in a sequential manner. They may not be the efficient solutions when dealing with huge amounts of data. Some other algorithms were proposed which can mine patterns in a parallel and distributed environment. But it has the overhead of data distribution, parallelization etc. All such overheads are internally taken care in MapReduce framework. In MR-Growth algorithm, data is stored in the form of UF-Tree. But when the same item has many different probabilities, the size of UF-Tree becomes large, which may effect the overall efficiency. In this paper, in order to overcome this limitation, we have modified and extended the works of Leung et al. [3] in order to represent the data in compact tree structure for mining uncertain data. The functionality and utility of the proposed MR-PUFGrowth algorithm has been demonstrated and also experimented with different kinds of benchmark datasets like mushroom, connect, retail, T10I4D100K.
dc.identifier.citation 2017 International Conference on Computer Communication and Informatics, ICCCI 2017
dc.identifier.uri 10.1109/ICCCI.2017.8117705
dc.identifier.uri http://ieeexplore.ieee.org/document/8117705/
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8292
dc.subject MapReduce
dc.subject PUF-Tree
dc.subject Uncertain Data
dc.title A novel approach for mining patterns from large uncertain data using MapReduce model
dc.type Conference Proceeding. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: