A hybrid approach to classification of categorical data based on information-theoretic context selection

dc.contributor.author Alamuri, Madhavi
dc.contributor.author Surampudi, Bapi Raju
dc.contributor.author Negi, Atul
dc.date.accessioned 2022-03-27T05:53:01Z
dc.date.available 2022-03-27T05:53:01Z
dc.date.issued 2015-01-01
dc.description.abstract Clustering or classification of data described by categorical attributes is a challenging task in data mining. This is because it is difficult to define a measure between pairs of values of a categorical attributes. The difficulty arises due to lack of ordering information between various pairs of categorical attributes. In this paper we introduce a Hybrid Approach which combines set based context selection with distance computation using KL divergence method. In the literature context based approaches have been introduced recently. Current approaches look at categorical attributes individually, however our approach proposes a novel scheme inspired from information theory. We consider the interdependence redundancy measure to select the significant attributes for context selection. The proposed approach gives encouraging results for low dimensional benchmark UCI datasets with k-nearest neighbor classifier based on the proposed measure. On these datasets the proposed measure performed well in comparison to other distance measures while using various classifiers such as SVM, Naive Bayes and C4.5.
dc.identifier.citation Advances in Intelligent Systems and Computing. v.415
dc.identifier.issn 21945357
dc.identifier.uri 10.1007/978-3-319-27212-2_22
dc.identifier.uri http://link.springer.com/10.1007/978-3-319-27212-2_22
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8587
dc.subject Categorical data
dc.subject Classification
dc.subject Context
dc.subject Similarity
dc.title A hybrid approach to classification of categorical data based on information-theoretic context selection
dc.type Book Series. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: