A hybrid approach to classification of categorical data based on information-theoretic context selection
A hybrid approach to classification of categorical data based on information-theoretic context selection
| dc.contributor.author | Alamuri, Madhavi | |
| dc.contributor.author | Surampudi, Bapi Raju | |
| dc.contributor.author | Negi, Atul | |
| dc.date.accessioned | 2022-03-27T05:53:01Z | |
| dc.date.available | 2022-03-27T05:53:01Z | |
| dc.date.issued | 2015-01-01 | |
| dc.description.abstract | Clustering or classification of data described by categorical attributes is a challenging task in data mining. This is because it is difficult to define a measure between pairs of values of a categorical attributes. The difficulty arises due to lack of ordering information between various pairs of categorical attributes. In this paper we introduce a Hybrid Approach which combines set based context selection with distance computation using KL divergence method. In the literature context based approaches have been introduced recently. Current approaches look at categorical attributes individually, however our approach proposes a novel scheme inspired from information theory. We consider the interdependence redundancy measure to select the significant attributes for context selection. The proposed approach gives encouraging results for low dimensional benchmark UCI datasets with k-nearest neighbor classifier based on the proposed measure. On these datasets the proposed measure performed well in comparison to other distance measures while using various classifiers such as SVM, Naive Bayes and C4.5. | |
| dc.identifier.citation | Advances in Intelligent Systems and Computing. v.415 | |
| dc.identifier.issn | 21945357 | |
| dc.identifier.uri | 10.1007/978-3-319-27212-2_22 | |
| dc.identifier.uri | http://link.springer.com/10.1007/978-3-319-27212-2_22 | |
| dc.identifier.uri | https://dspace.uohyd.ac.in/handle/1/8587 | |
| dc.subject | Categorical data | |
| dc.subject | Classification | |
| dc.subject | Context | |
| dc.subject | Similarity | |
| dc.title | A hybrid approach to classification of categorical data based on information-theoretic context selection | |
| dc.type | Book Series. Conference Paper | |
| dspace.entity.type |
Files
License bundle
1 - 1 of 1