Quartiles based UnderSampling(QUS): A Simple and Novel Method to increase the Classification rate of positives in Imbalanced Datasets

dc.contributor.author Veni, C. V.Krishna
dc.contributor.author Rani, T. Sobha
dc.date.accessioned 2022-03-27T05:50:48Z
dc.date.available 2022-03-27T05:50:48Z
dc.date.issued 2018-12-27
dc.description.abstract The main challenge in learning from imbalanced datasets is the presence of a large set of training examples available for the negatives(majority class instances), and very few positives(minority class instances). This may result in a good overall performance of the classifier even though there is a huge red uction in the classification rate of positives. Quartiles based UnderSampling(QUS) method proposed in this paper, addresses the above problem in a simple way. That is balancing the dataset by selecting the negatives based on their similarity with respect to 5 quartiles: minimum, quartile1(Q1), median, quartile3(Q3) and maximum. Intention is to reduce the influence of excessive negatives on the classifier, which may bias it towards a better negatives classification otherwise. An advantage of this undersampling method is parameter independence and gives better results compared to the state of the art methods. The proposed method is tested on kNN (k Nearest Neighbour) classifier and empirical results improve the classification rate of positives than the original unprocessed imbalanced dataset.
dc.identifier.citation 2017 9th International Conference on Advances in Pattern Recognition, ICAPR 2017
dc.identifier.uri 10.1109/ICAPR.2017.8593202
dc.identifier.uri https://ieeexplore.ieee.org/document/8593202/
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8247
dc.subject Classification
dc.subject Clustering
dc.subject Imbalance
dc.subject Stratified Sampling
dc.subject UnderSampling
dc.title Quartiles based UnderSampling(QUS): A Simple and Novel Method to increase the Classification rate of positives in Imbalanced Datasets
dc.type Conference Proceeding. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: