Parameswari_faith_nagaraju@Dravidian-CodeMixFIRE: A machine-learning approach using n-grams in sentiment analysis for code-mixed texts: A case study in Tamil and Malayalam
Parameswari_faith_nagaraju@Dravidian-CodeMixFIRE: A machine-learning approach using n-grams in sentiment analysis for code-mixed texts: A case study in Tamil and Malayalam
dc.contributor.author | Krishnamurthy, Parameswari | |
dc.contributor.author | Varghese, Faith | |
dc.contributor.author | Vuppala, Nagaraju | |
dc.date.accessioned | 2022-03-26T13:38:06Z | |
dc.date.available | 2022-03-26T13:38:06Z | |
dc.date.issued | 2020-01-01 | |
dc.description.abstract | Sentiment analysis is a fast growing research positioned to uncover the underlying meaning of a text by categorizing it into different levels. This paper is an attempt to decode the deeply entangled code-mixed Malayalam and Tamil datasets and classify its interlined meaning at five various levels. Along with the corpus creation, [1] propose a five-level classification for Malayalam and Tamil code-mixed datasets. In this paper, we follow the five-level annotated datasets and aim to solve the classification problem by implementing unigram and bigram knowledge with a Multinomial Naive Bayes model. Our model scores an F1-score of 0.55 for Tamil and 0.48 for Malayalam. | |
dc.identifier.citation | CEUR Workshop Proceedings. v.2826 | |
dc.identifier.issn | 16130073 | |
dc.identifier.uri | https://dspace.uohyd.ac.in/handle/1/2042 | |
dc.subject | A Multinomial Naive Bayes model | |
dc.subject | Code-mixed texts | |
dc.subject | Malayalam | |
dc.subject | N-gram | |
dc.subject | Sentiment Analysis | |
dc.subject | Tamil | |
dc.title | Parameswari_faith_nagaraju@Dravidian-CodeMixFIRE: A machine-learning approach using n-grams in sentiment analysis for code-mixed texts: A case study in Tamil and Malayalam | |
dc.type | Conference Proceeding. Conference Paper | |
dspace.entity.type |
Files
License bundle
1 - 1 of 1