Word Representations for Gender Classification Using Deep Learning

dc.contributor.author Ritesh, Ritesh
dc.contributor.author Bhagvati, Chakravarthy
dc.date.accessioned 2022-03-27T05:54:23Z
dc.date.available 2022-03-27T05:54:23Z
dc.date.issued 2018-01-01
dc.description.abstract This paper studies the effect of word representations on gender classification using deep learning. There are two main objectives: how well do popular deep learning architectures, namely LSTM and CNNs, perform on gender classification task and investigate how the choice of word representation effects the performance. Three networks, LSTM, CNN and LeNet-5, were trained on a dataset containing about 18000 names from India, Western countries, Sri Lanka and Japan. These names, encoded using the popular One-Hot representation and Word Embeddings in addition to Integer representation and an Enhanced Integer representation (proposed in this paper), were given as Input and the performance is evaluated on accuracy, training times and size of input layer. Experimental results show that LSTM in combination with word embedding derived from the proposed Enhanced Integer representation gives the best performance of about 85%. One-Hot representation is superior to Integer and Enhanced Integer representation but appears to perform lower than word embeddings.
dc.identifier.citation Procedia Computer Science. v.132
dc.identifier.uri 10.1016/j.procs.2018.05.015
dc.identifier.uri https://www.sciencedirect.com/science/article/abs/pii/S1877050918307476
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8709
dc.subject CNN
dc.subject Gender prediction
dc.subject LSTM
dc.subject One-Hot
dc.subject Word embeddings
dc.subject Word representations
dc.title Word Representations for Gender Classification Using Deep Learning
dc.type Conference Proceeding. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: