Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/22360
|
Title: | Age and gender classification of tweets using convolutional neural networks |
Authors: | Bayot, Roy Gonçalves, Teresa |
Issue Date: | Jan-2018 |
Publisher: | Springer |
Citation: | Roy Bayot and Teresa Gonçalves. Age and gender classification of tweets using convo-
lutional neural networks. In International Workshop on Machine Learning, Optimization
and Big Data. Lecture Notes in Computer Science 2018, vol. 10710 LNCS, pp. 337-348. 2018 |
Abstract: | Determining age and gender from a series of texts is useful for areas such as business intelligence and digital forensics. We explore the use of convolutional neural networks together with word2vec word embeddings for this task in comparison to handcrafted features. The network constructed consists of five layers and is trained using adadelta. It starts with an embedding layer where a word is represented by a vector, followed by a convolutional layer composed of three filters, each with 100 feature maps. It is followed by a max-over-time pooling layer which is done on each map and the resulting features are concatenated before a dropout layer and a softmax layer. The network was trained to classify age and gender for English and Spanish tweets. The predictions per tweet were aggregated using the majority prediction as the final prediction for the user who gave the tweets. The results outperform previous experiments. The highest English age and gender classification accuracy obtained are 49.6\% and 72.1\% respectively. The highest Spanish age and gender classification accuracy obtained on the other hand are 56.0\% and 69.3\% respectively. |
URI: | http://hdl.handle.net/10174/22360 |
Type: | article |
Appears in Collections: | INF - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|