Comparing tweet classifications by authors' hashtags, machine learning, and human annotators

Nishioka, Chifumi; Scherp, Ansgar; Dellschaft, Klaas

doi:10.1109/WI-IAT.2015.69

Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/28022

Appears in Collections:	Computing Science and Mathematics Conference Papers and Proceedings
Author(s):	Nishioka, Chifumi Scherp, Ansgar Dellschaft, Klaas
Contact Email:	ansgar.scherp@stir.ac.uk
Title:	Comparing tweet classifications by authors' hashtags, machine learning, and human annotators
Citation:	Nishioka C, Scherp A & Dellschaft K (2016) Comparing tweet classifications by authors' hashtags, machine learning, and human annotators. In: 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), volume 1. 2015 International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 06.12.2015-09.12.2015. Singapore: Institute of Electrical and Electronics Engineers, pp. 67-74. https://doi.org/10.1109/WI-IAT.2015.69
Issue Date:	31-Dec-2016
Date Deposited:	18-Oct-2018
Conference Name:	2015 International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)
Conference Dates:	2015-12-06 - 2015-12-09
Abstract:	Over the last years, many papers have been published about how to use machine learning for classifying postings on microblogging platforms like Twitter, e.g., in order to assist users to reach tweets that interest them. Typically, the automatic classification results are then evaluated against a gold standard classification which consists of either (i) the hashtags of the tweets' authors, or (ii) manual annotations of independent human annotators. In this paper, we show that there are fundamental differences between these two kinds of gold standard classifications, i.e., human annotators are more likely to classify tweets like other human annotators than like the tweets' authors. Furthermore, we discuss how these differences may influence the evaluation of automatic classifications, like they may be achieved by Latent Dirichlet Allocation (LDA). We argue that researchers who conduct machine learning experiments for tweet classification should pay particular attention to the kind of gold standard they use. One may even argue that hashtags are not appropriate as a gold standard for tweet classification.
Status:	VoR - Version of Record
Rights:	The publisher does not allow this work to be made publicly available in this Repository. Please use the Request a Copy feature at the foot of the Repository record to request a copy directly from the author. You can only request a copy if you wish to use this work for your own research or private study.
Licence URL(s):	http://www.rioxx.net/licenses/under-embargo-all-rights-reserved

Files in This Item:

File	Description	Size	Format
Nishioka et al 2015.pdf	Fulltext - Published Version	280.56 kB	Adobe PDF	Under Permanent Embargo Request a copy

Note: If any of the files in this item are currently embargoed, you can request a copy directly from the author by clicking the padlock icon above. However, this facility is dependent on the depositor still being contactable at their original email address.

This item is protected by original copyright

View License

Show full item record

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.

STORRE

STORRE: Stirling Online Research Repository