Tom�s Pt�cek
Sarcasm detection on Czech and English Twitter
Pt�cek, Tom�s; Habernal, Ivan; Hong, Jun
Abstract
This paper presents a machine learning approach to sarcasm detection on Twitter in two languages – English and Czech. Although there has been some research in sarcasm detection in languages other than English (e.g., Dutch, Italian, and Brazilian Portuguese), our work is the first attempt at sarcasm detection in the Czech language. We created a large Czech Twitter corpus consisting of 7,000 manually-labeled tweets and provide it to the community. We evaluate two classifiers with various combinations of features on both the Czech and English datasets. Furthermore, we tackle the issues of rich Czech morphology by examining different preprocessing techniques. Experiments show that our language-independent approach significantly outperforms adapted state-of-the-art methods in English (F-measure 0.947) and also represents a strong baseline for further research in Czech (F-measure 0.582).
Presentation Conference Type | Conference Paper (unpublished) |
---|---|
Conference Name | COLING 2014 |
Start Date | Aug 23, 2014 |
End Date | Aug 29, 2014 |
Acceptance Date | Aug 23, 2014 |
Publication Date | Aug 23, 2014 |
Deposit Date | Feb 14, 2017 |
Peer Reviewed | Peer Reviewed |
Pages | 213-223 |
Keywords | sarcasm, detection, Czech, English, Twitter |
Public URL | https://uwe-repository.worktribe.com/output/813504 |
Publisher URL | http://anthology.aclweb.org/C/C14/ |
Additional Information | Title of Conference or Conference Proceedings : Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics |
Contract Date | Feb 14, 2017 |
You might also like
A survey of location inference techniques on Twitter
(2015)
Journal Article
Privacy preserving record linkage in the presence of missing values
(2017)
Journal Article
A novel ensemble learning approach to unsupervised record linkage
(2017)
Journal Article
A collaborative multiagent framework based on online risk-aware planning and decision-making
(2017)
Journal Article
Downloadable Citations
About UWE Bristol Research Repository
Administrator e-mail: repository@uwe.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search