Visual Analysis of Humor Assessment Annotations for News Headlines in the Humicroedit Data Set

Abstract
Effective utilization of training data is a critical component for the success of any artificial intelligence algorithm, including natural language processing (NLP) tasks. One particular task of interest is related to detecting or ranking humor in texts, as exemplified by the Humicroedit data set used for the SemEval 2020 task of assessing humor in micro-edited news headlines. Rather than focusing on text classification or prediction, in this study, we focus on gaining a deeper understanding and utilization of the data through the use of information visualization techniques facilitated by the established NLP methods such as sentiment analysis and topic modeling. We describe the design of an interactive visualization tool prototype that relies on multiple coordinated views to allow the user explore and analyze the relationships between the annotated humor scores, sentiments, and topics. Evaluation of the proposed approach involves a case study with the Humicroedit data set as well as domain expert reviews with four participants. The experts deemed the prototype useful for its purpose and saw potential in exploring similar data sets with it, as well as further potential applications in their line of work. Our study thus contributes to the body of work on visual text analytics for supporting computational humor analysis as well as annotated text data analysis in general.
Description

CCS Concepts: Human-centered computing → Visual analytics; Information visualization; Computing methodologies → Natural language processing

        
@inproceedings{
10.2312:vis4nlp.20241134
, booktitle = {
Vis4NLP 2024 - Workshop on Visualization for Natural Language Processing
}, editor = {
Yousef, Tariq
and
Al-Khatib, Khalid
}, title = {{
Visual Analysis of Humor Assessment Annotations for News Headlines in the Humicroedit Data Set
}}, author = {
Kucher, Kostiantyn
and
Akkurt, Elin
and
Folde, Johanna
and
Kerren, Andreas
}, year = {
2024
}, publisher = {
The Eurographics Association
}, ISBN = {
978-3-03868-259-2
}, DOI = {
10.2312/vis4nlp.20241134
} }
Citation
Collections