TextDNA: Visualizing Word Usage with Configurable Colorfields
dc.contributor.author | Szafir, Danielle Albers | en_US |
dc.contributor.author | Stuffer, Deidre | en_US |
dc.contributor.author | Sohail, Yusef | en_US |
dc.contributor.author | Gleicher, Michael | en_US |
dc.contributor.editor | Kwan-Liu Ma and Giuseppe Santucci and Jarke van Wijk | en_US |
dc.date.accessioned | 2016-06-09T09:33:03Z | |
dc.date.available | 2016-06-09T09:33:03Z | |
dc.date.issued | 2016 | en_US |
dc.description.abstract | Patterns of words used in different text collections can characterize interesting properties of a corpus. However, these patterns are challenging to explore as they often involve complex relationships across many words and collections in a large space of words. In this paper, we propose a configurable colorfield design to aid this exploration. Our approach uses a dense colorfield overview to present large amounts of data in ways that make patterns perceptible. It allows flexible configuration of both data mappings and aggregations to expose different kinds of patterns, and provides interactions to help connect detailed patterns to the corpus overview. TextDNA, our prototype implementation, leverages the GPU to provide interactivity in the web browser even on large corpora. We present five case studies showing how the tool supports inquiry in corpora ranging in size from single document to millions of books. Our work shows how to make a configurable colorfield approach practical for a range of analytic tasks. | en_US |
dc.description.number | 3 | en_US |
dc.description.sectionheaders | Text and Document Data | en_US |
dc.description.seriesinformation | Computer Graphics Forum | en_US |
dc.description.volume | 35 | en_US |
dc.identifier.doi | 10.1111/cgf.12918 | en_US |
dc.identifier.issn | 1467-8659 | en_US |
dc.identifier.pages | 421-430 | en_US |
dc.identifier.uri | https://doi.org/10.1111/cgf.12918 | en_US |
dc.identifier.uri | https://diglib.eg.org/handle/10.1111/cgf12918 | |
dc.publisher | The Eurographics Association and John Wiley & Sons Ltd. | en_US |
dc.subject | H.5.2 [Information Interfaces and Presentation] | en_US |
dc.subject | User Interfaces | en_US |
dc.subject | Graphical User Interfaces | en_US |
dc.subject | I.7.m [Document and Text Processing] | en_US |
dc.subject | Micellaneous | en_US |
dc.subject | Text Analysis | en_US |
dc.subject | J.5 [Computer Applications] | en_US |
dc.subject | Arts and Humanities | en_US |
dc.subject | Literature | en_US |
dc.title | TextDNA: Visualizing Word Usage with Configurable Colorfields | en_US |