CAST: Character labeling in Animation using Self-supervision by Tracking

Nir, Oron; Rapoport, Gal; Shamir, Ariel

dc.contributor.author	Nir, Oron	en_US
dc.contributor.author	Rapoport, Gal	en_US
dc.contributor.author	Shamir, Ariel	en_US
dc.contributor.editor	Chaine, Raphaëlle	en_US
dc.contributor.editor	Kim, Min H.	en_US
dc.date.accessioned	2022-04-22T06:27:15Z
dc.date.available	2022-04-22T06:27:15Z
dc.date.issued	2022
dc.description.abstract	Cartoon and animation videos have very different characteristics from real-life images and videos. In addition, this domain exhibits large variability in style. Current computer vision and deep-learning solutions often fail on animated content because they were trained on natural images. In this paper we present a method to refine a semantic representation so that it suits specific animated content. We first train a neural network on a large-scale set of animation videos and use the mapping to deep features as an embedding space. Next, we use self-supervision to refine the representation for any specific animation style by gathering many examples of animated characters in this style, using multi-object tracking. These examples are used to define triplets for contrastive loss training. The refined semantic space allows better clustering of animated characters even when they have diverse manifestations. Using this space we can build dictionaries of the characters in an animation video, and define specialized classifiers for specific stylistic content (e.g., characters in a specific animation series) with very little user effort. These classifiers are the basis for automatically labeling characters in animation videos. We present results on a collection of characters in a variety of animation styles.	en_US
dc.description.number	2
dc.description.sectionheaders	Animation and Motion Capture
dc.description.seriesinformation	Computer Graphics Forum
dc.description.volume	41
dc.identifier.doi	10.1111/cgf.14464
dc.identifier.issn	1467-8659
dc.identifier.pages	135-145
dc.identifier.pages	11 pages
dc.identifier.uri	https://doi.org/10.1111/cgf.14464
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14464
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Imaging and Video --> Video Summarization; Methods and Applications --> Artificial Intelligence; Computer Vision; Neural Nets
dc.subject	Imaging and Video
dc.subject	Video Summarization
dc.subject	Methods and Applications
dc.subject	Artificial Intelligence
dc.subject	Computer Vision
dc.subject	Neural Nets
dc.title	CAST: Character labeling in Animation using Self-supervision by Tracking	en_US
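
The abstract mentions that examples gathered by tracking are used to define triplets for contrastive loss training. The following is a minimal illustrative sketch of that general idea, not the authors' implementation. The function names, the `margin` value, and the rule "crops from the same track are positives, crops from a different track are negatives" are all assumptions made for illustration; in particular, treating every other track as a different character is a simplification.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss: pull the positive closer to the anchor
    than the negative by at least `margin` (hypothetical default value)."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def triplets_from_tracks(tracks):
    """Build (anchor, positive, negative) triplets from tracked crops.

    `tracks` maps a track id to a list of embeddings of crops in that track.
    Assumption: consecutive crops in one track show the same character, and
    the first crop of any other track shows a different character.
    """
    triplets = []
    ids = sorted(tracks)
    for tid in ids:
        crops = tracks[tid]
        for other in ids:
            if other == tid:
                continue
            for i in range(len(crops) - 1):
                triplets.append((crops[i], crops[i + 1], tracks[other][0]))
    return triplets


# Toy usage with 2-D "embeddings" for two tracks.
tracks = {"a": [[0.0, 0.0], [0.0, 1.0]], "b": [[3.0, 4.0], [3.0, 5.0]]}
triplets = triplets_from_tracks(tracks)
losses = [triplet_loss(a, p, n) for a, p, n in triplets]
```

In a real pipeline the embeddings would come from the trained network and the loss would be minimized with a deep-learning framework; the sketch only shows how track membership can stand in for labels.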

Files

Original bundle

Name: v41i2pp135-145.pdf
Size: 20.76 MB
Format: Adobe Portable Document Format

Name: paper1019_1.mp4
Size: 31.44 MB
Format: Unknown data format

Name: paper1019_2.pdf
Size: 71.11 MB
Format: Adobe Portable Document Format

Collections

41-Issue 2