CAST: Character labeling in Animation using Self-supervision by Tracking

dc.contributor.authorNir, Oronen_US
dc.contributor.authorRapoport, Galen_US
dc.contributor.authorShamir, Arielen_US
dc.contributor.editorChaine, Raphaëlleen_US
dc.contributor.editorKim, Min H.en_US
dc.date.accessioned2022-04-22T06:27:15Z
dc.date.available2022-04-22T06:27:15Z
dc.date.issued2022
dc.description.abstractCartoons and animation domain videos have very different characteristics compared to real-life images and videos. In addition, this domain carries a large variability in styles. Current computer vision and deep-learning solutions often fail on animated content because they were trained on natural images. In this paper we present a method to refine a semantic representation suitable for specific animated content. We first train a neural network on a large-scale set of animation videos and use the mapping to deep features as an embedding space. Next, we use self-supervision to refine the representation for any specific animation style by gathering many examples of animated characters in this style, using a multi-object tracking. These examples are used to define triplets for contrastive loss training. The refined semantic space allows better clustering of animated characters even when they have diverse manifestations. Using this space we can build dictionaries of characters in an animation videos, and define specialized classifiers for specific stylistic content (e.g., characters in a specific animation series) with very little user effort. These classifiers are the basis for automatically labeling characters in animation videos. We present results on a collection of characters in a variety of animation styles.en_US
dc.description.number2
dc.description.sectionheadersAnimation and Motion Capture
dc.description.seriesinformationComputer Graphics Forum
dc.description.volume41
dc.identifier.doi10.1111/cgf.14464
dc.identifier.issn1467-8659
dc.identifier.pages135-145
dc.identifier.pages11 pages
dc.identifier.urihttps://doi.org/10.1111/cgf.14464
dc.identifier.urihttps://diglib.eg.org:443/handle/10.1111/cgf14464
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectCCS Concepts: Imaging and Video --> Video Summarization; Methods and Applications --> Artificial Intelligence; Computer Vision; Neural Nets
dc.subjectImaging and Video
dc.subjectVideo Summarization
dc.subjectMethods and Applications
dc.subjectArtificial Intelligence
dc.subjectComputer Vision
dc.subjectNeural Nets
dc.titleCAST: Character labeling in Animation using Self-supervision by Trackingen_US
Files
Original bundle
Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
v41i2pp135-145.pdf
Size:
20.76 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
paper1019_1.mp4
Size:
31.44 MB
Format:
Unknown data format
Loading...
Thumbnail Image
Name:
paper1019_2.pdf
Size:
71.11 MB
Format:
Adobe Portable Document Format
Collections