Inferring the Structure of Action Movies

Potapov, Danila; Douze, Matthijs; Revaud, Jérôme; Harchaoui, Zaid; Schmid, Cordelia

dc.contributor.author	Potapov, Danila	en_US
dc.contributor.author	Douze, Matthijs	en_US
dc.contributor.author	Revaud, Jérôme	en_US
dc.contributor.author	Harchaoui, Zaid	en_US
dc.contributor.author	Schmid, Cordelia	en_US
dc.contributor.editor	William Bares and Vineet Gandhi and Quentin Galvane and Remi Ronfard	en_US
dc.date.accessioned	2017-04-22T17:13:02Z
dc.date.available	2017-04-22T17:13:02Z
dc.date.issued	2017
dc.description.abstract	While important advances were recently made towards temporally localizing and recognizing specific human actions or activities in videos, efficient detection and classification of long video chunks belonging to semantically-defined categories remains challenging. Examples of such categories can be found in action movies, whose storylines often follow a standardized structure corresponding to a sequence of typical segments such as "pursuit", "romance", etc. We introduce a new dataset, Action Movie Franchises, consisting of a collection of Hollywood action movie franchises. We define 11 non-exclusive semantic categories that are broad enough to cover most of the movie footage. The corresponding events are annotated as groups of video shots, possibly overlapping. We propose an approach for localizing events based on classifying shots into categories and learning the temporal constraints between shots. We show that temporal constraints significantly improve the classification performance. We set up an evaluation protocol for event localization as well as for shot classification, depending on whether movies from the same franchise are present or not in the training data.	en_US
dc.description.sectionheaders	Styles and Challenges
dc.description.seriesinformation	Eurographics Workshop on Intelligent Cinematography and Editing
dc.identifier.doi	10.2312/wiced.20171067
dc.identifier.isbn	978-3-03868-031-4
dc.identifier.issn	2411-9733
dc.identifier.pages	19-27
dc.identifier.uri	https://doi.org/10.2312/wiced.20171067
dc.identifier.uri	https://diglib.eg.org:443/handle/10.2312/wiced20171067
dc.publisher	The Eurographics Association	en_US
dc.subject	I.2.10 [Artificial Intelligence]
dc.subject	Vision and Scene understanding
dc.subject	Video analysis
dc.title	Inferring the Structure of Action Movies	en_US

Files

Original bundle

Name: 019-027.pdf
Size: 1.62 MB
Format: Adobe Portable Document Format

Collections

WICED 2017