Inferring the Structure of Action Movies

dc.contributor.authorPotapov, Danilaen_US
dc.contributor.authorDouze, Matthijsen_US
dc.contributor.authorRevaud, Jérômeen_US
dc.contributor.authorHarchaoui, Zaiden_US
dc.contributor.authorSchmid, Cordeliaen_US
dc.contributor.editorWilliam Bares and Vineet Gandhi and Quentin Galvane and Remi Ronfarden_US
dc.date.accessioned2017-04-22T17:13:02Z
dc.date.available2017-04-22T17:13:02Z
dc.date.issued2017
dc.description.abstractWhile important advances were recently made towards temporally localizing and recognizing specific human actions or activities in videos, efficient detection and classification of long video chunks belonging to semantically-defined categories remains challenging. Examples of such categories can be found in action movies, whose storylines often follow a standardized structure corresponding to a sequence of typical segments such as ''pursuit'', ''romance'', etc. We introduce a new dataset, Action Movie Franchises, consisting of a collection of Hollywood action movie franchises. We define 11 non-exclusive semantic categories that are broad enough to cover most of the movie footage. The corresponding events are annotated as groups of video shots, possibly overlapping. We propose an approach for localizing events based on classifying shots into categories and learning the temporal constraints between shots. We show that temporal constraints significantly improve the classification performance. We set up an evaluation protocol for event localization as well as for shot classification, depending on whether movies from the same franchise are present or not in the training data.en_US
dc.description.sectionheadersStyles and Challenges
dc.description.seriesinformationEurographics Workshop on Intelligent Cinematography and Editing
dc.identifier.doi10.2312/wiced.20171067
dc.identifier.isbn978-3-03868-031-4
dc.identifier.issn2411-9733
dc.identifier.pages19-27
dc.identifier.urihttps://doi.org/10.2312/wiced.20171067
dc.identifier.urihttps://diglib.eg.org:443/handle/10.2312/wiced20171067
dc.publisherThe Eurographics Associationen_US
dc.subjectI.2.10 [Artificial Intelligence]
dc.subjectVision and Scene understanding
dc.subjectVideo analysis
dc.titleInferring the Structure of Action Moviesen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
019-027.pdf
Size:
1.62 MB
Format:
Adobe Portable Document Format
Collections