视频处理_Hollywood2_Human Actions and Scenes Dataset(好莱坞第二辑_人脸和场景图像数据集)
- 格式:pdf
- 大小:97.98 KB
- 文档页数:6
Hollywood2_Human Actions and Scenes Dataset(好莱坞第二辑_人脸和场景图像数据集)
数据摘要:
We provide a dataset with 12 classes of human actions and 10 classes of scenes distributed over 3669 video clips and approximately 20.1 hours of video in total. The dataset intends to provide a comprehensive benchmark for human action recognition in realistic and challenging settings. The dataset is composed of video clips from 69 movies (see the list of movies below).
中文关键词:
图像,视频,人脸,好莱坞,场景,
英文关键词:
image,video,human,hollywood,scene,
数据格式:
VIDEO
数据用途:
computer vision
数据详细介绍:
Hollywood2_Human Actions and Scenes Dataset
We provide a dataset with 12 classes of human actions and 10 classesof scenes distributed over 3669 video clips and approximately 20.1 hoursof video in total. The dataset intends to provide a comprehensivebenchmark for human action recognition in realistic and challengingsettings. The dataset is composed of video clips from 69 movies (see the list of movies below). A part of this dataset was originally used in the paper "Actions in Context", Marszalek et al. in Proc. CVPR'09.
Action samples were collected by means of automatic script-to-video alignment in combination with text-based script classification following Laptev at al. CVPR'08. Video samples generated from training movies correspond to the automatic training subset with noisy action labels.
Based on this subset we also constructed a clean training subset with action labels manually verified to be correct. We also provide a test subset with manually checked action labels.
Scene classes are selected automatically from scripts such as to maxi mizeco-occurrence with the given action classes and to capture action context as described in Marsza?ek et al. CVPR'09. Scene video samples are thengenerated using script-to-video alignment. The labels of test scenesamples are manually verified to be correct.
The following tables provide the numbers of video samples in each of thesubsets as well as the distributions of class instances in each subset.
Note that samples may contain instances of several actions such e.g.
kissing and hugging.
Download details
================
The dataset is split in two parts with actions and scene samples respectively: http://www.irisa.fr/vista/actions/hollywood2/Hollywood2-actions.tar.gz (approx. size: 15Gb | md5sum: 55948d0ef45a569a2134ea44e6f8976c)
http://www.irisa.fr/vista/actions/hollywood2/Hollywood2-scenes.tar.gz (approx. size: 25Gb | md5sum: b77f9ffe18ad5ea04957bb4c7725f5ce)
Action video samples are provided in directory AVIClips for three subsets according to the table above. The annotation of samples w.r.t. 12 action classes
is located in ClipSets directory. Similarly, the video samples and annotations for scene samples are located in AVSClipsScenes and ClipSetsScenes directories
respectively.
Example:
The file ClipSets/AnswerPhone_autotrain.txt contains annotation for AnswerPhone
action in the automatic training subset with 810 video clips. Each line of the annotation file provides a name of a video sample in AVIClips directory as well as the flag = {1|-1} indicating whether the sample contains AnswerPhone or not.
(Our annotation format is similar to PASCAL VOC annotation format for image classification task).
We also provide conditional probability tables
http://www.irisa.fr/vista/actions/hollywood2/hollywood2_condprob_scenesgive nact_scripttrain.txt
http://www.irisa.fr/vista/actions/hollywood2/hollywood2_condprob_actgivensce nes_scripttrain.txt
for p(scene|action) and p(action|scene) estimated from an independent set of movie scripts and used in "Actions in Context" Marszalek et al. CVPR'09 paper:
Source movies
=============
The 69 movies used to generate clips in this dataset were divided into 33 training
movies and 36 test movies as follows.
Training movies:
American Beauty, As Good as It Gets, Being John Malkovich, The Big Lebowski, Bruce