Departamento de
Inform?ica da Universidade da Beira Interior |
Contact: SOCIA Lab. - Soft Computing and
Image Analysis Group Department
of Computer Science, University of Beira Interior,
6201-001 Covilhã, Portugal
|
P-DESTRE Fully Annotated
Datasets for Pedestrian Detection, Tracking, Re-Identification and Search from Aerial
Devices
Dataset
The
P-DESTRE dataset is the result of a joint
effort from researchers at the University of Beira
Interior (Portugal) and at the JSS Science and
Technology University (India). In order to enable
the research on pedestrian identification from
aerial data, a set of "DJI
Phantom 4" drones controlled by
human operators flied over different scenes of the
campus of both universities, acquiring data that
simulate the everyday conditions in outdoor urban
crowded environments. All the subjects in the
dataset offered explicitly as volunteers and
they were asked to simply ignore the UAVs (Fig. 1),
that were flying at between 5.5 and 6.7 meters
height, with the camera pitch angles varying between
45º to 90º. Volunteers were mostly in the 18-24 age
interval (> 90%), roughly divided into two halves
for gender (approx 65% males), and ethnicity
(”white” and ”Indian”). About 28% of the volunteers
were using glasses, 10% of these were using
sunglasses. Data were recorded at 30fps, with 4K
spatial resolution (3,840 × 2,160), and stored in
”mp4” format, with H.264 compression.
Fig 1. Cohesive view of the data acquisition
protocol used to obtain the P-DESTRE data
sets. Human operators controlled "DJI Phantom 4"
aircrafts, flying at between 5.5 and 6.7 meters
altitude, in order to simulate the autonomous
surveillance of urban scenes. The gimbal pitch
angle varied between 45º to 90º. All
participating subjects explicitly
offered themselves as volunteers for this
experiment. AnnotationThe
P-DESTRE data sets are fully annotated at the
frame level, by providing one text file for each
video file (with the same name plus the ".txt"
extension), as illustrated in the example below: Fig 2. Example of one ".txt" annotation file, providing for each pair subject/frame in the video 26 labels (#frame, #subject ID, bounding box (x4), head pose information (x4) and soft label information (x16)).
Head Pose Statistics
|
|
|
fghfghfghfgh
DI-UBI Bloco VI Rua Marqu?
de ?vila e Bolama P- 6201-001 Covilh?PORTUGAL