1st IEEE International Conference on Multimedia and Expo (ICME2000), New-York, Amerika Birleşik Devletleri, 30 Temmuz - 02 Ağustos 2000, ss.801-804
Efficient methods of content characterization for the browsing, retrieval or filtering of vast amount of digital video content has become a necessity. Still, there is a gap between the computationally available measures of content characteristics and the semantic interpretations of these characteristics. We want to establish connections between motion activity characteristics of video segments and the semantic characterization of them. For this purpose, two simple descriptors for motion activity of a video content is used to infer high-level semantic features of video in certain contexts. One of these descriptors, monotonous activity, is defined as the average block-based motion vector magnitude. The second descriptor, non-monotonous activity, is an approximation to the average temporal derivative of motion vectors. Simulation results for browsing and retrieval applications show that by using the two measures together, object motions that occur close to the camera can be distinguished from distant ones. Also by using the two descriptors together, we are able to differentiate between a high activity due to camera motion and a high activity due to dancing people. Hence, these simple descriptors, especially when used to complete each other, promise to provide important clues about: semantics of a video.