Estimation of depth fields suitable for video compression based on 3-D structure and motion of objects


Creative Commons License

Alatan A. A., Onural L.

IEEE TRANSACTIONS ON IMAGE PROCESSING, vol.7, no.6, pp.904-908, 1998 (SCI-Expanded) identifier identifier identifier

  • Publication Type: Article / Letter
  • Volume: 7 Issue: 6
  • Publication Date: 1998
  • Doi Number: 10.1109/83.679440
  • Journal Name: IEEE TRANSACTIONS ON IMAGE PROCESSING
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.904-908
  • Middle East Technical University Affiliated: No

Abstract

Intensity prediction along motion trajectories removes temporal redundancy considerably in video compression algorithms. In three-dimensional (3-D) object-based video coding, both 3-D motion and depth values are required for temporal prediction. The required 3-D motion parameters for each object are found by the correspondence-based E-matrix method. The estimation of the correspondences-two-dimensional (2-D) motion field-between the frames and segmentation of the scene into objects are achieved simultaneously by minimizing a Gibbs energy. The depth field is estimated by jointly minimizing a defined distortion and bit-rate criterion using the 3-D motion parameters. The resulting depth field is efficient in the rate-distortion sense, Bit-rate values corresponding to the lossless encoding of the resultant depth fields are obtained using predictive coding; prediction errors are encoded by a Lempel-Ziv algorithm. The results are satisfactory for real-life video scenes.