Scene representation technologies for 3DTV - A survey

Creative Commons License

Alatan A. A. , Yemez Y., Gueduekbay U., Zabulis X., Mueller K., ERDEM C. E. , ...More

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, vol.17, no.11, pp.1587-1605, 2007 (Peer-Reviewed Journal) identifier identifier

  • Publication Type: Article / Review
  • Volume: 17 Issue: 11
  • Publication Date: 2007
  • Doi Number: 10.1109/tcsvt.2007.909974
  • Journal Indexes: Science Citation Index Expanded, Scopus
  • Page Numbers: pp.1587-1605
  • Keywords: animation, dense depth map, modeling, MPEG-4, nonuniform rational B-spline (NURBS), octree, point-based modeling, polygonal mesh, pseudo-3D, rendering, scene representation, subdivision surfaces, texture, volumetric representation, VRML, X3D, 3DTV, SURFACES, COMPRESSION, ANIMATION, TEXTURE, MOTION, MODELS


3-D scene representation is utilized during scene extraction, modeling, transmission and display stages of a 3DTV framework. To this end, different representation technologies are proposed to fulfill the requirements of 3DTV paradigm. Dense point-based methods are appropriate for free-view 3DTV applications, since they can generate novel views easily. As surface representations, polygonal meshes are quite popular due to their generality and current hardware support. Unfortunately, there is no inherent smoothness in their description and the resulting renderings may contain unrealistic artifacts. NURBS surfaces have embedded smoothness and efficient tools for editing and animation, but they are more suitable for synthetic content. Smooth subdivision surfaces, which offer a good compromise between polygonal meshes and NURBS surfaces, require, sophisticated geometry modeling tools and are usually difficult to obtain. One recent trend in surface representation is point-based modeling which can meet most of the requirements of 3DTV, however the relevant state-of-the-art is not yet mature enough. On the other hand, volumetric representations encapsulate neighborhood information that is useful for the reconstruction of surfaces with their parallel implementations for multiview stereo algorithms. Apart from, the representation of 3-D structure by different primitives, texturing of scenes is also essential for a realistic scene rendering. Image-based rendering techniques directly render novel views of a scene from the acquired images, since they do not require any, explicit geometry or texture representation. 3-D human face and body modeling facilitate the realistic animation and rendering of human figures that is quite crucial for 3DTV that might demand real-time animation of human bodies. Physically based modeling and animation techniques produce impressive results, thus have potential for use in a 3DTV framework for modeling and animating dynamic scenes. As a concluding remark, it can be argued that 3-D scene and texture representation techniques are mature enough to serve and fulfill the requirements of 3-D extraction, transmission and display sides in a 3DTV scenario.