P. Merkle, Y. Morvan, A. Smolic, D. Farin, K. Mueller, P.H.N. de With, and T. Wiegand, “The Effects of Multiview Depth Video Compression on Multiview Rendering”, Signal Processing: Image Communication, 2009.
Abstract: This research investigates the interaction between different techniques for depth compression and view synthesis rendering with multiview video plus scene depth data. Two different approaches for depth coding are compared, namely H.264/MVC, using temporal and inter-view reference images for efficient prediction, and the novel platelet-based coding algorithm, characterized by being adapted to the special characteristics of depth-images. Since depth-images are a 2D representation of the 3D scene geometry, depth-image errors lead to geometry distortions. Therefore, the influence of geometry distortions resulting from coding artifacts is evaluated for both coding approaches in two different ways. First, the variation of 3D surface meshes is analyzed using the Hausdorff distance and second, the distortion is evaluated for 2D view synthesis rendering, where color and depth information are used together to render virtual intermediate camera views of the scene. The results show that – although its R-D performance is worse – platelet-based depth coding outperforms H.264, due to improved sharp edge preservation. Therefore depth coding needs to be evaluated with respect to geometry distortions.
K. Müller, A. Smolic, K. Dix, P. Merkle, P. Kauff, and T. Wiegand,”Reliability-based Generation and View Synthesis in Layered Depth Video“, Proc. MMSP 2008, IEEE International Workshop on Multimedia Signal Processing, Cairns, Australia, October 2008.
Abstract: In this paper, a system for video rendering on multiscopic 3D displays is considered where the data is represented as layered depth video (LDV). This representation consists of one full or central video with associated per-pixel depth and additional residual layers. Thus, only one full view with additional residual data needs to be transmitted. The LDV data is used at the receiver to generate all intermediate views for the display. The paper presents the LDV layer extraction as well as the view synthesis, using a scene reliability-driven approach. Here, unreliable image regions are detected and in contrast to previous approaches the residual data is enlarged to reduce artifacts in unreliable areas during rendering. To provide maximum data coverage, the residual data remains at its original positions and will not be projected towards the central view. The view synthesis process also uses this reliability analysis to provide higher quality intermediate views than previous approaches. As a final result, high quality intermediate views for an existing 9-view auto-stereoscopic display are presented, which prove the suitability of the LDV approach for advanced 3D video (3DV) systems.
A. Smolic, K. Müller, K. Dix, P. Merkle, P. Kauff, and T. Wiegand, ” Intermediate View Interpolation Based on Multiview Video Plus Depth for Advanced 3D Video Systems “, Proc. ICIP 2008, IEEE International Conference on Image Processing, San Diego, CA, USA, October 2008.
Abstract: A system for video on multiscopic 3D displays is considered where the data representation consists of multiview video plus scene depth. At most, 3 multiview video signals are being transmitted and used together with the depth data to generate intermediate views at the receiver. The paper presents an approach to such an intermediate view interpolation that separates unreliable image regions along depth discontinuities from reliable image regions. These image regions are processed with different algorithms and previous layered approaches, two boundary layers and one reliable layer is used. Moreover, the presented technique does not rely on 3D graphics support but uses image-based 3D warping instead. For enhanced quality intermediate view generation, hole-filling and filtering methods are described. As a result, high quality intermediate views for an existing 9-view auto-stereoscopic display are presented, which prove the suitability of the approach for advanced 3D video (3DV) systems.
This deliverable makes a user analysis including different user classes, user profiles and use case scenarios after presenting some important design principles of user interface and user interaction. Most important scenarios are chosen as the use case scenarios in this part. Lastly, the requirements for mobile users are categorized such as software, hardware, UI, performance, etc. and labeled as ‘must’, ‘should’ and ‘may’ according to their level of importance.
Specification of 3D camera HW module means the description of the 3D camera related hardware, low level software, mechanical and optical components which will be developed and integrated in the project.
This document describes the software platform 3DPhone. This platform must provide support to all applications that run on the phone. Applications require access devices like the camera or sensors, rendering their windows in 3D, using 3D Multimedia Framework, receive incoming events, and so on. Hence, the platform SW form an intermediate layer between applications and hardware on which applications will run late.
This document’s purpose is to describe the 3D Multimedia Framework to be used in the 3DPhone. Such framework must provide support to the applications developed in the project, helping to manage all multimedia contents and data flows. Therefore, 3DPhone applications define functional requirements, namely use cases, that should satisfy the 3D Multimedia Framework.
This report describes the first study results of 3D User Interfaces and Direct Manipulation Interfaces. First, in Part 1, we survey the direct manipulation input approaches, including both the use of cameras and hardware sensors (such as accelerometers). Then, in Part 2, we explore 3D User Interface techniques, including the design of 3D widgets, 3D rendering, and perception studies.
This document reports first study results for 3D video solutions. The different video formats that were defined in D5.1 are used for this, including three depth-based formats, namely single- and multiview video plus depth and layered depth video, and conventional stereo video. Study results and solutions presented in this deliverable address analysis and synthesis as well as coding algorithms for 3D video. Regarding conventional stereo video, coding approaches are analyzed. For video plus depth results on stereo video rendering for mobile phone video applications are evaluated. For MVD the report presents an extensive study on depth coding and appropriate evaluation methods as well as high quality rendering algorithms. Furthermore extraction and rendering algorithms for layered depth video are investigated. Finally, conclusions and future prospects of study results for 3D video solutions are presented.




