CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality

1University of Science and Technology of China; 2National Key Laboratory of Deep Space Exploration; 3AI Thrust, HKUST(GZ); 4Department of CSE, HKUST

CUBE360 presents a novel holistic scene representation for panoramic images.

Abstract

Panoramic images provide comprehensive scene information and are suitable for VR applications. Obtaining corresponding depth maps is essential for achieving immersive and interactive experiences. However, panoramic depth estimation presents significant challenges due to the severe distortion caused by equirectangular projection (ERP) and the limited availability of panoramic RGBD datasets.

Inspired by the recent success of neural rendering, we propose CUBE360, a novel method that learns a cubic field composed of multiple multi-plane images (MPIs) from a single panoramic image, enabling continuous depth estimation in any view direction. CUBE360 uses cubemap projection to transform an ERP image into six faces and extracts an MPI for each face, which reduces the memory required to process MPIs of high-resolution data. This approach also avoids the computational complexity of handling the uneven pixel distribution inherent to equirectangular projection. An attention-based blending module then learns correlations among the MPIs of the cube faces, constructing a cubic field representation with color and density information at multiple depth levels. Furthermore, a novel sampling strategy is introduced for rendering novel views from the cubic field at both cubic and planar scales. The entire pipeline is trained in a self-supervised manner with a photometric loss computed on rendered views, enabling training on 360 videos without depth annotations.
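To make two of the steps above concrete, here is a minimal NumPy sketch, not the authors' implementation. The first function maps each pixel of a cube face to its sampling position in the ERP image. The second turns per-plane densities of one MPI into an expected depth map using standard volume-rendering-style compositing. The function names, the axis convention (x right, y up, z forward), the face orientations, and the weight normalization are all assumptions made for illustration.

```python
import numpy as np


def cube_face_to_erp_coords(face, size, erp_h, erp_w):
    """Return fractional (row, col) ERP coordinates for every pixel of one cube face."""
    # Pixel centers on the face plane, spanning [-1, 1] (90-degree field of view).
    t = (np.arange(size) + 0.5) / size * 2.0 - 1.0
    u, v = np.meshgrid(t, t)  # u grows to the right, v grows downward
    one = np.ones_like(u)
    # Assumed convention: x right, y up, z forward; face orientations are illustrative.
    dirs = {
        "front": (u, -v, one),
        "back": (-u, -v, -one),
        "right": (one, -v, -u),
        "left": (-one, -v, u),
        "up": (u, one, v),
        "down": (u, -one, -v),
    }
    x, y, z = dirs[face]
    lon = np.arctan2(x, z)               # longitude in [-pi, pi], 0 at the front
    lat = np.arctan2(y, np.hypot(x, z))  # latitude in [-pi/2, pi/2]
    col = (lon / (2.0 * np.pi) + 0.5) * erp_w - 0.5
    row = (0.5 - lat / np.pi) * erp_h - 0.5
    return row, col


def composite_depth(sigma, plane_depths):
    """Expected depth from per-plane densities.

    sigma: array of shape (D, H, W), non-negative density per plane.
    plane_depths: array of shape (D,), plane depths ordered near to far.
    """
    plane_depths = np.asarray(plane_depths, dtype=np.float64)
    # Spacing to the next plane; the last plane reuses the previous spacing.
    last_gap = plane_depths[-1] - plane_depths[-2]
    delta = np.diff(plane_depths, append=plane_depths[-1] + last_gap)
    alpha = 1.0 - np.exp(-sigma * delta[:, None, None])
    # Transmittance: fraction of the ray that reaches plane i unoccluded.
    trans = np.cumprod(1.0 - alpha, axis=0)
    trans = np.concatenate([np.ones_like(trans[:1]), trans[:-1]], axis=0)
    weights = trans * alpha
    # Normalizing by the accumulated weight is an assumption of this sketch.
    total = np.maximum(weights.sum(axis=0), 1e-8)
    return (weights * plane_depths[:, None, None]).sum(axis=0) / total
```

The fractional coordinates returned by `cube_face_to_erp_coords` would typically be fed to a bilinear sampler to build each face image. Because every face covers a 90-degree field of view, each MPI is estimated on a perspective image rather than on the unevenly sampled ERP grid.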

Experiments on both synthetic and real-world datasets demonstrate the superior performance of CUBE360 compared to prior self-supervised learning (SSL) methods. We also highlight its effectiveness in downstream applications, such as VR roaming and visual effects, underscoring CUBE360's potential to enhance immersive experiences.

Video

Video Gallery

VR Roaming

Input Panorama


Novel View Synthesis