Local Attention Guided Joint Depth Upsampling
dc.contributor.author | Mallick, Arijit | en_US |
dc.contributor.author | Engelhardt, Andreas | en_US |
dc.contributor.author | Braun, Raphael | en_US |
dc.contributor.author | Lensch, Hendrik P. A. | en_US |
dc.contributor.editor | Bender, Jan | en_US |
dc.contributor.editor | Botsch, Mario | en_US |
dc.contributor.editor | Keim, Daniel A. | en_US |
dc.date.accessioned | 2022-09-26T09:28:37Z | |
dc.date.available | 2022-09-26T09:28:37Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Image super resolution is a classical computer vision problem. One branch of super resolution is guided depth super resolution, where the goal is to accurately upsample a given low-resolution depth map with the help of features aggregated from the high-resolution color image of the same scene. Recently, transformers have improved performance on general image processing tasks thanks to self-attention. Unlike previous methods for guided joint depth upsampling, which rely mostly on CNNs, we compute self-attention efficiently via local image attention, which avoids the quadratic growth typically found in self-attention layers. Our work combines CNNs and transformers to analyze the two input modalities and employs a cross-modal fusion network to predict both a weighted per-pixel filter kernel and a residual for the depth estimate. To further enhance the final output, we integrate a differentiable, trainable deep guided filtering network that provides an additional depth prior. An ablation study and empirical trials demonstrate the importance of each proposed module. Our method achieves competitive to state-of-the-art performance on the guided depth upsampling task. | en_US |
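The abstract mentions two key ingredients: self-attention restricted to local image windows (to avoid quadratic cost in the number of pixels) and a fusion network that predicts a per-pixel filter kernel plus a residual applied to the upsampled depth. The following is a minimal PyTorch sketch of these two ideas only; the module and function names (LocalWindowAttention, apply_per_pixel_kernel), the window and kernel sizes, and the softmax-normalized kernel formulation are assumptions for illustration and do not reflect the authors' actual architecture.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class LocalWindowAttention(nn.Module):
        """Self-attention restricted to non-overlapping k x k windows, so the
        cost per window is (k*k)^2 instead of (H*W)^2 for global attention."""

        def __init__(self, dim, window=8, heads=4):
            super().__init__()
            self.window = window
            self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

        def forward(self, x):  # x: (B, C, H, W); H, W divisible by the window size
            B, C, H, W = x.shape
            k = self.window
            # Partition the feature map into windows of k*k tokens each.
            xw = x.unfold(2, k, k).unfold(3, k, k)          # (B, C, H//k, W//k, k, k)
            xw = xw.permute(0, 2, 3, 4, 5, 1).reshape(-1, k * k, C)
            out, _ = self.attn(xw, xw, xw)                  # attention inside each window
            # Undo the window partition back to (B, C, H, W).
            out = out.reshape(B, H // k, W // k, k, k, C).permute(0, 5, 1, 3, 2, 4)
            return out.reshape(B, C, H, W)

    def apply_per_pixel_kernel(depth_up, kernels, residual, k=3):
        """Filter the upsampled depth with a predicted k x k kernel at every
        pixel, then add a predicted residual (assumed output formulation)."""
        B, _, H, W = depth_up.shape
        patches = F.unfold(depth_up, k, padding=k // 2)     # (B, k*k, H*W)
        weights = F.softmax(kernels.view(B, k * k, H * W), dim=1)
        filtered = (patches * weights).sum(dim=1).view(B, 1, H, W)
        return filtered + residual

In this sketch, `kernels` (shape B x k*k x H x W) and `residual` (shape B x 1 x H x W) would come from a cross-modal fusion network driven by the color guidance features; the deep guided filtering stage described in the abstract is omitted.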
dc.description.sectionheaders | Joint Session | |
dc.description.seriesinformation | Vision, Modeling, and Visualization | |
dc.identifier.doi | 10.2312/vmv.20221197 | |
dc.identifier.isbn | 978-3-03868-189-2 | |
dc.identifier.pages | 1-8 | |
dc.identifier.pages | 8 pages | |
dc.identifier.uri | https://doi.org/10.2312/vmv.20221197 | |
dc.identifier.uri | https://diglib.eg.org:443/handle/10.2312/vmv20221197 | |
dc.publisher | The Eurographics Association | en_US |
dc.rights | Attribution 4.0 International License | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | CCS Concepts: Computing methodologies --> Computer vision; Image representations; Reconstruction | |
dc.subject | Computing methodologies | |
dc.subject | Computer vision | |
dc.subject | Image representations | |
dc.subject | Reconstruction | |
dc.title | Local Attention Guided Joint Depth Upsampling | en_US |