Attention And Positional Encoding Are (Almost) All You Need For Shape Matching

Raganato, Alessandro; Pasi, Gabriella; Melzi, Simone

Attention And Positional Encoding Are (Almost) All You Need For Shape Matching

dc.contributor.author	Raganato, Alessandro	en_US
dc.contributor.author	Pasi, Gabriella	en_US
dc.contributor.author	Melzi, Simone	en_US
dc.contributor.editor	Memari, Pooran	en_US
dc.contributor.editor	Solomon, Justin	en_US
dc.date.accessioned	2023-06-30T06:19:14Z
dc.date.available	2023-06-30T06:19:14Z
dc.date.issued	2023
dc.description.abstract	The fast development of novel approaches derived from the Transformers architecture has led to outstanding performance in different scenarios, from Natural Language Processing to Computer Vision. Recently, they achieved impressive results even in the challenging task of non-rigid shape matching. However, little is known about the capability of the Transformer-encoder architecture for the shape matching task, and its performances still remained largely unexplored. In this paper, we step back and investigate the contribution made by the Transformer-encoder architecture compared to its more recent alternatives, focusing on why and how it works on this specific task. Thanks to the versatility of our implementation, we can harness the bi-directional structure of the correspondence problem, making it more interpretable. Furthermore, we prove that positional encodings are essential for processing unordered point clouds. Through a comprehensive set of experiments, we find that attention and positional encoding are (almost) all you need for shape matching. The simple Transformer-encoder architecture, coupled with relative position encoding in the attention mechanism, is able to obtain strong improvements, reaching the current state-of-the-art.	en_US
dc.description.number	5
dc.description.sectionheaders	Shape Correspondence
dc.description.seriesinformation	Computer Graphics Forum
dc.description.volume	42
dc.identifier.doi	10.1111/cgf.14912
dc.identifier.issn	1467-8659
dc.identifier.pages	12 pages
dc.identifier.uri	https://doi.org/10.1111/cgf.14912
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14912
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by-nc/4.0/
dc.subject	CCS Concepts: Computing methodologies -> Shape analysis; Theory of computation -> Computational geometry
dc.subject	Computing methodologies
dc.subject	Shape analysis
dc.subject	Theory of computation
dc.subject	Computational geometry
dc.title	Attention And Positional Encoding Are (Almost) All You Need For Shape Matching	en_US

Files

Original bundle

Now showing 1 - 2 of 2

Name:: v42i5_16_14912.pdf
Size:: 14.8 MB
Format:: Adobe Portable Document Format

Download

Name:: paper1012_mm1.pdf
Size:: 11.31 MB
Format:: Adobe Portable Document Format

Download

Collections

42-Issue 5
SGP23: Eurographics Symposium on Geometry Processing (CGF 42-5)