A Multimodal Personality Prediction Framework based on Adaptive Graph Transformer Network and Multi-task Learning

dc.contributor.authorWang, Rongquanen_US
dc.contributor.authorZhao, Xileen_US
dc.contributor.authorXu, Xianyuen_US
dc.contributor.authorHao, Yangen_US
dc.contributor.editorBousseau, Adrienen_US
dc.contributor.editorDay, Angelaen_US
dc.date.accessioned2025-05-09T09:11:45Z
dc.date.available2025-05-09T09:11:45Z
dc.date.issued2025
dc.description.abstractMultimodal personality analysis aims to accurately detect personality traits by incorporating related multimodal information. However, existing methods focus on unimodal features while overlooking the bimodal association features crucial for this interdisciplinary task. Therefore, we propose a multimodal personality prediction framework based on an adaptive graph transformer network and multi-task learning. First, we utilize pre-trained models to learn specific representations from different modalities. Here, we employ the encoders of pre-trained multimodal models as the backbones of the modality-specific extraction methods to mine unimodal features. Specifically, we introduce a novel adaptive graph transformer network to mine personality-related bimodal association features. This network effectively learns higher-order temporal dependencies based on relational graphs and emphasizes more significant features. Furthermore, we utilize a multimodal channel attention residual fusion module to obtain the fused features, and we propose a multimodal and unimodal joint learning regression head to learn and predict scores for personality traits. We design a multi-task loss function to enhance the robustness and accuracy of personality prediction. Experimental results on two benchmark datasets demonstrate the effectiveness of our framework, which outperforms state-of-the-art methods. The code is available at https://github.com/RongquanWang/PPF-AGTNMTL.en_US
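The abstract describes a multi-task loss that couples a multimodal regression head with unimodal heads for trait-score prediction. The sketch below is a minimal, hypothetical illustration of such a joint loss in PyTorch; the class name, weighting scheme, and hyperparameters are assumptions for illustration and not the authors' implementation (see the repository linked above for the actual code).

```python
# Hypothetical sketch (not the authors' code): a multi-task regression loss that
# combines a fused multimodal head with per-modality unimodal heads.
import torch
import torch.nn as nn

class JointTraitLoss(nn.Module):
    """Weighted sum of multimodal and unimodal regression losses over trait scores."""
    def __init__(self, unimodal_weight: float = 0.5):
        super().__init__()
        self.unimodal_weight = unimodal_weight  # assumed hyperparameter
        self.mse = nn.MSELoss()

    def forward(self, fused_pred, unimodal_preds, target):
        # fused_pred:     (batch, num_traits) scores from the fused multimodal features
        # unimodal_preds: list of (batch, num_traits) scores, one per modality
        # target:         (batch, num_traits) ground-truth trait scores
        loss = self.mse(fused_pred, target)
        for pred in unimodal_preds:
            loss = loss + self.unimodal_weight * self.mse(pred, target)
        return loss

# Example usage (hypothetical tensor names):
# loss = JointTraitLoss()(fused_scores, [audio_scores, visual_scores, text_scores], labels)
```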
dc.description.number2
dc.description.sectionheadersFix it in Post: Image and Video Synthesis and Analysis
dc.description.seriesinformationComputer Graphics Forum
dc.description.volume44
dc.identifier.doi10.1111/cgf.70030
dc.identifier.issn1467-8659
dc.identifier.pages10 pages
dc.identifier.urihttps://doi.org/10.1111/cgf.70030
dc.identifier.urihttps://diglib.eg.org/handle/10.1111/cgf70030
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectCCS Concepts: Imaging/Video → Image/Video Processing; Interaction → Multimodal/Cross-modal Interaction; Methods/Applications → Artificial Intelligence/Machine Learning
dc.subjectImaging/Video → Image/Video Processing
dc.subjectInteraction → Multimodal/Cross-modal Interaction
dc.subjectMethods/Applications → Artificial Intelligence/Machine Learning
dc.titleA Multimodal Personality Prediction Framework based on Adaptive Graph Transformer Network and Multi-task Learningen_US
Files
Original bundle (2 files)
Name: cgf70030.pdf
Size: 1.42 MB
Format: Adobe Portable Document Format
Name: paper1128_1.pdf
Size: 127.84 KB
Format: Adobe Portable Document Format