RGSC: Retrieve and Then Generate Image-Text Pairs from Semantic Concepts for Unsupervised Vision-Language Pre-Training
Authors: Zhaopan Xu, Wangbo Zhao, Sijie Ji et al.
Publication: ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Published: May 3, 2026
Source: Crossref