End-to-End Story Visualization Framework with Penalty-Based Evaluation using Vision-Language Models
Authors: Lizheng Zu, Yaoqing Jin, Siyi Cao et al.
Publication: ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Published: May 3, 2026
Source: Crossref