Exploring Confidence As A Reward to Advance LLMS Reasoning
Authors: He Du, Bowen Li, Chengxing Xie et al.
Publication: ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Published: May 3, 2026
Source: Crossref