To address these challenges, we propose a cross-modal feature enhancement network (CMFEN) designed to enhance modality-specific audio and visual features as well as overall AVE features. Specifically, ...