Submission deadline: 16 December 2024
Notification date: 01/04/2025
Publisher: Elsevier
Journal: Computer Speech & Language
Details:
Automatic speech recognition (ASR) has significantly progressed in the single-speaker scenario, owing to extensive training data, sophisticated deep learning architectures, and abundant computing resources. Building on this success, the research community is now tackling real-world multi-speaker speech recognition, where the number and nature of the sound sources are unknown and changing over time. In this scenario, refining core multi-speaker speech processing technologies such as speech separa