News

A deep learning model for zero-shot multi-speaker TTS takes text and a speaker identity as input and generates the corresponding speech for speakers not seen during training, without any fine-tuning. The ...