Python Mel Spectrogram

News

On Zero-Shot Multi-Speaker Text-to-Speech Using Deep Learning

A deep learning model for zero-shot multi-speaker TTS takes text and a speaker identity as input and generates the corresponding speech, without fine-tuning, for speakers not seen during training. The ...

Nature · 1 month ago

Audio Forensics and Forgery Detection - Nature

[1] Detection of audio copy-move-forgery with novel feature matching on Mel spectrogram. Expert Systems with Applications (2023). [2] Digital forensics approach for handling audio and video files.

