News

About. A multimedia text extraction application using Tkinter and Python. The application extracts text from images, audio, and video files, with additional features such as text-to-speech conversion ...
Text-to-Audio Conversion Use the Hugging Face pipeline and the suno/bark-small model to convert text into audio.; Audio-to-Text Conversion Use the Hugging Face whisper-medium model to transcribe audio ...
Above is the workflow of the google API for converting speech to text. It takes in the voice input from the user device and this is sent to some of the core cloud functions. These functions perform ...
Vision Voice is an innovative assistive technology project. This project aims to improve the accessibility and quality of life of visually impaired people. It utilizes Raspberry Pi 3 as its core ...