How to fine tune OpenAI’s Whisper speech AI for transcriptions

OpenAI Whisper is an automatic speech recognition (ASR) system. It’s designed to convert spoken language into text. Whisper was trained on a diverse range of internet audio, which includes various accents, environments, and languages. This training approach aims to enhance its accuracy and robustness across different speech contexts. To understand its significance, it’s important to … Read more

Real Gemini demo built using GPT4 Vision, Whisper and TTS

If like me you were a little disappointed to learn that the Google Gemini demonstration released earlier this month was more about clever editing rather than technology advancements. You will be pleased to know that perhaps we won’t have to wait too long before something similar is available to use. After seeing the Google Gemini … Read more