Articles by Tag #whisper

Browse our collection of articles on various topics related to IT technologies. Dive in and explore something new!

Building a YouTube Video Search App with Flask, Whisper, and RAG

Building a YouTube Video Search App with Flask, Whisper, and RAG Ever wanted to search for...

Learn More 1 0Oct 9

🧠 Real-Time Smart Speech Assistant with Python, Whisper & LLMs

The future of human-computer interaction isn’t just about recognizing words, it’s about understanding...

Learn More 0 0Oct 10

Running Multi-Agent AI Workflows on Edge Hardware: A Technical Deep Dive

The Challenge: Moving Beyond Cloud Dependencies (And My Hatred of Making Slides) Let me be...

Learn More 5 4Sep 18

Why Language Tech Matters: Developing AI Tools for Small Languages

In a world where artificial intelligence is transforming how we communicate, the survival of small...

Learn More 0 0Oct 25

Whisper + Gradio on Colab: Speech-to-Text in Minutes

What you’ll learn? How to transcribe speech into text using OpenAI Whisper How to build...

Learn More 0 0Oct 28

Whisper Menu Bar

A minimal, clean speech-to-text menu bar application for macOS using OpenAI's Whisper

Learn More 0 0Oct 31

iPhone 的語音辨識功能:語音備忘錄,自動標點分段

本篇要解決的問題 之前寫過幾篇,是用 OpenAI 的 Whisper API 來語音辨識的功能,都是免費可以使用的: Google Colab + Faster...

Learn More 0 2May 22

[Boost]

Building a YouTube Video Search App with Flask, Whisper, and...

Learn More 0 0Oct 9

WhatsApp + MCP: automatic audio transcription

Build a local MCP server that surfaces WhatsApp voice notes and transcribes them with Whisper.

Learn More 2 0Sep 29

Building an AI Conversation Practice App: Part 2 - Backend Speech-to-Text Processing with OpenAI Whisper

This is the second post in a series documenting the technical implementation of a browser-based...

Learn More 0 0Sep 21

Ditch the Cloud: The Open Source AI Tool That Transcribes Interviews Locally

Quick Summary: 📝 noScribe is a free and open-source AI-based software for automated audio...

Learn More 0 0Nov 24

These 5 "Best Practices" Are Stopping You From Getting Hired

Yaar, we need to talk. Some of the advice seniors give to beginners is totally wrong for 2025. I've...

Learn More 17 1Jul 20

Code Faster in Cursor: A Pragmatic Guide to Voice Prompting

Learn a pragmatic technique using Whisper to dictate detailed prompts in Cursor AI, boosting productivity by overcoming the keyboard bottleneck.

Learn More 16 1Apr 3

Adapter l’intelligence artificielle au créole

1 – Le problème Whisper a créé un véritable « boom » dans la communauté IA en 2022 : ce...

Learn More 5 1Feb 17

Transform Your Speech into Text with the Power of OpenAI and useWhisper

This article was generated using ChatGPT from README.md Demo Are you tired of...

Learn More 3 1Jun 4

How to make multilingual videos in 3 minutes

Introduction Multilingual dubbed content is a powerful way to captivate a global audience....

Learn More 1 0Apr 9

Высококачественная транскрипция зашумлённых двухканальных телефонных звонков

Высококачественная транскрипция зашумлённых двухканальных телефонных звонков В одном из...

Learn More 2 0May 2

Why Maryrose Whittaker believes integrating Whisper in the Vodia PBX is a game changer

Maryrose Whittaker, a renowned expert in growth strategy and onboarding operations at Cove Central...

Learn More 0 0Mar 19

Transcrição de Audio Distribuído em OCI com Kubernetes + Whisper

Quando falamos de processamento de arquivos em larga escala, enfrentamos desafios bem conhecidos,...

Learn More 1 0Mar 19

🚀 Getting Started with Deepgram Nova-3 for Real-Time Speech-to-Text

Introduction Deepgram’s Nova-3 is the latest evolution in speech-to-text AI, offering...

Learn More 3 0Feb 25

Why I built Typist - lightning-fast AI audio transcription app

Let's talk about things. There are things you need. There are things you don't need. There are things...

Learn More 0 0Aug 12

Working with Claude and Gemini to fine tune whisper model. The difficult part has been formatting the data. I've started with label-studio.

A post by Lyam Hunt

Learn More 0 0Apr 15

High-Quality Transcription of Noisy Dual-Channel Phone Calls

In one of our recent projects, we needed to transcribe phone calls with extremely poor audio quality....

Learn More 2 0May 1

How AI Tells the Difference Between “Ate” and “Eight” in Speech Recognition

Imagine someone says a word out loud: “eight” or “ate”. To a human ear, they sound almost identical,...

Learn More 1 0May 20

OmniDictate: Free, Local, Real-Time AI Dictation for Windows

In a world dominated by keyboards, the simple act of getting thoughts onto the screen can feel like a...

Learn More 4 2Apr 7

How Machines Hear and Understand Us

Ever wondered how machines "listen" to us? Not just the familiar “Hey Siri” or “OK Google,” but real...

Learn More 0 0Dec 16 '24

OZI - Subtitles Generator with AI

🎬 It all started with a simple wish: watching The Osbournes with subtitles in Portuguese. 😂 Then I...

Learn More 0 0Aug 6

Whisper Speech Recognition on Mac M4: Performance Analysis and Benchmarks

I recently completed a comprehensive analysis of OpenAI's Whisper speech recognition system on Mac M4...

Learn More 1 0Jul 1

Wisper, ffmpeg을 활용한 비디오 자막 자동 생성

유튜브나 넷플릭스 영상을 보면서 "자막을 자동으로 만들어주는 프로그램이 있으면 편하겠다!"라고 생각해 본 적이 있지 않은가? 아무래도 새로운 기술에 대한 영상은 영어로 된 것이...

Learn More 0 0Dec 14 '24

OpenAI.fm! OpenAI's Newest Text-To-Speech Model - Proje Defteri

Hello friends! Today I'll be talking about OpenAI's newly released next-generation audio models....

Learn More 0 0Aug 3