![]() ![]() Rather than using Rev, which I had been using to generate and correct transcripts the past few years, I decide to use Whisper and The Transcriptor to do the job. Then last week, Apple’s financial results came out. This apparently turned David on to Whisper and he’s since revived the site with Whisper-derived transcripts of seven podcasts, including Upgrade. (I also pointed Whisper at an episode of Total Party Kill and it made a remarkably good subtitle track ready for uploading to YouTube.) Who needs to remember all this stuff?Īlong the way I mentioned what I was doing to David Smith, who sent me his code for PodSearch so I could use it to generate my Upgrade archive. So instead, I wrote The Transcriptor, a Shortcut that lets me control-click on audio files and turn them into transcripts in a format of my choice. This was great, but the last thing I needed was to have to remember all the arcane command-line commands required to get the files in the right place. I downloaded and installed Gerganov’s version, downloaded the medium English model, and discovered that it could transcribe a podcast at rates up to 2x! ![]() And it didn’t turn “Thanks for listening to The Incomparable, I’ve been your host Jason Snell” into “Goodnight everybody for listening to be uncomfortable, I’ve been your Hostess and smell.”įortunately, a fellow named Georgi Gerganov made a C++-native port of Whisper that is easy to install and run on macOS and is optimized for Apple silicon. While not perfect, Whisper was staggeringly better than the 2017 transcript and really, much better than any other AI-driven transcription I’d tried recently. And thanks everybody for listening to The Incomparable. And Tony Sindelar, I think you were the king of the Wicker people. I’d like to thank my guests for being here and watching some Batman movies with me…. It’s like extension course for Batman University. This ends this edition of our check-ins with Batman that are affiliated. Goodnight everybody for listening to be uncomfortable I’ve been your Hostess and smell but really I Batman.Īll right, we’re gonna wrap it up. Here’s the state of the art of podcast transcription circa 2017:Īlright we’re going to wrap it up that this ends this edition of our red chickens with Batman that are affiliated with like extension cords for Batman University I’d like to think my gas for being here and watching some Batman movies with me… and told her I think you were the king of the Wicker people. I rapidly discovered that while the python implementation of Whisper would run on my Mac, it ran at about 0.5x speed-so a two-hour podcast would take four hours to transcribe. I thought that I might give Whisper a go in transcribing Upgrade-or at least recent episodes of Upgrade, maybe since episode 400-for my own reference. Whisper’s free, and you can run it on your own computer. Up until then, I’d been doing speech-to-text-most notably, for my transcripts of Apple results calls using various services (Trint, Rev) that charge by the minute. ![]() That project went by the wayside after a while, and I found myself getting frustrated during episodes of Upgrade that I couldn’t refer people back to specific episodes where we had already discussed a topic.Ībout the same time, I began reading about OpenAI Whisper, an automatic speech recognition system that “approaches human level robustness and accuracy” for converting the spoken word into written text. Automating podcast transcripts on my Mac with OpenAI WhisperĪ little section of Upgrade 444 in David Smith’s original Podsearch engine.Ī while ago, David Smith created a site called Podsearch, a search engine for a few of his favorite podcasts, including a couple of mine. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |