Apple's New Transcription APIs Blow Past Whisper in Speed Tests

Apple's new speech-to-text transcription APIs in iOS 26 and macOS Tahoe are delivering dramatically faster speeds compared to rival tools, including OpenAI's Whisper, based on beta testing conducted by MacStories' John Voorhees.

apple record transcribe phone calls

Call recording and transcription in iOS 18.1

Apple uses its own native speech frameworks to power live transcription features in apps like Notes and Voice Memos, as well as phone call transcription in iOS 18.1. To improve efficiency in iOS 26 and macOS Tahoe, Apple has introduced a new SpeechAnalyzer class and SpeechTranscriber module that deal with similar requests.

According to Voorhees, the new models processed a 34-minute, 7GB video file in just 45 seconds using a command line tool called Yap (developed by Voorhees' son, Finn). That's a full 55% faster than MacWhisper's Large V3 Turbo model, which took 1 minute and 41 seconds for the same file.

Other Whisper-based tools performed even slower, with VidCap taking 1:55 and MacWhisper's Large V2 model requiring 3:55 to complete the same transcription task. Voorhees also reported no noticeable difference in transcription quality across models.

The speed advantage comes from Apple's on-device processing approach, which avoids the network overhead that typically slows cloud-based transcription services.

While the time difference might seem modest for individual files, Voorhees notes that the performance gain increases exponentially when processing multiple videos or longer content. For anyone generating subtitles or transcribing lectures regularly, the efficiency boost could save them hours.

The Speech framework components are available across iPhone, iPad, Mac, and Vision Pro platforms in the current beta releases. Voorhees expects Apple's transcription technology to eventually replace Whisper as the go-to solution for Mac transcription apps.

Related Roundups: iOS 26, iPadOS 26, macOS Tahoe
Related Forums: iOS 26, macOS Tahoe

Popular Stories

iOS 26

iOS 26.3 and iOS 26.4 Will Add These New Features to Your iPhone

Tuesday February 3, 2026 7:47 am PST by
We are still waiting for the iOS 26.3 Release Candidate to come out, so the first iOS 26.4 beta is likely still at least a week or two away. Following beta testing, iOS 26.4 will likely be released to the general public in March or April. Below, we have recapped known or rumored iOS 26.3 and iOS 26.4 features so far. iOS 26.3 iPhone to Android Transfer Tool iOS 26.3 makes it easier...
Apple Logo Black

Apple's Next Launch is 'Imminent'

Sunday February 1, 2026 12:31 pm PST by
The calendar has turned to February, and a new report indicates that Apple's next product launch is "imminent," in the form of new MacBook Pro models. "All signs point to an imminent launch of next-generation MacBook Pros that retain the current form factor but deliver faster chips," Bloomberg's Mark Gurman said on Sunday. "I'm told the new models — code-named J714 and J716 — are slated...
iOS 26 Home Feature

Apple Gives Final Warning to Home App Users

Tuesday February 3, 2026 8:55 am PST by
In 2022, Apple introduced a new Apple Home architecture that is "more reliable and efficient," and the deadline to upgrade and avoid issues is fast approaching. In an email this week, Apple gave customers a final reminder to upgrade their Home app by February 10, 2026. Apple says users who do not upgrade may experience issues with accessories and automations, or lose access to their smart...
Aston Martin CarPlay Ultra Screen

Apple's CarPlay Ultra to Expand to These Vehicle Brands Later This Year

Sunday February 1, 2026 10:08 am PST by
Last year, Apple launched CarPlay Ultra, the long-awaited next-generation version of its CarPlay software system for vehicles. Nearly nine months later, CarPlay Ultra is still limited to Aston Martin's latest luxury vehicles, but that should change fairly soon. In May 2025, Apple said many other vehicle brands planned to offer CarPlay Ultra, including Hyundai, Kia, and Genesis. In his Powe...
Apple Foldable Thumb

First Foldable iPhone Design Details Revealed

Monday February 2, 2026 7:19 am PST by
Apple's first foldable iPhone will feature relocated volume buttons, an all-black camera plateau, a smaller Dynamic Island, and more, according to design leaks from a known Weibo leaker. The user known as "Instant Digital" today claimed to share several key details about the design of the foldable iPhone: The volume buttons will be located on the top edge of the device, aligned to the...

Top Rated Comments

Big_D Avatar
8 months ago
Impressive, if it is accurate. What the story doesn't mention is how accurate each of those transcriptions was? Were they all identical? Did one or other have more mistakes? What is the accuracy percentage for each one, and how badly wrong were those mistakes?

I'm not trying to defend ChatGPT, just the speed is a single metric, which isn't very useful if the results are garbage. If the Apple one is faster and more accurate, that is incredible, faster and as accurate, impressive, faster but full of errors, not really that useful.

Hopefully it is the first one: it is faster and more accurate.
Score: 26 Votes (Like | Disagree)
neuropsychguy Avatar
8 months ago

Impressive, if it is accurate. What the story doesn't mention is how accurate each of those transcriptions was? Were they all identical? Did one or other have more mistakes? What is the accuracy percentage for each one, and how badly wrong were those mistakes?

I'm not trying to defend ChatGPT, just the speed is a single metric, which isn't very useful if the results are garbage. If the Apple one is faster and more accurate, that is incredible, faster and as accurate, impressive, faster but full of errors, not really that useful.

Hopefully it is the first one: it is faster and more accurate.
Nothing scientific, but in the MacStories post: "What stood out above all else was Yap’s speed. By harnessing SpeechAnalyzer and SpeechTranscriber on-device, the command line tool tore through the 7GB video file a full 55% faster than MacWhisper’s Large V3 Turbo model, with no noticeable difference in transcription quality."

It would be good to see more formal comparisons with data you suggested. Also, it would be good to know what computer John was using for the test.
Score: 17 Votes (Like | Disagree)
Big_D Avatar
8 months ago

Impressive, if it is accurate.
OK, I read the original article, they all had similar problems with the podcast name, AppStories, writing it as two words instead of CamelCasing it, which is acceptable, and they all had similar problems with people's names. But the Apple tools weren't any less accurate, despite being much faster.
Score: 15 Votes (Like | Disagree)
jmonster Avatar
8 months ago
Not mentioning accuracy at all implies it's not. Lots of models are faster than O3, but they're not better.

This is just silly getting sillier. Write something meaningful.

Whisper works in real time. Anything faster is irrelevant for iOS.

And saying it's because network overhead? When you can run OpenAI's whisper locally?....... mhm.

This is a blatant advertisement just regurgitating apples marketing bullets.
Score: 7 Votes (Like | Disagree)
klasma Avatar
8 months ago
Speech-to-text is a good use case for on-device processing, but yes, accuracy is an important question, not to mention (multi-)language support.
Score: 5 Votes (Like | Disagree)
Basic75 Avatar
8 months ago

While the time difference might seem modest for individual files, Voorhees notes that the performance gain increases exponentially when processing multiple videos or longer content.
That's not how it works. Recommend maths lesson.
Score: 4 Votes (Like | Disagree)