Comparing Speech to Text APIs for Bubble - Whisper AI & AssemblyAI
In this Bubble.io tutorial we discuss and compare two speech recognition (speech-to-text) APIs: OpenAI's recently released Whisper API and AssemblyAI. Although more expensive, the AssemblyAI API works better with Bubble in several key ways.
Unlock the power of speech-to-text: Discover which API is best for your Bubble app - Whisper or AssemblyAI?
Transcribe like a pro: Learn how to handle large audio files and get premium features for your Bubble transcription app.
Boost your Bubble app's capabilities: Explore the key differences between Whisper and AssemblyAI, including webhook integration and advanced audio intelligence.
Comparing Speech-to-Text APIs for Bubble: Whisper AI & AssemblyAI
There are some incredibly powerful speech-to-text APIs available that you can link right into your Bubble app, allowing your users to upload audio and video files and create a transcript from them. We've already got a video showing you how to use the Whisper API to convert speech to text, but I want this video to be a quick comparison with another service, AssemblyAI.
Price Comparison
The first key point is price. Measured per minute of audio, the Whisper API costs $0.006, whereas the AssemblyAI API costs $0.015. So AssemblyAI is two and a half times the price of Whisper.
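To make the price gap concrete, here is a minimal sketch that applies the two per-minute prices quoted above to a few audio lengths. The function and constant names are made up for illustration, and the prices are the ones mentioned in the video, so check current pricing before relying on them.

```python
# Per-minute prices as quoted in the video (assumption: still current).
WHISPER_USD_PER_MIN = 0.006
ASSEMBLYAI_USD_PER_MIN = 0.015


def transcription_cost(minutes: float, usd_per_minute: float) -> float:
    """Return the cost in US dollars of transcribing `minutes` of audio."""
    return round(minutes * usd_per_minute, 4)


def price_ratio() -> float:
    """How many times more AssemblyAI costs than Whisper per minute."""
    return ASSEMBLYAI_USD_PER_MIN / WHISPER_USD_PER_MIN


if __name__ == "__main__":
    for minutes in (10, 60, 600):
        whisper = transcription_cost(minutes, WHISPER_USD_PER_MIN)
        assembly = transcription_cost(minutes, ASSEMBLYAI_USD_PER_MIN)
        print(f"{minutes:>4} min  Whisper ${whisper:.2f}  AssemblyAI ${assembly:.2f}")
    print(f"AssemblyAI / Whisper = {price_ratio():.1f}x")
```

At these prices an hour of audio works out at about $0.36 with Whisper and $0.90 with AssemblyAI, so the difference only becomes significant at high volumes.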
Limitations of Whisper API
But Whisper has some limitations that, in a side project I'm building, have led me to use AssemblyAI. One restriction is that Whisper currently does not accept files larger than 25 megabytes. That's going to be particularly difficult if you want to transcribe videos, as an HD video will easily exceed 25 megabytes.
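If you do have somewhere to run a little code before sending a file to Whisper, the size restriction can be checked up front. This is only a sketch: the helper names are hypothetical, and it assumes that "25 megabytes" means 25 * 1024 * 1024 bytes and that a file of exactly that size is accepted.

```python
import os

# Assumption: the 25 MB limit is interpreted as 25 * 1024 * 1024 bytes.
WHISPER_MAX_BYTES = 25 * 1024 * 1024


def fits_whisper_limit(size_bytes: int) -> bool:
    """True if a file of this size is within the assumed Whisper upload limit."""
    return size_bytes <= WHISPER_MAX_BYTES


def file_fits_whisper_limit(path: str) -> bool:
    """Same check, reading the size of a file on disk."""
    return fits_whisper_limit(os.path.getsize(path))


if __name__ == "__main__":
    # A 10 MB audio clip fits; a 300 MB HD video does not.
    print(fits_whisper_limit(10 * 1024 * 1024))
    print(fits_whisper_limit(300 * 1024 * 1024))
```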
Challenges for No-Code Builders
Also, if you're a no-code builder like me, you're not going to have the technical skill to incorporate a library that does compression. You'd have to use another external service to compress your audio or video files, and that increases your overall cost. That said, the Whisper API accepts a good range of common formats, and I have to say it has the edge on speed.
Bubble API Connector and Response Time
When you send a request from the Bubble API connector to Whisper, you get a response back very quickly, and Bubble sits there waiting for that response. Let's park that for a moment, because that wait is actually one of the shortfalls of Whisper compared to AssemblyAI.
AssemblyAI Benefits
Now, if we look at AssemblyAI's pricing, they price per second, but as I say, that works out at $0.015 a minute. So yes, two and a half times the price of the OpenAI Whisper API. But here is where you get the benefits of AssemblyAI. One of them is that although it takes slightly longer to process, you can have the result sent to you, or at least be notified that it's ready, using a webhook.
Advantages of AssemblyAI for Bubble Apps
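For readers curious what a webhook-enabled request looks like outside the Bubble API connector, here is a rough sketch that builds (but does not send) the request. The endpoint, the `audio_url` and `webhook_url` fields, and the plain API key in the `Authorization` header are written from memory of AssemblyAI's documentation and should be treated as assumptions to verify. The function name and the example URLs are hypothetical.

```python
import json
import urllib.request

# Assumption: transcript submission endpoint as recalled from AssemblyAI's docs.
ASSEMBLYAI_TRANSCRIPT_URL = "https://api.assemblyai.com/v2/transcript"


def build_transcript_request(
    api_key: str, audio_url: str, webhook_url: str
) -> urllib.request.Request:
    """Build a POST request asking for a transcript with a webhook callback.

    The request is only constructed here, not sent.
    """
    payload = {
        "audio_url": audio_url,      # publicly reachable audio or video file
        "webhook_url": webhook_url,  # where the "ready" notification is sent
    }
    return urllib.request.Request(
        ASSEMBLYAI_TRANSCRIPT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Authorization": api_key, "Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    req = build_transcript_request(
        "YOUR_API_KEY",
        "https://example.com/interview.mp3",
        "https://example.com/transcript-ready",
    )
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

In a Bubble app, the webhook URL would typically point at a backend workflow exposed as an API endpoint, and the same fields would be entered in the API connector's JSON body.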
This means that the Bubble API connector is not waiting for a response to come back from AssemblyAI. So even if it takes five minutes because you've uploaded some huge audio file, your Bubble app can be notified that the transcript is ready, then receive and process the data. You're not left hanging, your users aren't waiting with the loading bar going across the top, and you're not restricted by the fact that, as of the time of recording, the Bubble API connector times out after somewhere between 50 and 60 seconds.
AssemblyAI Extra Features
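The receiving side of the webhook can be sketched as a small parser for the notification body. This assumes the notification is a JSON object with `transcript_id` and `status` fields and that `"completed"` is the success status, which is my recollection of AssemblyAI's webhook format rather than something shown in the video. The class and function names are hypothetical.

```python
import json
from typing import NamedTuple


class TranscriptNotification(NamedTuple):
    """The two fields assumed to be present in the webhook body."""

    transcript_id: str
    status: str

    @property
    def is_ready(self) -> bool:
        # Assumption: "completed" marks a finished transcript.
        return self.status == "completed"


def parse_webhook_body(raw_body: bytes) -> TranscriptNotification:
    """Parse the raw JSON body of a webhook notification."""
    data = json.loads(raw_body)
    return TranscriptNotification(str(data["transcript_id"]), str(data["status"]))


if __name__ == "__main__":
    note = parse_webhook_body(b'{"transcript_id": "abc123", "status": "completed"}')
    if note.is_ready:
        print(f"Transcript {note.transcript_id} is ready to fetch")
```

Because nothing is waiting on an open connection, the time the transcription takes no longer matters to the caller. The app just reacts when the notification arrives.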
Finally, AssemblyAI comes with loads of extra features baked in, which they group under audio intelligence. These are things which, if you've got a transcript back from Whisper, you could pass into ChatGPT (GPT-3.5 Turbo or GPT-4) and ask OpenAI's text generation service to create a summary or a sentiment analysis. But with AssemblyAI it's baked in.
Additional Capabilities of AssemblyAI
You can make an API request with an audio file and get back a text summary, et cetera. There are other features shown on this page, such as chapter detection, redaction of personal information, and topic detection. They also have the ability to identify speakers. There are a number of transcription services that we use at Planet No Code that label different speakers, and I believe that is possible with AssemblyAI too.
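As a rough illustration of what "baked in" means, the extra features are switched on with additional fields in the same transcription request. The field names below (`summarization`, `sentiment_analysis`, `auto_chapters`, `iab_categories`, `speaker_labels`) are my recollection of AssemblyAI's request options and are assumptions to check against their documentation. The friendly feature keys and the function name are hypothetical. Personal information redaction is left out because, as far as I recall, it needs additional policy settings, and some features may not be combinable in a single request.

```python
from typing import Dict, Iterable

# Hypothetical mapping from the features named in the video to request fields.
# Field names are assumptions; verify them in AssemblyAI's documentation.
FEATURE_FIELDS = {
    "summary": "summarization",
    "sentiment": "sentiment_analysis",
    "chapters": "auto_chapters",
    "topics": "iab_categories",
    "speakers": "speaker_labels",
}


def build_intelligence_payload(audio_url: str, features: Iterable[str]) -> Dict[str, object]:
    """Return a request body with the chosen audio intelligence features enabled."""
    payload: Dict[str, object] = {"audio_url": audio_url}
    for feature in features:
        if feature not in FEATURE_FIELDS:
            raise ValueError(f"Unknown feature: {feature}")
        payload[FEATURE_FIELDS[feature]] = True
    return payload


if __name__ == "__main__":
    body = build_intelligence_payload(
        "https://example.com/interview.mp3", ["summary", "speakers"]
    )
    print(body)
```

The point of the comparison is that this is one request to one service, instead of a Whisper call followed by a separate call to a text generation model.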
Conclusion
So there you have it. I just wanted to do a quick summary of the process we've been through: being incredibly excited and amazed by how accurate the Whisper API is, then coming across these issues that were restricting what we were trying to build, and then finding AssemblyAI. I can just say I'm immensely impressed with it. For the project we're building, it's well worth the extra cost to be able to work more leanly and with fewer errors in Bubble.