*** 2022 PH UPDATE ***:
Hi Product Hunt!
Over the last few years, we've been fortunate to see our Speech-to-Text APIs power applications at hundreds of startups, Fortune 500s, and hackathons.
Our APIs use advanced AI models to accurately transcribe audio/video files. Beyond Speech-to-Text, we have APIs to extract more data out of your audio - like summarization, detecting sensitive content like hate speech, sentiment analysis, etc.
Since we launched on PH, we've:
🎯 Improved transcription accuracy by over 50% - and are now the #1 rated Speech-to-Text API in accuracy on G2
💰 Raised over $35M from top investors like John and Patrick Collison (Stripe founders)
📃 Launched APIs for Summarization, Sentiment Analysis, and Content Moderation (on audio files)
📈 Now process over 2M audio files with our APIs every single day
If you have any questions about our API, or want to give it a try, you can sign up for free and get an API token -- you can also reach out to me directly at dylan[at]assemblyai[dot]com!
✌️
--------
Hey all, I’m the founder of AssemblyAI (https://www.assemblyai.com). We're in the current batch of Y Combinator (S17) and are building an API for customizable speech recognition. Developers and companies use our API for things like transcribing phone calls and building voice-powered smart devices. Unlike current speech recognition APIs, ours can be customized to more accurately recognize an unlimited number of industry-specific words or phrases unique to what you're building, with no training required. For example, you can recognize thousands of product or person names with our API. Or you can more accurately recognize commands and phrases that are common in, or custom to, your use case.
We've developed our own deep neural network speech recognition architecture, and aren't using any open source speech frameworks like Kaldi or Sphinx (just TensorFlow). Because of this, we're able to run things more affordably and pass those savings on to developers.
I used to work on projects that had speech recognition requirements before starting AssemblyAI, and saw how limiting, expensive, and hard to work with traditional speech recognition services and APIs were. We want to help developers and companies easily build products with speech recognition.
Would love feedback from the community on what we're building, and if you have any questions about deep learning or deep learning in production, ask away!
@youvegotfox hi! This seems really neat. Is this a potential competitor for specialized recognition services, such as those for medical niches? Could you, for example, have one user be more specialized in radiology lingo while another user is more specialized in cardiology? Does it learn over time from the user's edits and such? I work on an app that uses a 3rd party speech recognition SDK in the medical field, and I'm always looking for other options.
@xcadaverx Exactly! You can have one user for radiology lingo and another for cardiology. And when you query the API with audio data, the transcripts will be customized for each user. Right now there is no user feedback loop, but we do QA that improves the system over time with more use. Would love to chat with you about your app! If you want to email beta@assemblyai.com I will look for your email and follow up with you directly about trying out our API!
@youvegotfox We built VoiceCutTech.com and might look into using your service. How are you different from VoiceBase, and are you better at recognizing custom commands? What is or will be your pricing model? Interested to find out more.
@automateiq Hey Chris! If you want to shoot an email to beta@assemblyai.com and mention this post, I can look for it and reply to you with some more info. We're very accurate at recognizing custom commands. You can actually restrict the API to just supporting the commands you need to be able to support.
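For anyone wondering what "restricting to just the commands you need" can look like in practice, here is a minimal client-side sketch. It is not AssemblyAI's API or method: the command list, the function name `snap_to_command`, and the similarity cutoff are all made up for illustration, and it simply snaps a raw transcript to the closest allowed command using Python's standard `difflib`.

```python
import difflib

# Hypothetical command set for a voice-controlled device (illustration only).
ALLOWED_COMMANDS = ["lights on", "lights off", "play music", "stop music"]


def snap_to_command(transcript, commands=ALLOWED_COMMANDS, cutoff=0.6):
    """Return the allowed command most similar to the transcript, or None.

    This is a local post-processing step on a transcript string; it does not
    call any speech recognition service.
    """
    normalized = transcript.lower().strip()
    # get_close_matches returns up to n candidates whose similarity ratio
    # is at least `cutoff`, best match first.
    matches = difflib.get_close_matches(normalized, commands, n=1, cutoff=cutoff)
    return matches[0] if matches else None


if __name__ == "__main__":
    for heard in ["Lights On ", "turn the lights on", "what is the weather tomorrow"]:
        print(repr(heard), "->", snap_to_command(heard))
```

A service that restricts recognition on the server side would apply the constraint during decoding rather than after the fact, so treat this only as a way to picture the idea.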
Supporting multiple languages with multiple dialects is the key ... Google/Amazon speech-to-text works like a charm and even supports local languages of India, which is the key...
Congratulations on the launch! @makers AssemblyAI only works with English for now, is that correct? Do you have plans to add other languages?
Used this already and had quite a wonderful user experience. I would rate AssemblyAI 10/10. It helps me read the right content for a construction company in Karachi.
@makers This sounds amazing! Thank you for the detailed video. One of the greatest challenges I face while transcribing audio is punctuation. Would this make punctuation easier and more accurate?