Speechmatics is already recognized in multiple markets where its speech-to-text real time transcription engine is being used in daily operations, including for closed-captioning in broadcast operations. The latest speech-to-text engine, Ursa, from the Cambridge-based company has shown to outperform the world's most accurate solutions from much larger companies. The company claims to be 25% more accurate than OpenAI’s Whisper and 38% more accurate than Google.
Speechmatics' real-time transcription and translation delivers the same level of accuracy as its pre-recorded (batch) service, and also provides a sliding scale to enable users to tailor the speed and/or accuracy to meet their needs. This all-in-one API for real-time transcription and translation provides a seamless solution that is both time and cost efficient, which will greatly benefit businesses, the media captioning industry and call centers. It enables businesses to reach a wider geographical audience without additional effort.
Speechmatics has doubled down on these capabilities to develop real-time translation, offering language pairs to and from English, including Bulgarian, German, Greek, Spanish, Hindi, Korean, and Vietnamese. The all-in-one API can also translate multiple languages in one request – for example, a single audio stream can provide real-time English transcription and translation to Japanese, French, Hindi, Mandarin, and Korean simultaneously, or yield results in Finnish, Latvian, Malay, Mandarin, and Turkish simultaneously.
Speechmatics’ real-time transcription and now translation delivers the same level of accuracy as its pre-recorded (batch) service, as well as providing a sliding scale to enable customers to tailor the speed (latency) and/or accuracy to meet their needs. The all-in-one API streamlines processes and speeds up workflows for businesses by combining real-time transcription and translation in one API.
Businesses can reach a wider geographical audience across multiple industries where translating in real-time has previously been a challenging and costly task when completed manually by humans. Particularly for the broadcast industry – valued at over $300 billion in the US alone in 2022 – generating quick and highly accurate translated speech in one API unlocks the ability to caption live stream content and news for viewers from around the world. Similarly, contact centers can scale operations to handle multiple languages using cost-effective automation technology and offer improved customer experiences in native languages.
"This is a landmark development for speech recognition technology, and we are proud to remain at the forefront of innovation, demonstrating the commitment to our mission to understand every voice. This new offering opens up a truly global market for our customers with almost instant translation from the spoken word. As demand from viewers in different regions increases for TV shows and broadcast, sports, events, podcasts, game streaming, YouTube and social media videos, the need for captioned videos in multiple languages has too. We are excited to launch this capability to our customers in the next few weeks and will be continuing to work towards adding even more languages and enabling the engine to translate between languages, so the default isn’t always English," says Damir Derd, Head of Sales Engineering at Speechmatics.
www.speechmatics.com