Sensory Releases TrulyNatural Embedded Speech-to-Text 2.0

August 9 2024, 00:55
Sensory announced the availability of its second-generation TrulyNatural Embedded Speech-to-Text (STT) solution, which the company believes delivers the highest accuracy to size ratio compared to any other Speech-to-Text technology. According to the voice AI and speech recognition technology pioneer, in benchmark comparisons, TrulyNatural STT is an order of magnitude smaller yet more accurate than industry leading and public domain speech engines. 
 

TrulyNatural STT and Truly Natural "lite" are Sensory's flagship speech-to-text technologies, renowned for delivering high accuracy, real-time responsive, and robust performance embedded voice user interfaces, without relying on cloud connectivity. This makes it ideal for applications and products where network availability is unreliable or where data privacy is a paramount concern. In this 2.0 update, Sensory offers a new SDK with smaller models and more accuracy, expanded language coverage, and enhanced compatibility for Windows platforms.

In response to growing demand, Sensory has extended the compatibility of TrulyNatural STT to Windows-based platforms. Windows compatibility opens new possibilities for developers and integrators working on a variety of applications, including consumer electronics, medical, industrial, automotive, enterprise, government and more.

Sensory's TrulyNatural Embedded Speech-to-Text (STT) 2.0 includes advanced acoustic and language models, providing even greater accuracy and faster response times. These models utilize state-of-the-art transformer-like architectures to deliver improved word error rates (WER) and robustness in noisy environments.

TrulyNatural STT now offers even more flexibility in deployment, supporting a wider range of hardware configurations. Whether running on GPUs, high-performance multicore CPUs, or utilizing accelerators like Arm Neon and Helium technologies, TrulyNatural is optimized to deliver the best possible voice user experience.
 
Sensory's TrulyNatural Speech-to-Text (STT) engine is up to 8 times smaller than competitor engines while delivering superior accuracy. Available through the Sensory TNL SDK, TrulyNatural supports models for over 40 languages, with sizes ranging from 20MB to 200MB. By integrating Natural Language Understanding (NLU) models and action lookup tables, developers can create a complete voice assistant in under 35MB.

Combining Sensory's wake word technology, with its domain specific Small Language Models (SLMs) means that Sensory can provide a domain specific voice assistant in as little as 35MB! And Sensory's wake word and STT can be also connected to a Large Language Model (LLM) running on device or in the cloud. TrulyNatural STT technology is able to operate independently of the cloud, ensuring user and data privacy. Embedded models eliminate latency issues associated with cloud-based solutions, providing a seamless and responsive user experience that is always available and free from hallucinations.

TrulyNatural STT is also highly customizable, supporting multiple languages, dialects, and domain-specific vocabulary. It now supports nearly 40 languages and regional variations including: Arabic, Danish, Dutch, English (US, UK, Kids), Finnish, French, German, Greek, Hebrew, Hindi, Indonesian, Italian, Japanese, Korean, Malay, Mandarin, Norwegian, Polish, Portuguese, Romanian, Russian, Spanish, Swedish, Turkish, and Ukrainian.

Sensory has been at the forefront of voice AI innovation, delivering cutting-edge speech recognition and biometric solutions for over 20 years. Sensory's technologies are integrated into millions of devices worldwide, powering applications that range from mobile phones and consumer electronics to automotive and industrial systems.
www.sensory.com
Page description
About Joao Martins
Since 2013, Joao Martins leads audioXpress as editor-in-chief of the US-based magazine and website, the leading audio electronics, audio product development and design publication, working also as international editor for Voice Coil, the leading periodical for... Read more

related items