SoundHound Inc., a provider of voice AI and conversational intelligence technologies, has announced the expansion of its Houndify Voice AI platform to 22 languages. Now, developers employing Houndify can add conversational intelligence to their products and services for users across the globe. The company also announced plans to add over 100 languages and variations as part of its commitment to provide the most comprehensive voice AI solution to brands and users everywhere. Companies seeking to create omnichannel voice experiences can now access the majority of the languages their customers speak, including Spanish, Portuguese, French, Indian-accented English, German, Dutch, Italian, Korean, Japanese, Mandarin, Russian, Polish, Swedish, Arabic, Turkish, Hebrew, and more.
Houndify’s voice recognition is the result of a combination of Speech-to-Meaning and Deep Meaning Understanding technologies. Advancements in Automatic Speech Recognition (ASR) and Natural Language Understanding (NLU) enable the Houndify Voice AI platform to track speech in real-time, understanding the meaning even before the user has finished speaking and delivering the smartest voice experience possible. Developers interested in exploring the Houndify platform can visit Houndify.com to register for a free account or visit our blog.
Qumu Corporation, a provider of cloud-based enterprise video technology, announced the development and release of its live captioning capabilities. The new release of Qumu Cloud provides artificial intelligence (AI)-based translation of voice to on-screen captions for video viewers. Developed by Qumu’s cloud engineering team, it is available to all Qumu customers and included in the license subscription.
Qumu’s new live captioning is designed from the ground up to enable reliable, scalable hosting of virtual events of any size. The feature is currently available in two languages, English and Japanese, with the plan to quickly expand to additional languages. Qumu’s on-demand, or asynchronous, captioning remains available in over 25 languages. Qumu’s live captioning is immediately available as part of the Qumu offering, without the need to upgrade or add on services. To maintain top standards, Qumu is working closely with customers, including a national pharmacy and COVID-19 vaccine administer, to improve the accuracy, viewing and user experience with multiple upgrades planned throughout the year.
Google announced the availability of the latest releases of their Document (Doc) AI platform, Lending DocAI and Procurement DocAI. Most companies are still manually entering data and reliant on guesswork to make sense of it all as the volume and variety of data explodes. Organizations are also leaving heaps of value on the table in the form of new and better customer experiences that can be unlocked with artificial intelligence (AI) applied to documents. The DocAI platform, based on Google’s AI expertise, bring powerful and useful solutions to these challenges. Under the hood are Google’s technologies:
- Computer vision (including OCR) and Natural Language Processing (NLP) that creates pre-trained models for high-value, high-volume documents.
- Google Knowledge Graph to validate and enhance the fields in your documents.
- Training and creation of your own custom document models.
- Human interaction with AI to ensure accuracy where needed.
The new specialized parsers for Lending and Procurement DocAI can be used alongside our existing AutoML Text & Document Classification and AutoML Document Extraction services. Next up is the general availability of Human-in-the-Loop AI, a new DocAI feature that will help companies achieve higher document processing accuracy with the assurance of human review.