Microsoft Unveils Enhanced Azure Artificial Intelligence Voice Solutions During Build 2024 - Insights From ZDNet
Microsoft Unveils Enhanced Azure Artificial Intelligence Voice Solutions During Build 2024 - Insights From ZDNet
SOPA Images/Contributor/Getty
At its annual Build developer conference on Tuesday, Microsoft announced new features for its Azure AI Speech service that enhance voice-enabled, generative AI -powered app development.
Azure AI Speech is already being used for “a variety of use cases including call analytics (audio, text), medical transcription (audio, vision, text), captioning (audio/video, transcription, translation) and chatbots (audio, GPT),” Microsoft said in the release. The service has numerous capabilities to date, including converting audio into text captions for a broadcast or extracting the addresses mentioned on a phone call.
Also: Microsoft Build is this week - here’s what to expect, how to watch, and why I’m excited
One highlight of OpenAI’s GPT-4o reveal last week was an improved Voice Mode , which focused on the enhanced quality of the voice given to the program’s responses. Running to keep up, Microsoft announced it is making Personal Voice generally available.
The feature lets users “create and use their own AI voices for various applications, such as voice assistants, speech translation, and video content creation,” the release explained.
Another new capability is speech analytics, now available in preview. Accessible within Azure AI Studio, Adobe’s development environment, it is supposed to address what the company calls the “soft” analysis of phone calls or other audio sources. A soft element of a call could be semantic content, or how the caller seems to feel, which is presumably subtler than the content of the call itself.
Newsletters
ZDNET Tech Today
ZDNET’s Tech Today newsletter is a daily briefing of the newest, most talked about stories, five days a week.
Subscribe
Sentiment analysis could detect details like the “degree of empathy shown, commitment of the participants and strength of the arguments made or even predict possible conversation flows,” the release explains.
In a transcript of a call, for example, sections could be labelled with a rating of each speaker’s phrase as “positive,” “negative” or “neutral.” You can check out an interactive demo here .
To make quick analysis possible, Microsoft is also rolling out Fast Transcription, which the company claims is “a game changer for transcription at large” because “it can now transcribe 40x faster than real-time (real-time factor<1).”
According to the company, Fast Transcription can save call center agents “thousands of hours” by eliminating the need to manually take notes on a call, and doctors and nurses can analyze conversations with patients in seconds. “Media and content creators can analyze and extract insights from podcasts or interviews as soon as they complete,” the release continued.
Microsoft said the feature will be made available next month.
Example of a post-call analysis with a customer.
Microsoft
To meet the need for disseminating content globally, Microsoft also teased automatic video dubbing, which translates content, synthesizes a voice in the target language, and syncs it to the video of the speaker.
Additionally, the company announced updates to its multi-lingual translation feature, such as the ability to switch languages for captioning while a person is watching a broadcast.
Artificial Intelligence
Photoshop vs. Midjourney vs. DALL-E 3: Only one AI image generator passed my 5 tests
AI-powered ‘narrative attacks’ a growing threat: 3 defense strategies for business leaders
Copilot Pro vs. ChatGPT Plus: Which AI chatbot is worth your $20 a month?
How my 4 favorite AI tools help me get more done at work
- Photoshop vs. Midjourney vs. DALL-E 3: Only one AI image generator passed my 5 tests
- AI-powered ‘narrative attacks’ a growing threat: 3 defense strategies for business leaders
- Copilot Pro vs. ChatGPT Plus: Which AI chatbot is worth your $20 a month?
- How my 4 favorite AI tools help me get more done at work
Also read:
- [New] Top 5 macOS Safari Video Conversion Apps
- [Updated] 2024 Approved Maximize Impact Priority List of Highlight Tweaks
- [Updated] Strategies for Sourcing A-List Cinematography Experts
- 2024 Approved The Melody Meets Discovering Crossfade Magic
- 2024 Approved Top 6 Video Downloaders Preserve Your LinkedIn Media Masterfully
- 在線無限量的自由型PPM至JPG格式 - 使用Movavi转换器进行简单操作
- Apple MacBook Showdown: Comparing the Latest M3 Vs. M2 Models - Find Your Perfect Fit!
- How to Get and Use Pokemon Go Promo Codes On Vivo Y78 5G | Dr.fone
- How to Take and Share Nintendo Switch Screenshots
- In 2024, Turnback Artisan Hub
- Seamless Cell Combination Techniques: A Beginner's Tutorial on String Concat in Excel
- Ultimate Tutorial: PlayStation 5 Controller Setup for Microsoft Windows 10 Users
- Updated Reaction Video Mastery Top iOS and Android Apps
- Title: Microsoft Unveils Enhanced Azure Artificial Intelligence Voice Solutions During Build 2024 - Insights From ZDNet
- Author: Donald
- Created at : 2024-12-24 16:58:22
- Updated at : 2024-12-27 16:28:18
- Link: https://some-tips.techidaily.com/microsoft-unveils-enhanced-azure-artificial-intelligence-voice-solutions-during-build-2024-insights-from-zdnet/
- License: This work is licensed under CC BY-NC-SA 4.0.