Innovative Breakthroughs: Microsoft Upgrades Azure Speech AI at Build 2024, as Covered by ZDNet
Innovative Breakthroughs: Microsoft Upgrades Azure Speech AI at Build 2024, as Covered by ZDNet
SOPA Images/Contributor/Getty
At its annual Build developer conference on Tuesday, Microsoft announced new features for its Azure AI Speech service that enhance voice-enabled, generative AI -powered app development.
Azure AI Speech is already being used for “a variety of use cases including call analytics (audio, text), medical transcription (audio, vision, text), captioning (audio/video, transcription, translation) and chatbots (audio, GPT),” Microsoft said in the release. The service has numerous capabilities to date, including converting audio into text captions for a broadcast or extracting the addresses mentioned on a phone call.
Also: Microsoft Build is this week - here’s what to expect, how to watch, and why I’m excited
One highlight of OpenAI’s GPT-4o reveal last week was an improved Voice Mode , which focused on the enhanced quality of the voice given to the program’s responses. Running to keep up, Microsoft announced it is making Personal Voice generally available.
The feature lets users “create and use their own AI voices for various applications, such as voice assistants, speech translation, and video content creation,” the release explained.
Another new capability is speech analytics, now available in preview. Accessible within Azure AI Studio, Adobe’s development environment, it is supposed to address what the company calls the “soft” analysis of phone calls or other audio sources. A soft element of a call could be semantic content, or how the caller seems to feel, which is presumably subtler than the content of the call itself.
Newsletters
ZDNET Tech Today
ZDNET’s Tech Today newsletter is a daily briefing of the newest, most talked about stories, five days a week.
Subscribe
Sentiment analysis could detect details like the “degree of empathy shown, commitment of the participants and strength of the arguments made or even predict possible conversation flows,” the release explains.
In a transcript of a call, for example, sections could be labelled with a rating of each speaker’s phrase as “positive,” “negative” or “neutral.” You can check out an interactive demo here .
To make quick analysis possible, Microsoft is also rolling out Fast Transcription, which the company claims is “a game changer for transcription at large” because “it can now transcribe 40x faster than real-time (real-time factor<1).”
According to the company, Fast Transcription can save call center agents “thousands of hours” by eliminating the need to manually take notes on a call, and doctors and nurses can analyze conversations with patients in seconds. “Media and content creators can analyze and extract insights from podcasts or interviews as soon as they complete,” the release continued.
Microsoft said the feature will be made available next month.
Example of a post-call analysis with a customer.
Microsoft
To meet the need for disseminating content globally, Microsoft also teased automatic video dubbing, which translates content, synthesizes a voice in the target language, and syncs it to the video of the speaker.
Additionally, the company announced updates to its multi-lingual translation feature, such as the ability to switch languages for captioning while a person is watching a broadcast.
Artificial Intelligence
Photoshop vs. Midjourney vs. DALL-E 3: Only one AI image generator passed my 5 tests
AI-powered ‘narrative attacks’ a growing threat: 3 defense strategies for business leaders
Copilot Pro vs. ChatGPT Plus: Which AI chatbot is worth your $20 a month?
How my 4 favorite AI tools help me get more done at work
- Photoshop vs. Midjourney vs. DALL-E 3: Only one AI image generator passed my 5 tests
- AI-powered ‘narrative attacks’ a growing threat: 3 defense strategies for business leaders
- Copilot Pro vs. ChatGPT Plus: Which AI chatbot is worth your $20 a month?
- How my 4 favorite AI tools help me get more done at work
Also read:
- [New] Unlock Better Videos A 2.2 Enhancer User's Manual
- [Updated] 2024 Approved Proven YouTube Intra Creation Strategies, Free Edition
- [Updated] Brief Blueprints Sending iOS Videos & Images for 2024
- [Updated] Ultimate Hue Harmonizer Software
- Convert Images for Free with Movavi's JPG to JPEG Tool - Easy and Reliable Image Format Change
- Erfahren Sie Von Der WinxVideo-AI: Effizientes Screen-, Webcam- Und Sprachaufzeichnungsprogramm
- Forgot Locked Apple iPhone 12 Pro Max Password? Learn the Best Methods To Unlock
- How to Upgrade to the Newest Magiccard Rio Pro Driver on Windows 10, 8.1 & 7
- In 2024, Navigating Through Mixed Reality An Overview
- In 2024, Premier Free Tools for Easy JPG/GIF Transformation
- In 2024, The Professionals' Selection Top 5 Drones Ranked
- S Best Time-Lapse Video Editing Tools Free, Paid, and Everything in Between for 2024
- Synching Worlds Instagram to TikTok Essentials
- Upgrade Alert Navigate Changes with Confidence for 2024
- Title: Innovative Breakthroughs: Microsoft Upgrades Azure Speech AI at Build 2024, as Covered by ZDNet
- Author: Donald
- Created at : 2024-12-18 22:11:16
- Updated at : 2024-12-21 01:46:52
- Link: https://some-tips.techidaily.com/innovative-breakthroughs-microsoft-upgrades-azure-speech-ai-at-build-2024-as-covered-by-zdnet/
- License: This work is licensed under CC BY-NC-SA 4.0.