AI-900: What Are the Key Features of Speech Recognition and Synthesis in AI Workloads?

Home » Exam » AI-900: What Are the Key Features of Speech Recognition and Synthesis in AI Workloads?

Discover how to classify features like virtual assistants, closed captions, and transcription into AI workload categories such as speech recognition and synthesis for better understanding.

Table of Contents

Question
Answer
Explanation

Question

Drag and drop the features into their appropriate AI workload categories. Use each feature only once.

Features:

Virtual assistants
Voice menus for phone systems
Closed captions for videos
Determining user input for further processing
Automated note dictation
Reading text messages or emails hands-free
Transcription of phone calls or meetings
Public announcements in various locations

Answer

Speech recognition:

Closed captions for videos
Determining user input for further processing
Automated note dictation
Transcription of phone calls or meetings

Speech synthesis:

Virtual assistants
Voice menus for phone systems
Reading text messages or emails hands-free
Public announcements in various locations

Explanation

Speech recognition involves converting spoken words to data. It transforms audio recordings or live microphone input into a format that computers can understand, often by converting them to text (transcription). The process of analysis has two components:

Acoustic model: Analyzes the audio to identify patterns and map them to phonemes (basic units of sound).
Language model: Uses statistical methods to predict the most likely sequence of words based on the identified phonemes and knowledge of language patterns.

Speech recognition applications include:

Closed captions for videos
Transcription of phone calls or meetings
Automated note dictation
Determining user input for further processing

Speech synthesis involves vocalizing data and converting text into spoken audio output. Its required information includes:

Text to be spoken
Voice selection (e.g., male, female, accent)

The process of speech synthesis has the following components:

Tokenization: Breaking down text into individual words.
Phonetic assignment: Assigning phonetic sounds to each word.
Prosodic unit creation: Grouping words into phrases, clauses, or sentences for proper intonation.
Phoneme conversion and audio generation: Converting phonemes to audio with a chosen voice, speaking rate, pitch, and volume.

Speech synthesis applications include:

Spoken responses to user input (e.g., virtual assistants)
Voice menus for phone systems
Reading text messages or emails hands-free
Public announcements in various locations

How to Categorize AI Workload Features: Speech Recognition vs. Speech Synthesis?

Microsoft Azure AI Fundamentals AI-900 certification exam practice question and answer (Q&A) dump with detail explanation and reference available free, helpful to pass the Microsoft Azure AI Fundamentals AI-900 exam and earn Microsoft Azure AI Fundamentals AI-900 certification.