Discover how to classify features like virtual assistants, closed captions, and transcription into AI workload categories such as speech recognition and synthesis for better understanding.
Table of Contents
Question
Drag and drop the features into their appropriate AI workload categories. Use each feature only once.
Features:
- Virtual assistants
- Voice menus for phone systems
- Closed captions for videos
- Determining user input for further processing
- Automated note dictation
- Reading text messages or emails hands-free
- Transcription of phone calls or meetings
- Public announcements in various locations
Answer
Speech recognition:
- Closed captions for videos
- Determining user input for further processing
- Automated note dictation
- Transcription of phone calls or meetings
Speech synthesis:
- Virtual assistants
- Voice menus for phone systems
- Reading text messages or emails hands-free
- Public announcements in various locations
Explanation
Speech recognition involves converting spoken words to data. It transforms audio recordings or live microphone input into a format that computers can understand, often by converting them to text (transcription). The process of analysis has two components:
- Acoustic model: Analyzes the audio to identify patterns and map them to phonemes (basic units of sound).
- Language model: Uses statistical methods to predict the most likely sequence of words based on the identified phonemes and knowledge of language patterns.
Speech recognition applications include:
- Closed captions for videos
- Transcription of phone calls or meetings
- Automated note dictation
- Determining user input for further processing
Speech synthesis involves vocalizing data and converting text into spoken audio output. Its required information includes:
- Text to be spoken
- Voice selection (e.g., male, female, accent)
The process of speech synthesis has the following components:
- Tokenization: Breaking down text into individual words.
- Phonetic assignment: Assigning phonetic sounds to each word.
- Prosodic unit creation: Grouping words into phrases, clauses, or sentences for proper intonation.
- Phoneme conversion and audio generation: Converting phonemes to audio with a chosen voice, speaking rate, pitch, and volume.
Speech synthesis applications include:
- Spoken responses to user input (e.g., virtual assistants)
- Voice menus for phone systems
- Reading text messages or emails hands-free
- Public announcements in various locations
Microsoft Azure AI Fundamentals AI-900 certification exam practice question and answer (Q&A) dump with detail explanation and reference available free, helpful to pass the Microsoft Azure AI Fundamentals AI-900 exam and earn Microsoft Azure AI Fundamentals AI-900 certification.