Explore the power of Optical Character Recognition (OCR) in Computer Vision API for scanning images, extracting text regions, lines, and words. Enhance image analysis and text extraction effortlessly.
Table of Contents
Question
Application scans image and immediately returns the following information: regions, lines for each region, and words for each line of text.
What Computer vision API does the application use for image scan?
A. Read
B. NLP
C. OCR
D. Text Analytics
E. LUIS
Answer
C. OCR
Explanation
The Computer Vision API used for scanning images and providing information on regions, lines for each region, and words for each line of text is:
C. OCR (Optical Character Recognition): It’s designed to extract text information from images, breaking it down into regions, lines, and words for analysis and understanding.
OCR API is the part of Computer Vision services that extracts small amounts of text within an image. It is a synchronous service specially designed to provide an immediate result.
It returns the following information:
Regions on the image with text defined by bounding box coordinates
Lines of text in each region also with bounding box coordinates
Words in each line with bounding box coordinates
Option A is incorrect since Read API helps to “read” texts within the images of predominantly documents. It is an asynchronous service specially designed for the heavy on text documents with many distortions.
All other options are incorrect because they are not dealing with extracting text from the images.
Microsoft Azure AI Fundamentals AI-900 certification exam practice question and answer (Q&A) dump with detail explanation and reference available free, helpful to pass the Microsoft Azure AI Fundamentals AI-900 exam and earn Microsoft Azure AI Fundamentals AI-900 certification.