AI-900: Choosing the Right Computer Vision Technique for a Mobile Expense Scanning App


Discover the ideal computer vision technique for developing a mobile app that enables employees to scan and store their expenses while traveling. Explore the advantages and applications of semantic segmentation, image classification, object detection, and optical character recognition (OCR) in this informative guide.

Question

You need to develop a mobile app for employees to scan and store their expenses while travelling.
Which type of computer vision should you use?

A. semantic segmentation
B. image classification
C. object detection
D. optical character recognition (OCR)

Answer

D. optical character recognition (OCR)

Explanation

The correct answer is D. optical character recognition (OCR).

OCR is the computer vision technique that extracts text from images such as scanned documents, receipts, and invoices. That is what an expense app needs: it recognizes the text on a photographed receipt and converts it into digital text that can be stored and processed.
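As a rough illustration of the "stored and processed" step, the sketch below turns text lines that an OCR engine has already recognized into a simple expense record. The `Expense` class, the `expense_from_lines` helper, and the sample receipt are all hypothetical, and the parsing rules (first line is the merchant, the amount follows the word "total") are simplifying assumptions rather than how any particular product works.

```python
import re
from dataclasses import dataclass
from decimal import Decimal
from typing import List, Optional


@dataclass
class Expense:
    """Hypothetical record an expense app might store for one receipt."""
    merchant: str
    total: Optional[Decimal]
    raw_text: str


# Matches the standalone word "total" followed by an amount such as 11.61 or 11,61.
# The leading \b keeps "Subtotal" from matching. Thousands separators are not handled.
_TOTAL = re.compile(r"\btotal\b\D*(\d+(?:[.,]\d{2})?)", re.IGNORECASE)


def expense_from_lines(lines: List[str]) -> Expense:
    """Build an Expense from OCR text lines (assumption: the first line is the merchant)."""
    merchant = lines[0].strip() if lines else ""
    total = None
    for line in lines:
        match = _TOTAL.search(line)
        if match:
            # Keep the last "total" line found; normalize a decimal comma to a point.
            total = Decimal(match.group(1).replace(",", "."))
    return Expense(merchant=merchant, total=total, raw_text="\n".join(lines))


# Sample OCR output for an invented receipt.
SAMPLE_LINES = [
    "Contoso Cafe",
    "Coffee 3.50",
    "Sandwich 7.25",
    "Subtotal 10.75",
    "TOTAL $11.61",
]
```

Calling `expense_from_lines(SAMPLE_LINES)` yields a record whose merchant is the first line and whose total comes from the "TOTAL" line. A real app would need sturdier parsing for dates, currencies, and unusual receipt layouts.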

Semantic segmentation, image classification, and object detection are other types of computer vision, but none of them suits this scenario. Semantic segmentation assigns a label to every pixel in an image, which outlines the exact shape of each object or region. Image classification assigns a single label to an entire image, such as the category of the object or scene it shows. Object detection locates and labels specific objects within an image, for example by drawing bounding boxes around faces or cars. These techniques are useful in applications such as autonomous driving, face recognition, and medical imaging, but none of them reads the text in an image, which is what an expense record requires.
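The difference between these outputs can be seen on a toy example. The sketch below uses an invented 4x6 grid of per-pixel labels as a stand-in for a segmentation mask, then derives a detection-style bounding box and a classification-style label from it. The data and the helper names `bounding_box` and `image_label` are hypothetical. Real models predict each of these outputs directly from pixels, so this only illustrates what the results look like.

```python
from collections import Counter
from typing import List, Optional, Tuple

# Segmentation-style output: one class label per pixel (invented data).
MASK = [
    ["bg", "bg",      "bg",      "bg",      "bg", "bg"],
    ["bg", "receipt", "receipt", "receipt", "bg", "bg"],
    ["bg", "receipt", "receipt", "receipt", "bg", "bg"],
    ["bg", "bg",      "bg",      "bg",      "bg", "bg"],
]


def bounding_box(mask: List[List[str]], label: str) -> Optional[Tuple[int, int, int, int]]:
    """Detection-style output: (left, top, right, bottom) around all pixels with `label`."""
    coords = [
        (x, y)
        for y, row in enumerate(mask)
        for x, value in enumerate(row)
        if value == label
    ]
    if not coords:
        return None
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    return (min(xs), min(ys), max(xs), max(ys))


def image_label(mask: List[List[str]], ignore: str = "bg") -> str:
    """Classification-style output: the single most common non-background label."""
    counts = Counter(value for row in mask for value in row if value != ignore)
    return counts.most_common(1)[0][0] if counts else ignore
```

All three outputs say that a receipt is present and where it is, but none of them contains the merchant name or the amount printed on it. Only OCR produces that.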

Azure’s Computer Vision API includes Optical Character Recognition (OCR) capabilities that extract printed or handwritten text from images. It works on photos, such as license plates or containers with serial numbers, as well as on documents such as invoices, bills, financial reports, and articles.
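A minimal sketch of calling this OCR capability over REST follows. It assumes the v3.2 Read operation of the Computer Vision API: an asynchronous POST to `/vision/v3.2/read/analyze`, then polling the URL returned in the `Operation-Location` header. Other API versions use different paths and response shapes, so check the current documentation before relying on this. The function names and the polling defaults are illustrative choices.

```python
import json
import time
import urllib.request
from typing import List

# Assumption: Read API v3.2 path; newer API versions differ.
READ_PATH = "/vision/v3.2/read/analyze"


def extract_lines(result: dict) -> List[str]:
    """Collect recognized text lines from a v3.2 Read result payload."""
    pages = result.get("analyzeResult", {}).get("readResults", [])
    return [line["text"] for page in pages for line in page.get("lines", [])]


def read_text(endpoint: str, key: str, image_bytes: bytes,
              poll_seconds: float = 1.0, max_polls: int = 30) -> List[str]:
    """Submit an image for OCR and poll until the text is ready."""
    submit = urllib.request.Request(
        endpoint.rstrip("/") + READ_PATH,
        data=image_bytes,
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/octet-stream",
        },
        method="POST",
    )
    # The service accepts the job and returns the URL where the result will appear.
    with urllib.request.urlopen(submit) as response:
        operation_url = response.headers["Operation-Location"]

    poll = urllib.request.Request(
        operation_url, headers={"Ocp-Apim-Subscription-Key": key}
    )
    for _ in range(max_polls):
        with urllib.request.urlopen(poll) as response:
            result = json.load(response)
        status = result.get("status")
        if status == "succeeded":
            return extract_lines(result)
        if status == "failed":
            raise RuntimeError("Read operation failed")
        time.sleep(poll_seconds)
    raise TimeoutError("Read operation did not finish in time")


# Invented payload in the v3.2 result shape, for trying extract_lines offline.
SAMPLE_RESULT = {
    "status": "succeeded",
    "analyzeResult": {
        "readResults": [
            {"page": 1, "lines": [{"text": "Contoso Cafe"}, {"text": "TOTAL $11.61"}]}
        ]
    },
}
```

With a real resource, `read_text(endpoint, key, image_bytes)` would take the resource's endpoint URL, one of its keys, and the raw bytes of a receipt photo. `extract_lines(SAMPLE_RESULT)` shows the parsing step without any network access.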

References

Microsoft Docs > Azure > Cognitive Services > Computer Vision > What is Optical character recognition?

Microsoft Azure AI Fundamentals AI-900 certification exam practice question and answer (Q&A) dump with detailed explanations and references, available free to help you pass the AI-900 exam and earn the Microsoft Azure AI Fundamentals certification.
