Skip to Content

DP-100: What method of the TextClassificationJob class should you use to configure the language for NLP text classification in Azure Machine Learning?

To set the language when training a multilingual NLP text classification model using automated ML in Azure Machine Learning with the Python SDK v2, use the set_featurization method of the TextClassificationJob class. This allows you to specify the language of your text data for proper text preprocessing and featurization.

Table of Contents

Question

You manage an Azure Machine Learning workspace.

You plan to train a natural language processing (NLP) text classification model in multiple languages by using Azure Machine Learning Python SDK v2.

You need to configure the language of the text classification job by using automated machine learning.

Which method of the TextClassificationJob class should you use?

A. set_data
B. set_featurization
C. set_sweep
D. set_training_parameters

Answer

B. set_featurization

Explanation

When training a natural language processing (NLP) text classification model in multiple languages using the Azure Machine Learning Python SDK v2 with automated machine learning, you need to configure the language of the text data being used. This ensures the text is properly preprocessed and featurized for the specific language.

To set the language, you should use the set_featurization method of the TextClassificationJob class. This method allows you to specify settings for the automated featurization and preprocessing steps applied to the text data before training the model.

The other options are incorrect:

  • set_data is used to specify the input training data, not configure language
  • set_sweep defines the hyperparameter tuning settings, unrelated to text language
  • set_training_parameters sets general settings like model algorithms to try, not language

So in summary, use TextClassificationJob.set_featurization to define the language of your multilingual text data when training an NLP text classifier using automated machine learning with the Azure ML SDK v2. This will ensure proper text processing and optimal model performance.

Microsoft DP-100 certification exam assessment practice question and answer (Q&A) dump including multiple choice questions (MCQ) and objective type questions, with detail explanation and reference available free, helpful to pass the Microsoft DP-100 exam and earn Microsoft DP-100 certification.