Train an object detector

Object detection is a form of computer vision in which a model is trained to detect the presence and location of one or more classes of object in an image.

Photograph with the location and type of fruits detected.

There are two components to an object detection prediction:

  • The class label of each object detected in the image. For example, you might ascertain that an image contains an apple, an orange, and a banana.
  • The location of each object within the image, indicated as coordinates of a bounding box that encloses the object.
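The two components above can be pictured as a small data structure. The following is a minimal sketch, not the service's actual response type: the names `BoundingBox`, `Detection`, and `to_pixels` are hypothetical, and the sketch assumes that box coordinates are expressed as fractions of the image size (left, top, width, height in the range 0 to 1), so they must be scaled to obtain pixel positions.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class BoundingBox:
    """Hypothetical box type; values are assumed to be fractions of the image size."""
    left: float
    top: float
    width: float
    height: float

    def to_pixels(self, image_width: int, image_height: int) -> Tuple[int, int, int, int]:
        """Scale the normalized box to pixel units as (x, y, width, height)."""
        return (
            round(self.left * image_width),
            round(self.top * image_height),
            round(self.width * image_width),
            round(self.height * image_height),
        )


@dataclass(frozen=True)
class Detection:
    """One detected object: a class label plus the box that encloses it."""
    label: str
    box: BoundingBox


# Illustrative values only; these are not real model output.
detection = Detection(label="apple", box=BoundingBox(0.25, 0.5, 0.5, 0.25))
pixel_box = detection.box.to_pixels(image_width=640, image_height=480)
print(detection.label, pixel_box)
```

A prediction for an image with several pieces of fruit would then be a list of such `Detection` values, one per object found.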

To train an object detection model, you can use the Azure AI Custom Vision portal to upload and label images, and then train, evaluate, test, and publish the model. Alternatively, you can use the REST API or a language-specific SDK to write code that performs the same tasks.

Image labeling

You can use Azure AI Custom Vision to create projects for image classification or object detection. The most significant difference between training an image classification model and training an object detection model is how the images are labeled with tags. Image classification requires one or more tags that apply to the whole image, whereas object detection requires each label to consist of a tag and a region that defines the bounding box of an object in the image.
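To make the "tag plus region" idea concrete, here is a minimal sketch of turning a box drawn in pixels into a label. It uses only the Python standard library and does not call the Custom Vision SDK; `LabeledRegion` and `region_from_pixels` are hypothetical names, and the sketch assumes regions are stored as fractions of the image width and height.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LabeledRegion:
    """Hypothetical object detection label: a tag plus a normalized bounding box."""
    tag: str
    left: float
    top: float
    width: float
    height: float


def region_from_pixels(tag: str, x: int, y: int, w: int, h: int,
                       image_width: int, image_height: int) -> LabeledRegion:
    """Convert a pixel-space box into a label with coordinates in the range 0 to 1."""
    if image_width <= 0 or image_height <= 0:
        raise ValueError("image dimensions must be positive")
    if w <= 0 or h <= 0:
        raise ValueError("box width and height must be positive")
    if x < 0 or y < 0 or x + w > image_width or y + h > image_height:
        raise ValueError("box must lie entirely inside the image")
    return LabeledRegion(
        tag=tag,
        left=x / image_width,
        top=y / image_height,
        width=w / image_width,
        height=h / image_height,
    )


# An image classification label is just a set of tags for the whole image...
classification_tags = ["fruit"]

# ...whereas an object detection label pairs each tag with its own region.
apple = region_from_pixels("apple", x=160, y=120, w=320, h=240,
                           image_width=640, image_height=480)
print(apple)
```

Storing regions as fractions rather than pixels keeps a label valid if the image is later resized, which is one reason a labeling tool might prefer that representation.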

Labeling images in the Azure AI Custom Vision portal

The Azure AI Custom Vision portal provides a graphical interface that you can use to label your training images.

Screenshot of tagged images in the Azure AI Custom Vision portal.

The easiest option for labeling images for object detection is to use the interactive interface in the Azure AI Custom Vision portal. This interface automatically suggests regions that contain objects. You can assign tags to these regions, or adjust them by dragging the bounding box to enclose the object you want to label.

Additionally, after tagging an initial batch of images, you can train the model. Subsequent labeling of new images can then benefit from the smart labeler tool in the portal, which can suggest not only the regions but also the classes of object they contain.
