Object Detection and Segmentation

From the previous chapter we know that Image classification only really deals with the case when we have a single instance of a class in an input image. Even then it only provides a coarse output for us, letting us know what object is present in an image but not where it is. A more interesting scenario is when we want to find where all instances of a class, or even multiple different classes, are located in an input image.

To deal with this more challenging problem, object detection and segmentation come into the picture. These are areas of computer vision that until recently were very challenging. However, applying convolutional neural networks to these problems has gained a lot of attention in recent years and consequently, for the most part, these problems can now be considered solved. In this chapter we will see how CNNs have managed to tackle these difficult tasks so well.

The following image shows the differences between different solutions Segmentation, Localization, Detection and Instance Segmentation:

Before we start discussing object detection, we need to understand another important concept—localization. It is a key building block for improving classification and enabling detection. We will see that these three concepts are closely related to each other as we move from image classification to classification with localization, and finally to object detection.

In this chapter, we will learn about the following interesting topics:

  • Image classification with localization
  • Object detection
  • Semantic segmentation
  • Instance segmentation
  • How to build convolutional neural networks to perform all these tasks
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.189.189.67