Resources for the rectangular segmentation of an image (ML)

Click For Summary
SUMMARY

The discussion focuses on the challenge of segmenting images into rectangular regions using Convolutional Neural Networks (CNNs) and extracting these regions effectively. The user seeks resources for implementing rectangular segmentation, specifically looking for methods to create bounding boxes around classified objects. OpenCV is highlighted as a valuable tool for computer vision tasks, although the user finds existing resources lacking in practical examples for their specific needs. The term "bounding box" is identified as crucial for refining search queries related to this topic.

PREREQUISITES
  • Understanding of Convolutional Neural Networks (CNNs)
  • Familiarity with image classification techniques
  • Knowledge of OpenCV for computer vision tasks
  • Basic concepts of image segmentation
NEXT STEPS
  • Explore OpenCV's documentation on bounding box extraction
  • Research techniques for improving object detection accuracy in CNNs
  • Learn about advanced segmentation methods, such as Mask R-CNN
  • Investigate the use of non-maximum suppression for refining bounding boxes
USEFUL FOR

Machine learning practitioners, computer vision developers, and anyone interested in improving image segmentation techniques using CNNs and OpenCV.

Avatrin
Messages
242
Reaction score
6
Hi

I see there are several articles about how CNN's are used to isolate and classify an object within an nxm rectangular region. While I know how to classify an image into one of p classes, I am not sure how to segment an image into rectangular regions which contain certain objects and, let's say, extract those regions.

I understand how to segment an image into non-rectangular regions by classifying each pixel by its and its neighbouring pixels values. However, I am not sure how to approach the problem of creating rectangular regions containing an object belonging to a class and extract that.

What are some good resources where I can learn to do this?
 
Technology news on Phys.org
Google can be your best friend: rectangular segmentation.
For specific code examples you can learn to use OpenCV, a very good library for computer vision and machine learning with a lot of resources and a big community of users.
 
Well, Google wasn't of much help. The first page is full of papers for segmentation using rectangles, but it doesn't exactly give me an efficient method to extract a rectangular region containing an object; The methods are used for something different entirely.

I am just looking for a method which is smarter than the one that seems the most obvious: Finding the top, bottom, left- and rightmost pixels classified as belonging to class A and creating a region based on that (a misclassified pixel would completely ruin the segmentation + I cannot find multiple objects belonging to the same class in an image).

However, I made some progress; The term for the rectangular region I was looking for is a bounding box. So, my Google searches have improved. I guess I'll find something soon enough.
 

Similar threads

  • · Replies 226 ·
8
Replies
226
Views
16K
  • · Replies 4 ·
Replies
4
Views
11K
  • · Replies 43 ·
2
Replies
43
Views
12K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
8
Views
5K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 21 ·
Replies
21
Views
5K
  • · Replies 58 ·
2
Replies
58
Views
8K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 2 ·
Replies
2
Views
3K