Resources for the rectangular segmentation of an image (ML)

Click For Summary
The discussion focuses on the challenge of segmenting images into rectangular regions containing specific objects using machine learning techniques, particularly CNNs. The user is familiar with classifying images but seeks guidance on extracting rectangular areas, known as bounding boxes, that encapsulate classified objects. While they have found resources on non-rectangular segmentation, they struggle to locate efficient methods for rectangular extraction. OpenCV is mentioned as a valuable library for computer vision, but the user finds existing literature on rectangular segmentation unhelpful for their specific needs. The user expresses optimism about improving their search results with the correct terminology.
Avatrin
Messages
242
Reaction score
6
Hi

I see there are several articles about how CNN's are used to isolate and classify an object within an nxm rectangular region. While I know how to classify an image into one of p classes, I am not sure how to segment an image into rectangular regions which contain certain objects and, let's say, extract those regions.

I understand how to segment an image into non-rectangular regions by classifying each pixel by its and its neighbouring pixels values. However, I am not sure how to approach the problem of creating rectangular regions containing an object belonging to a class and extract that.

What are some good resources where I can learn to do this?
 
Technology news on Phys.org
Google can be your best friend: rectangular segmentation.
For specific code examples you can learn to use OpenCV, a very good library for computer vision and machine learning with a lot of resources and a big community of users.
 
Well, Google wasn't of much help. The first page is full of papers for segmentation using rectangles, but it doesn't exactly give me an efficient method to extract a rectangular region containing an object; The methods are used for something different entirely.

I am just looking for a method which is smarter than the one that seems the most obvious: Finding the top, bottom, left- and rightmost pixels classified as belonging to class A and creating a region based on that (a misclassified pixel would completely ruin the segmentation + I cannot find multiple objects belonging to the same class in an image).

However, I made some progress; The term for the rectangular region I was looking for is a bounding box. So, my Google searches have improved. I guess I'll find something soon enough.
 

Similar threads

  • · Replies 226 ·
8
Replies
226
Views
15K
  • · Replies 4 ·
Replies
4
Views
11K
  • · Replies 43 ·
2
Replies
43
Views
12K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
8
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 21 ·
Replies
21
Views
5K
Replies
58
Views
7K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 2 ·
Replies
2
Views
3K