Resources for the rectangular segmentation of an image (ML)

Avatrin
Hi

I see there are several articles about how CNNs are used to isolate and classify an object within an n×m rectangular region. While I know how to classify an image into one of p classes, I am not sure how to segment an image into rectangular regions that contain certain objects and, let's say, extract those regions.

I understand how to segment an image into non-rectangular regions by classifying each pixel based on its own value and the values of its neighbouring pixels. However, I am not sure how to approach the problem of creating rectangular regions that each contain an object belonging to a class, and then extracting those regions.

What are some good resources where I can learn to do this?
 
Google can be your best friend: rectangular segmentation.
For specific code examples you can learn to use OpenCV, a very good library for computer vision and machine learning with a lot of resources and a big community of users.
 
Well, Google wasn't of much help. The first page is full of papers on segmentation using rectangles, but they don't give me an efficient method to extract a rectangular region containing an object; the methods are used for something entirely different.

I am just looking for a method that is smarter than the most obvious one: finding the topmost, bottommost, leftmost and rightmost pixels classified as belonging to class A and creating a region from those. With that approach, a single misclassified pixel would completely ruin the segmentation, and I cannot find multiple objects belonging to the same class in one image.
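To make the two problems with the obvious method concrete, here is a minimal sketch in plain Python. It assumes the per-pixel classifier has already produced a binary mask for class A, given as a list of rows of 0/1 values. The function names `naive_bbox` and `component_bboxes` and the `min_area` parameter are made up for illustration. The second function uses connected-component labelling with a size filter, which is one common alternative and not something proposed in this thread.

```python
from collections import deque


def naive_bbox(mask):
    """Box (top, left, bottom, right) spanning every class-A pixel, or None."""
    coords = [(r, c) for r, row in enumerate(mask)
              for c, value in enumerate(row) if value]
    if not coords:
        return None
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return (min(rows), min(cols), max(rows), max(cols))


def component_bboxes(mask, min_area=1):
    """One box per 4-connected blob of class-A pixels with >= min_area pixels."""
    height = len(mask)
    width = len(mask[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r0 in range(height):
        for c0 in range(width):
            if not mask[r0][c0] or seen[r0][c0]:
                continue
            # Breadth-first flood fill over one blob, tracking its extent.
            queue = deque([(r0, c0)])
            seen[r0][c0] = True
            top, left, bottom, right = r0, c0, r0, c0
            area = 0
            while queue:
                r, c = queue.popleft()
                area += 1
                top, bottom = min(top, r), max(bottom, r)
                left, right = min(left, c), max(right, c)
                for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                    if (0 <= nr < height and 0 <= nc < width
                            and mask[nr][nc] and not seen[nr][nc]):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            # Drop blobs that are too small to be a real object.
            if area >= min_area:
                boxes.append((top, left, bottom, right))
    return boxes


# Two 2x2 objects of class A plus one stray misclassified pixel (bottom left).
mask = [
    [1, 1, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 1],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 0, 0, 0],
    [1, 0, 0, 0, 0, 0],
]

print(naive_bbox(mask))                    # one box covering the whole mask
print(component_bboxes(mask, min_area=2))  # one box per object, stray pixel dropped
```

On this mask the naive box stretches over the whole image because of the stray pixel, while the component version returns a separate box for each object. It still merges objects that touch each other, so it is only a starting point. OpenCV offers the same kind of labelling through `cv2.connectedComponentsWithStats`.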

However, I made some progress: the term for the rectangular region I was looking for is a bounding box. So, my Google searches have improved. I guess I'll find something soon enough.
 