Datastores for logical files when training Mask R-CNN?

Alex on 21 Feb 2023
Commented: Clive Fox on 11 Apr 2024
When preparing training data for a Mask R-CNN, you need the following:
A 4-column cell array, where column 1 comes from an imageDatastore of the images, columns 2-3 come from a boxLabelDatastore with the bounding boxes and what they contain, and column 4 comes from an imageDatastore of logical arrays, each of size Height x Width x Number of labels.
The first 3 are no problem, but number 4 vexes me. Here it suggests using poly2mask to generate the logical arrays and a custom read function to put them into an imageDatastore, which to me feels like the ??? step in
1. Be poor. 2. ???, 3. Profit!
I've scoured the documentation and can find nothing about how to make MATLAB/imageDatastore handle logical arrays as images, or how to use it like a groundTruth object and get the file locations. I think I have an idea, but it seems so bulky and annoying that I hope there's a more straightforward way.
My question:
If I have a folder My_Data that contains only My_Sample_XXX.mat files, each a Height x Width x Number of labels logical array, how do I get this into a datastore, like in the links?
  1 Comment
Clive Fox on 21 Apr 2023
I am having the exact same problem. Does anyone have any suggestions?

Accepted Answer

Clive Fox on 21 Apr 2023
OK, I think I found something that works.
Don't save the logical mask as a .mat file; save it as a binary .png instead:
imwrite(mask_img,'mask_1.png'); where mask_img is the logical array
Then ...
mask_ds = imageDatastore('mask_1.png');
Seems to work so far.
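As a rough end-to-end sketch of the approach above, assuming a 2-D logical mask: the folder location and the example mask contents are placeholders invented for illustration, and the datastore is pointed at the folder rather than a single file so that it would pick up every mask written there.

```matlab
% Hypothetical output folder for the mask images (an assumption).
maskDir = tempname;
mkdir(maskDir);

% Example H-by-W logical mask standing in for real data.
mask_img = false(64, 48);
mask_img(10:30, 5:20) = true;

% imwrite stores a 2-D logical array as a 1-bit PNG.
imwrite(mask_img, fullfile(maskDir, 'mask_1.png'));

% Point the datastore at the folder so it lists every PNG mask in it.
mask_ds = imageDatastore(maskDir, 'FileExtensions', '.png');

% Reading the file back should give a logical array again.
mask_back = read(mask_ds);
```

Note that this route only covers one 2-D mask per file; a Height x Width x Number of labels stack does not map directly onto a single binary PNG.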
  1 Comment
Alex on 28 Apr 2023
This indeed seems to do the trick! Thank you so much!
For anyone in my exact situation, trying to get a maskRCNN to work:
  • I produced a ground truth (GT) with polygons
  • I used the user-created function MPolyToMask + imwrite (plus a function that appends the first polygon coordinate to the end of each polygon to "close the shape") to get the mask data and images
  • For rectangles, I used my polygon coordinates and regionprops(CC,'BoundingBox'); to get those values and saved them in a table
This can be used to build the datastores.
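The steps in the list above might look roughly like the following sketch. It swaps in the Image Processing Toolbox function poly2mask for the user-created MPolyToMask (poly2mask closes an open polygon automatically), and the polygon vertices, image size, and the label name 'object' are all made up for illustration.

```matlab
% Hypothetical polygon vertices in pixel coordinates (x = column, y = row).
xv = [10 40 40 10];
yv = [15 15 35 35];
imgHeight = 60;
imgWidth = 80;

% Rasterize the polygon into an imgHeight-by-imgWidth logical mask.
mask = poly2mask(xv, yv, imgHeight, imgWidth);

% Passing a uint8 label image makes regionprops treat all pixels of the
% mask as one region, even if the shape were split into several pieces.
stats = regionprops(uint8(mask), 'BoundingBox');
bbox = stats(1).BoundingBox;   % [x y width height]

% Store the box in a table with one column per label name.
% The label name 'object' is an assumption.
boxTable = table({bbox}, 'VariableNames', {'object'});
```

A table laid out like this, with one row per image, is the kind of input a boxLabelDatastore is built from.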
Onwards to new roadblocks!

More Answers (1)

Kevin Willeford on 22 Sep 2023
Hi everyone,
I'm stuck on this one too. I have M x N x numObjects logical arrays. I only have two object categories per image; therefore, when I try to save the arrays as a .png, it doesn't work.
So, how do I convert logical arrays with multiple objects per image into a datastore? The line saying "just create a custom read function" is baffling me.
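One possible reading of "custom read function", sketched under assumptions: keep the masks as .mat files and give imageDatastore a 'ReadFcn' that loads the stored array. The folder, the file name, and the variable name 'masks' are hypothetical and would need to match how the files were actually saved.

```matlab
% Hypothetical folder with one .mat file per image (an assumption).
dataDir = tempname;
mkdir(dataDir);

% Example H-by-W-by-numObjects logical stack standing in for real masks.
masks = false(32, 40, 2);
masks(5:10, 5:12, 1) = true;
masks(20:28, 15:30, 2) = true;
save(fullfile(dataDir, 'My_Sample_001.mat'), 'masks');

% Custom read function: load the file and return the stored array.
% The variable name 'masks' is an assumption about the file contents.
readMasks = @(filename) getfield(load(filename, 'masks'), 'masks');

% Tell the datastore to accept .mat files and to read them with readMasks.
mask_ds = imageDatastore(dataDir, ...
    'FileExtensions', '.mat', ...
    'ReadFcn', readMasks);

% Each read returns the full logical stack for one image.
stack = read(mask_ds);
```

Because the read function returns whatever is stored in the file, the number of mask channels is not limited the way it is for an image format such as PNG.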
  2 Comments
Bryce on 7 Feb 2024
Hey,
Did you figure it out?
Clive Fox on 11 Apr 2024
See my solution above.
