GROUPS: Data Science, Image Processing, Wolfram Language, Machine Learning

Automatic preprocessing of image data using UMAP in DimensionReduce

Posted 2 years ago

I have been getting really nice clustering results using the UMAP method in `DimensionReduce` (Mathematica 13) where the data is a list of images, for example:

`rep = DimensionReduce[imagelist, 3, Method -> "UMAP", TargetDevice -> "GPU"];`

Without specifying any other options (like `FeatureExtractor`), does the above command apply any kind of automatic preprocessing to the image data before applying UMAP?

The reason I ask is that when I run the Python implementation of UMAP on the same set of images (with the RGB values converted to NumPy arrays and no other preprocessing), I get results that are consistently much worse. So it seems that the Mathematica algorithm is doing something useful to the images under the hood. Would it be possible to find out the details?

Thanks, Mike

POSTED BY: Michael Hinczewski
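
One way to probe this from the Wolfram side is to run the same reduction twice, once with defaults and once with feature extraction restricted to raw pixel values, and compare the embeddings. The sketch below is only an illustration: `imagelist` is filled with hypothetical random images, `TargetDevice` is left at its default, and it assumes that `FeatureExtractor -> "PixelVector"` makes the reducer work on plain pixel vectors, which is the closest analogue to feeding NumPy arrays to the Python UMAP. Nothing in this thread confirms what the default extractor is.

```wolfram
(* Hypothetical stand-in data; replace with your own list of images *)
imagelist = Table[RandomImage[1, {32, 32}, ColorSpace -> "RGB"], 50];

(* Default call: whatever automatic feature extraction exists is left in place *)
repDefault = DimensionReduce[imagelist, 3, Method -> "UMAP"];

(* Assumption: "PixelVector" restricts the features to raw pixel values *)
repPixels = DimensionReduce[imagelist, 3, Method -> "UMAP",
   FeatureExtractor -> "PixelVector"];

(* Show the two 3D embeddings side by side for a visual comparison *)
{ListPointPlot3D[repDefault], ListPointPlot3D[repPixels]}
```

If the raw-pixel embedding on real images resembles the weaker Python result, that would suggest the default feature extraction accounts for the difference.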

2 Replies

Posted 2 years ago

Thank you, that's helpful!

POSTED BY: Michael Hinczewski

Posted 2 years ago

Hi Michael, you can explore the internals of the `DimensionReducerFunction` to check the preprocessor. Ideally you should be able to do

`Information[_DimensionReducerFunction, "FeatureExtractor"]`

but we have not hooked it up there yet. In the meantime you can check what the internal processor is doing (hover over each processor to see more info)

`reducer[[1, "Processor"]]`

and apply it to an image

`reducer[[1, "Processor"]] @* reducer[[1, "Preprocessor"]] @ RandomImage[]`

Remember to use `DimensionReduction` instead of `DimensionReduce` in order to get the function and not the reduced data directly.

POSTED BY: Giulio Alessandrini
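
Put together, the steps in the reply above might look like the following sketch. The variable `reducer` is not defined in the reply, so here it is assumed to be the result of `DimensionReduction`; `imagelist` is a hypothetical stand-in made of random images. The `reducer[[1, ...]]` part access is the internal, undocumented route quoted in the reply, with its key names copied as written, and it may differ between versions.

```wolfram
(* Hypothetical stand-in data; replace with your own list of images *)
imagelist = Table[RandomImage[1, {32, 32}, ColorSpace -> "RGB"], 50];

(* DimensionReduction returns a DimensionReducerFunction, not the reduced data *)
reducer = DimensionReduction[imagelist, 3, Method -> "UMAP"];

(* Internal access from the reply: inspect the processor applied to the inputs *)
reducer[[1, "Processor"]]

(* Apply the internal processing chain from the reply to a single test image *)
reducer[[1, "Processor"]] @* reducer[[1, "Preprocessor"]] @ RandomImage[]

(* The reducer can still be applied to obtain the embedding itself *)
rep = reducer[imagelist];
```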
