The Wolfram Neural Net Repository offers ResNet-50, a very successful net for identifying the main object in an image.
After downloading the net, I wanted to learn about the structure of its NetEncoder.
Mathematica tells us that the type is "Image" and the "Image Size" is 224x224.
I have two questions related to this:
Does this mean that the net automatically resizes the original image (e.g. from 250x250 pixels) to 224x224?
Thank you for your time.
An input size of 224 by 224 has been common practice since AlexNet: taking 224 by 224 crops from a single 256 by 256 image gives 32x32 = 1024 times more training samples. I am afraid the size is hard-coded in the input layer, so you may need to crop the images yourself.
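One way to check directly what the encoder does with an image of a different size is to extract it from the net and apply it on its own. This is only a minimal sketch: it assumes the repository name "ResNet-50 Trained on ImageNet Competition Data", a working connection to download the model, and a random 250x250 test image standing in for a real photo.

```wolfram
(* Assumption: this is the repository name of the ResNet-50 model in question *)
net = NetModel["ResNet-50 Trained on ImageNet Competition Data"];

(* The encoder is attached to the net's "Input" port *)
enc = NetExtract[net, "Input"]

(* Hypothetical test input: a random 250x250 RGB image *)
img = RandomImage[1, {250, 250}, ColorSpace -> "RGB"];

(* Apply the encoder alone and inspect the shape of the array it produces *)
Dimensions[enc[img]]
```

If the reported dimensions already match the 224x224 input the net expects, the encoder is rescaling the image for you; if the call fails instead, the image has to be cropped or resized beforehand.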
You may want to try the ImageAugmentationLayer mentioned in this blog. Prepending this layer to your network will perform the image augmentation for you.
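As a rough illustration of what the layer does on its own, here is a small sketch. The 256x256 encoder size, the reflection probabilities, and the random test image are assumptions chosen for the example, and wiring the layer in front of the pretrained ResNet-50 (whose own encoder would have to be replaced) is not shown.

```wolfram
(* Random 224x224 crops from 256x256 inputs, with occasional left-right flips *)
(* Assumption: encoder size and flip probabilities are picked only for illustration *)
aug = ImageAugmentationLayer[{224, 224},
   "Input" -> NetEncoder[{"Image", {256, 256}}],
   "ReflectionProbabilities" -> {0.5, 0}];

(* Hypothetical test input *)
img = RandomImage[1, {256, 256}, ColorSpace -> "RGB"];

(* Random cropping is only applied in training mode *)
Dimensions[aug[img, NetEvaluationMode -> "Train"]]
```

Because the randomness is tied to NetEvaluationMode -> "Train", evaluating the layer in the default mode should not be expected to produce a different crop on each call.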