With the release of the new Wolfram ImageIdentify site, I decided try to make a mini version of this directly in the Wolfram Language using the Classify function. The ImageIdentify web site is capable of identifying about 10,000 entities, but I set a more modest goal of identifying just three entities: Lion, or Tiger, or bear? (Oh my!!!).
To start off we need sample images of many lions, tigers, and bears. A simple way to get those images, is to use an image search tool, like Microsoft's Bing image search. Simply drag ten or more representative images from your browser into Mathematica or save the images and Import them. You should end up with three lists like this:
data:image/s3,"s3://crabby-images/72257/72257c27c9c98ad58d4338244b7d5ebf6f488d76" alt="enter image description here"
Next, we need to write the classification code. We can call Classify which each category:
LionOrTigerOrBear = Classify[ <|
"lion" -> lions,
"tiger" -> tigers,
"bear" -> bears
|> ]
And that's it for the setup part! Now we're ready to try out if things are working.
Here is a test case with an image of a bear that is not in the original training data:
data:image/s3,"s3://crabby-images/eb538/eb538d22fa501ca07517a490b9d2d6eda2d2e6af" alt="enter image description here"
The classifier found a match! Next, is an image of a tiger running through some water:
data:image/s3,"s3://crabby-images/7b1f2/7b1f2286f57e5fa75f2efafb9df2499333b681cb" alt="enter image description here"
This is failing, possibly because the training images did not have a good matching image. And possibly, the result is a 'bear' because some bear images have water in the background. Whatever the underlying reason, the solution is to add more images to the training data (the more images you train with the better the results).
To make sure that tigers can be identified let's use an image of another tiger, resting in some grass:
data:image/s3,"s3://crabby-images/d6da6/d6da638b4d91de4bada16f19c5cd9fdb0f16e426" alt="enter image description here"
This worked! And not to leave out the lions, let's do an image of those as well:
data:image/s3,"s3://crabby-images/e6e36/e6e3607b209c80c90b111e55adb48130c55e4a81" alt="enter image description here"
Of course, you can give the LionOrTigerOrBear function an image of a lion, a tiger and a bear and see what happens:
data:image/s3,"s3://crabby-images/e77fb/e77fbe3c912f34eae3df25ff076c8aa10ab0c481" alt="enter image description here"
In this case, the classifier picked bear. And it is interesting to note that this algorithm always picks one of the three choices, even if you feed it an image of something that is neither a lion, a tiger, nor a bear:
data:image/s3,"s3://crabby-images/8adb4/8adb4ae4f52fe81e930e363631798619c1dc143b" alt="enter image description here"
You can look at the estimated probabilities of each classification:
data:image/s3,"s3://crabby-images/d2846/d28460b5d1a60e4ddb5b9bf527fb4ea53cb4033a" alt="enter image description here"
Since 'tiger' scored the highest (with a estimated likelihood of about 81%) it gets returned as the top pick. To make the classfier say "I don't know" you can add a threshold, below which the result will be Indeterminate:
data:image/s3,"s3://crabby-images/f27ea/f27eab82dd804c63ab393f5a886da320119a1798" alt="enter image description here"