Delightful!
Out of curiosity: You used Dall-E-2...have you tried Dall-E-3 as well? It should be as simple as specifying "Model"->"dall-e-3" for ImageSynthesize.
Also perhaps amusing would be to try round-tripping the images through gpt-4-vision : "What is the wolfram language code that would generate this image?" type queries