An interesting development with the most recent version of Chat-GPT, with its integration of DALL-E 3, is that you can insert images as a part of a prompt.
This also means you can give it feedback on an image that it has created.
You can say: “I like the bottom left image (from a grid of four), but I need it to be more like [text input - whatever]”.
Sure enough, the ‘Generative AI’ agent will go away and recreate an image based on your part-verbal and part-visual instruction.
It’s possible the system simply updates the language prompt, but it’s more likely that it has taken the image and the written feedback and fed this back into the engine to generate a second image.
In any case, it is now possible to prompt GPT-4 with images.
This is called a ‘multi-modal model’, and it opens GPT-4 up to a whole new world of vulnerabilities.