The Achilles Heel of Modern Language Models

Nov 23, 2023

An interesting development with the most recent version of Chat-GPT, with its integration of DALL-E 3, is that you can insert images as a part of a prompt.

This also means you can give it feedback on an image that it has created.

You can say: “I like the bottom left image (from a grid of four), but I need it to be more like [text input - whatever]”.

Sure enough, the ‘Generative AI’ agent will go away and recreate an image based on your part-verbal and part-visual instruction.

It’s possible the system simply updates the language prompt, but it’s more likely that it has taken the image and the written feedback and fed this back into the engine to generate a second image.

In any case, it is now possible to prompt GPT-4 with images.

This is called a ‘multi-modal model’, and it opens GPT-4 up to a whole new world of vulnerabilities.

From the Journal

Corporate News

Advai collaborates with FCA on second cohort of AI Live Testing

April 22, 2026

Learn

Why AI Isn't a Bubble - It's a Testing Crisis

April 13, 2026

Author:

Damian Ruck

Learn

From Trials to Tribulations: The Challenges of Evaluating AI Agents

March 23, 2026

Author:

Pranay Shah

Corporate News

Advai collaborates with FCA on second cohort of AI Live Testing

April 22, 2026

Learn

Why AI Isn't a Bubble - It's a Testing Crisis

April 13, 2026

Author:

Damian Ruck

Corporate News

Advai collaborates with FCA on second cohort of AI Live Testing

April 22, 2026

Learn

Why AI Isn't a Bubble - It's a Testing Crisis

April 13, 2026

Author:

Damian Ruck

Platform

Journal

About

Careers

The Achilles Heel of Modern Language Models

From the Journal