1
Nish Tahir
Nish Tahir

This week, we've got the most promising way to keep your content safe from all those pesky LLMs... Cat facts! arxiv.org

This paper builds on work that has attempted to measure various LLM's resiliency against pertubations - random noise/unrelated content - in the the prompts. These guys had the amazing idea to include a random unrelated (sometimes cat) fact in prompts then measure the response accuracy.

In the reasoning models they tested on various standard benchmarks they found that the models consistently scored worse overall whenever the unrelated facts were present

This is pretty consistent with some other work that has been done. What came to my mind was GSM-Symbolic which came out last year arxiv.org

The easy conclusion here is that the reasoning models tested are pretty sensitive to the content introduced in the context window. So craft your prompt/context carefully. More information may not always be better. But highly relevant information might be.

1 / 4

Share this Page