- LLMs can hide arbitrary undetectable information in their responses https://arxiv.org/abs/2401.10360 79 comments machinelearning
Paper title: [2401.10360] Excuse me, sir? Your language model is leaking (information)