Lobsters
- Some Notes on Adversarial Attacks on LLMs https://cybernetist.com/2024/09/23/some-notes-on-adversarial-attacks-on-llms/ 8 comments ai
Linked pages
- Go or Rust? Just Listen to the Bots - Cybernetist https://cybernetist.com/2024/04/25/go-or-rust-just-listen-to-the-bots/ 91 comments
- OWASP Top 10 for Large Language Model Applications | OWASP Foundation https://owasp.org/www-project-top-10-for-large-language-model-applications/ 53 comments
- [2310.04406] Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models https://arxiv.org/abs/2310.04406 21 comments
- Autoregressive model - Wikipedia http://en.wikipedia.org/wiki/Autoregressive_model#Derivation 11 comments
- [2308.03825] "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models https://arxiv.org/abs/2308.03825 4 comments
- [2312.02119] Tree of Attacks: Jailbreaking Black-Box LLMs Automatically https://arxiv.org/abs/2312.02119 2 comments
- [2402.05668] Comprehensive Assessment of Jailbreak Attacks Against LLMs https://arxiv.org/abs/2402.05668 1 comment
- A Small Tool for Exploring Text Embeddings - Cybernetist https://cybernetist.com/2024/03/27/a-small-tool-for-exploring-text-embeddings/ 1 comment
- [1712.06751] HotFlip: White-Box Adversarial Examples for Text Classification https://arxiv.org/abs/1712.06751 0 comments
- GitHub - jind11/TextFooler: A Model for Natural Language Attack on Text Classification and Inference https://github.com/jind11/TextFooler 0 comments
- Large language model - Wikipedia https://en.wikipedia.org/wiki/Large_language_model 0 comments
- [2310.02446] Low-Resource Languages Jailbreak GPT-4 https://arxiv.org/abs/2310.02446 0 comments
- [2403.04769] Using Hallucinations to Bypass GPT4's Filter https://arxiv.org/abs/2403.04769 0 comments