Some Notes on Adversarial Attacks on LLMs - Cybernetist - discu.eu

Lobsters

Some Notes on Adversarial Attacks on LLMs https://cybernetist.com/2024/09/23/some-notes-on-adversarial-attacks-on-llms/ 8 comments 23/9/2024 vibecoding

Linking pages

You Should Probably Pay Attention to Tokenizers - Cybernetist https://cybernetist.com/2024/10/21/you-should-probably-pay-attention-to-tokenizers/ 100 comments

Linked pages

Go or Rust? Just Listen to the Bots - Cybernetist https://cybernetist.com/2024/04/25/go-or-rust-just-listen-to-the-bots/ 91 comments
OWASP Top 10 for Large Language Model Applications | OWASP Foundation https://owasp.org/www-project-top-10-for-large-language-model-applications/ 53 comments
[2310.04406] Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models https://arxiv.org/abs/2310.04406 21 comments
Autoregressive model - Wikipedia http://en.wikipedia.org/wiki/Autoregressive_model#Derivation 11 comments
[2308.03825] "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models https://arxiv.org/abs/2308.03825 4 comments
[2312.02119] Tree of Attacks: Jailbreaking Black-Box LLMs Automatically https://arxiv.org/abs/2312.02119 2 comments
[2402.05668] Comprehensive Assessment of Jailbreak Attacks Against LLMs https://arxiv.org/abs/2402.05668 1 comment
A Small Tool for Exploring Text Embeddings - Cybernetist https://cybernetist.com/2024/03/27/a-small-tool-for-exploring-text-embeddings/ 1 comment
[1712.06751] HotFlip: White-Box Adversarial Examples for Text Classification https://arxiv.org/abs/1712.06751 0 comments
GitHub - jind11/TextFooler: A Model for Natural Language Attack on Text Classification and Inference https://github.com/jind11/TextFooler 0 comments
Large language model - Wikipedia https://en.wikipedia.org/wiki/Large_language_model 0 comments
[2310.02446] Low-Resource Languages Jailbreak GPT-4 https://arxiv.org/abs/2310.02446 0 comments
[2403.04769] Using Hallucinations to Bypass GPT4's Filter https://arxiv.org/abs/2403.04769 0 comments

