Hacker News
- The LLM Jailbreaking Bible: Code Implementation and Overview https://generalanalysis.com/blog/jailbreak_cookbook 2 comments
- The Jailbreak Bible https://generalanalysis.com/blog/jailbreak_cookbook 4 comments
Linked pages
- [2412.03556] Best-of-N Jailbreaking https://arxiv.org/abs/2412.03556 17 comments
- ChatGPT jailbreak forces it to break its own rules https://www.cnbc.com/2023/02/06/chatgpt-jailbreak-forces-it-to-break-its-own-rules.html 8 comments
- [2308.03825] "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models https://arxiv.org/abs/2308.03825 4 comments
- [2307.15043] Universal and Transferable Adversarial Attacks on Aligned Language Models https://arxiv.org/abs/2307.15043 3 comments
- [2312.02119] Tree of Attacks: Jailbreaking Black-Box LLMs Automatically https://arxiv.org/abs/2312.02119 2 comments
- ChatGPT’s alter ego, Dan: users jailbreak AI program to get around ethical safeguards | ChatGPT | The Guardian https://www.theguardian.com/technology/2023/mar/08/chatgpt-alter-ego-dan-users-jailbreak-ai-program-to-get-around-ethical-safeguards 1 comment
- Many-shot jailbreaking \ Anthropic https://www.anthropic.com/research/many-shot-jailbreaking 1 comment
- [2305.13860] Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study https://arxiv.org/abs/2305.13860 0 comments
- Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations | Research - AI at Meta https://ai.meta.com/research/publications/llama-guard-llm-based-input-output-safeguard-for-human-ai-conversations/ 0 comments
- GitHub - elder-plinius/L1B3RT4S: TOTALLY HARMLESS LIBERATION PROMPTS FOR GOOD LIL AI'S https://github.com/elder-plinius/L1B3RT4S 0 comments
- [2402.04249] HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal https://arxiv.org/abs/2402.04249 0 comments
- GitHub - General-Analysis/GA: An encyclopedia of jailbreaking techniques to make AI models safer. https://github.com/General-Analysis/GA 0 comments