Linking pages
- Model alignment protects against accidental harms, not intentional ones https://www.aisnakeoil.com/p/model-alignment-protects-against 0 comments
- Safety as a Scientific Pursuit - by Tom McGrath https://banburismus.substack.com/p/safety-as-a-scientific-pursuit 0 comments
- GitHub - elicit/machine-learning-list https://github.com/elicit/machine-learning-list 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2311.12786] Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks
See how to search.