Hacker News
Linking pages
- To Understand Language is to Understand Generalization | Eric Jang https://evjang.com/2021/12/17/lang-generalization.html 38 comments
- GitHub - dpaleka/llm-chess-proofgame: LLMs playing chess are sensitive to how the position came to be https://github.com/dpaleka/llm-chess-proofgame 31 comments
- Reinforcement Learning as a fine-tuning paradigm | Ankesh Anand https://ankeshanand.com/blog/2022/01/08/rl-fine-tuning.html 8 comments
- Eric Jang on Robots Learning at Google and Generalization via Language https://thegradientpub.substack.com/p/eric-jang-on-robots-learning-at-google 0 comments
- Last Week in AI #140: Adobe's Deepfake tool, Clearview AI takes part in third-party test, how AI can help supply chains https://lastweekin.ai/p/140 0 comments
- Software² | Minqi Jiang https://blog.minch.co/2022/11/15/software-squared.html 0 comments
- Modern AI is Domestification https://thegradient.pub/ai-is-domestification/ 0 comments
- Safety as a Scientific Pursuit - by Tom McGrath https://banburismus.substack.com/p/safety-as-a-scientific-pursuit 0 comments
Linked pages
- AlphaStar: Grandmaster level in StarCraft II using multi-agent reinforcement learning https://deepmind.com/blog/article/AlphaStar-Grandmaster-level-in-StarCraft-II-using-multi-agent-reinforcement-learning 468 comments
- The Bitter Lesson http://incompleteideas.net/IncIdeas/BitterLesson.html 366 comments
- [1611.03530] Understanding deep learning requires rethinking generalization https://arxiv.org/abs/1611.03530 24 comments
- [2108.07258] On the Opportunities and Risks of Foundation Models https://arxiv.org/abs/2108.07258 11 comments
- [1707.01495] Hindsight Experience Replay https://arxiv.org/abs/1707.01495 11 comments
- [2106.01345] Decision Transformer: Reinforcement Learning via Sequence Modeling https://arxiv.org/abs/2106.01345 9 comments
- [2102.06356] A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes https://arxiv.org/abs/2102.06356 7 comments
- Mirror test - Wikipedia http://en.wikipedia.org/wiki/Mirror_test 7 comments
- Jürgen Schmidhuber's Solution to AI Consciousness - YouTube https://www.youtube.com/watch?v=q4fFuZgOZn8 6 comments
- Markov decision process - Wikipedia https://en.wikipedia.org/wiki/Markov_decision_process#Algorithms 2 comments
- [2106.02039] Offline Reinforcement Learning as One Big Sequence Modeling Problem https://arxiv.org/abs/2106.02039 1 comment
- Machine Learning Trick of the Day (5): Log Derivative Trick – The Spectator http://blog.shakirm.com/2015/11/machine-learning-trick-of-the-day-5-log-derivative-trick/ 1 comment
- https://language-play.github.io/ 0 comments
- [2001.08361] Scaling Laws for Neural Language Models https://arxiv.org/abs/2001.08361 0 comments
- SEER: The start of a more powerful, flexible, and accessible era for computer vision https://ai.facebook.com/blog/seer-the-start-of-a-more-powerful-flexible-and-accessible-era-for-computer-vision/ 0 comments
- [1812.06162] An Empirical Model of Large-Batch Training https://arxiv.org/abs/1812.06162 0 comments
Related searches:
Search whole site: site:evjang.com
Search title: Just Ask for Generalization | Eric Jang
See how to search.