Hacker News
- Q* Hypothesis: Enhancing Reasoning, Rewards, and Synthetic Data https://www.interconnects.ai/p/q-star 63 comments
Linking pages
- Will scaling work? - by Dwarkesh Patel - Dwarkesh Podcast https://www.dwarkeshpatel.com/p/will-scaling-work 286 comments
- Why Won’t OpenAI Say What the Q* Algorithm Is? - The Atlantic https://www.theatlantic.com/technology/archive/2023/11/openai-sam-altman-q-algorithm-breakthrough-project/676163/ 3 comments
- These Clues Hint at the True Nature of OpenAI’s Shadowy Q* Project | WIRED https://www.wired.com/story/fast-forward-clues-hint-openai-shadowy-q-project/ 1 comment
- We aren’t running out of training data, we are running out of open training data https://www.interconnects.ai/p/the-data-wall 0 comments
- Reverse engineering OpenAI’s o1 - by Nathan Lambert https://www.interconnects.ai/p/reverse-engineering-openai-o1 0 comments
Linked pages
- Exclusive: Sam Altman's ouster at OpenAI was precipitated by letter to board about AI breakthrough -sources | Reuters https://www.reuters.com/technology/sam-altmans-ouster-openai-was-precipitated-by-letter-board-about-ai-breakthrough-2023-11-22/ 2918 comments
- A* search algorithm - Wikipedia https://en.wikipedia.org/wiki/A*_search_algorithm 63 comments
- Monte Carlo tree search - Wikipedia https://en.wikipedia.org/wiki/Monte_Carlo_tree_search 12 comments
- [2305.10601] Tree of Thoughts: Deliberate Problem Solving with Large Language Models https://arxiv.org/abs/2305.10601 3 comments
- [2305.20050] Let's Verify Step by Step https://arxiv.org/abs/2305.20050 3 comments
- OpenAI Dropped Work on New ‘Arrakis’ AI Model in Rare Setback — The Information https://www.theinformation.com/articles/openai-dropped-work-on-new-arrakis-ai-model-in-rare-setback 3 comments
- Beyond human data: RLAIF needs a rebrand https://www.interconnects.ai/p/beyond-human-data-rlaif 0 comments
- [2211.14275] Solving math word problems with process- and outcome-based feedback https://arxiv.org/abs/2211.14275#deepmind 0 comments
Related searches:
Search whole site: site:interconnects.ai
Search title: The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data
See how to search.