Hacker News
- RedPajama v2 Open Dataset with 30T Tokens for Training LLMs https://together.ai/blog/redpajama-data-v2 60 comments
Linking pages
- I. From GPT-4 to AGI: Counting the OOMs - SITUATIONAL AWARENESS https://situational-awareness.ai/from-gpt-4-to-agi/ 91 comments
- The New Kings of Open Source AI (Oct 2023 Recap) https://www.latent.space/p/oct-2023 3 comments
- II. From AGI to Superintelligence: the Intelligence Explosion - SITUATIONAL AWARENESS https://situational-awareness.ai/from-agi-to-superintelligence/ 3 comments
- How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4 Research https://www.latent.space/p/idefics 0 comments
- Cloud Intelligence at the speed of 5000 tok/s - with Ce Zhang and Vipul Ved Prakash of Together AI https://www.latent.space/p/together 0 comments
- GitHub - lmmlzn/Awesome-LLMs-Datasets: Summarize existing representative LLMs text datasets. https://github.com/lmmlzn/Awesome-LLMs-Datasets 0 comments
- GPT-6 (2025) – Dr Alan D. Thompson – Life Architect https://lifearchitect.ai/gpt-6/ 0 comments
- How much LLM training data is there, in the limit? – Educating Silicon https://www.educatingsilicon.com/2024/05/09/how-much-llm-training-data-is-there-in-the-limit/ 0 comments
- GitHub - fabiochiusano/ai-news-tracker: ~300 news for quickly getting up-to-date with the generative AI landscape https://github.com/fabiochiusano/ai-news-tracker 0 comments
- GitHub - fabiochiusano/Awesome-AI-News: ~300 news for quickly getting up-to-date with the generative AI landscape https://github.com/fabiochiusano/Awesome-AI-News/tree/main 0 comments
Related searches:
Search whole site: site:together.ai
Search title: RedPajama-Data-v2: An open dataset with 30 trillion tokens for training large language models
See how to search.