Hacker News
- Tech companies are turning to 'synthetic data' to train AI models https://theconversation.com/tech-companies-are-turning-to-synthetic-data-to-train-ai-models-but-theres-a-hidden-cost-246248 0 comments
Linked pages
- AI models collapse when trained on recursively generated data | Nature https://www.nature.com/articles/s41586-024-07566-y 851 comments
- OpenAI cofounder Ilya Sutskever predicts the end of AI pre-training - The Verge https://www.theverge.com/2024/12/13/24320811/what-ilya-sutskever-sees-openai-model-data-training 240 comments
- http://www.chatgpt.com 209 comments
- ISO - International Organization for Standardization https://www.iso.org/home.html 163 comments
- AI datasets are filled with errors. It's warping what we know about AI | MIT Technology Review https://www.technologyreview.com/2021/04/01/1021619/ai-data-errors-warp-machine-learning-progress 38 comments
- Elon Musk says all human data for AI training ‘exhausted’ | Artificial intelligence (AI) | The Guardian https://www.theguardian.com/technology/2025/jan/09/elon-musk-data-ai-training-artificial-intelligence 28 comments
- Creative Commons — Attribution-ShareAlike 4.0 International — CC BY-SA 4.0 https://creativecommons.org/licenses/by-sa/4.0/ 8 comments
- [2211.04325] Will we run out of data? Limits of LLM scaling based on human-generated data https://arxiv.org/abs/2211.04325 1 comment
- AI 'gold rush' for chatbot training data could run out of human-written text | AP News https://apnews.com/article/ai-artificial-intelligence-training-data-running-out-9676145bac0d30ecce1513c20561b87d 1 comment
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:theconversation.com
Search title: Tech companies are turning to ‘synthetic data’ to train AI models – but there’s a hidden cost
See how to search.