Linked pages
- Reddit says it's made $203M so far licensing its data | TechCrunch https://techcrunch.com/2024/02/22/reddit-says-its-made-203m-so-far-licensing-its-data/ 64 comments
- Here's Proof You Can Train an AI Model Without Slurping Copyrighted Content | WIRED https://www.wired.com/story/proof-you-can-train-ai-without-slurping-copyrighted-content/ 49 comments
- Copyright Term Extension Act - Wikipedia https://en.wikipedia.org/wiki/Copyright_Term_Extension_Act 26 comments
- Issues | Electronic Frontier Foundation https://www.eff.org/work 4 comments
- Releasing Common Corpus: the largest public domain dataset for training LLMs https://huggingface.co/blog/Pclanglais/common-corpus 3 comments
- Open Source AI Deep Dive – Open Source Initiative https://opensource.org/deepdive 2 comments
- The Stack - BigCode https://www.bigcode-project.org/docs/about/the-stack/ 1 comment
- Authors Guild, Inc. v. Google, Inc. - Wikipedia https://en.wikipedia.org/wiki/Authors_Guild,_Inc._v._Google,_Inc. 1 comment
- EleutherAI https://www.eleuther.ai/ 0 comments
- Open Source AI Definition – Weekly update May 6 – Open Source Initiative https://opensource.org/blog/open-source-ai-definition-weekly-update-may-6 0 comments
- Open Source AI Definition – Weekly update May 13 – Open Source Initiative https://opensource.org/blog/open-source-ai-definition-weekly-update-may-13 0 comments