- [D] Appreciating the complexity of large language models data pipelines https://blog.christianperone.com/2023/06/appreciating-llms-data-pipelines/ 5 comments machinelearning
Linked pages
- Common Crawl https://commoncrawl.org/ 85 comments
- [2101.00027] The Pile: An 800GB Dataset of Diverse Text for Language Modeling https://arxiv.org/abs/2101.00027 81 comments
- [1607.01759] Bag of Tricks for Efficient Text Classification https://arxiv.org/abs/1607.01759 15 comments
- [2304.01373] Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling https://arxiv.org/abs/2304.01373 7 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:blog.christianperone.com
Search title: Appreciating the complexity of large language models data pipelines | Christian S. Perone
See how to search.