Hacker News
- Common Crawl https://commoncrawl.org/ 7 comments
- Common Crawl https://commoncrawl.org/ 61 comments
- Common Crawl http://commoncrawl.org/ 5 comments
- CommonCrawl: an open repository of web crawl data that is universally accessible http://www.commoncrawl.org/ 8 comments
Lobsters
Linking pages
- What Is ChatGPT Doing … and Why Does It Work?—Stephen Wolfram Writings https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/ 986 comments
- Understanding ChatGPT - Atmosera https://www.atmosera.com/ai/understanding-chatgpt/ 232 comments
- psuter.net https://psuter.net/2019/07/07/z-index 136 comments
- What every software engineer should know about search https://scribe.rip/p/what-every-software-engineer-should-know-about-search-27d1df99f80d 132 comments
- A look at search engines with their own indexes - Seirdy https://seirdy.one/posts/2021/03/10/search-engines-with-own-indexes/ 125 comments
- Microsoft unveils AI model that understands image content, solves visual puzzles | Ars Technica https://arstechnica.com/?p=1920920 102 comments
- Index 1,600,000,000 Keys with Automata and Rust - Andrew Gallant's Blog https://blog.burntsushi.net/transducers/ 92 comments
- Why You (Probably) Don't Need to Fine-tune an LLM - Tidepool by Aquarium https://www.tidepool.so/2023/08/17/why-you-probably-dont-need-to-fine-tune-an-llm/ 73 comments
- Exploring Transfer Learning with T5: the Text-To-Text Transfer Transformer – Google AI Blog https://ai.googleblog.com/2020/02/exploring-transfer-learning-with-t5.html 66 comments
- Microsoft unveils AI model that understands image content, solves visual puzzles | Ars Technica https://arstechnica.com/information-technology/2023/03/microsoft-unveils-kosmos-1-an-ai-language-model-with-visual-perception-abilities/ 54 comments
- How to turn an ordinary gzip archive into a database | Artem Golubin https://rushter.com/blog/gzip-indexing/ 47 comments
- Fun and Dystopia With AI-Based Code Generation Using GPT-J-6B | Max Woolf's Blog https://minimaxir.com/2021/06/gpt-j-6b/ 41 comments
- Bigger data; same laptop http://www.frankmcsherry.org/graph/scalability/cost/2015/02/04/COST2.html 38 comments
- Language-Agnostic BERT Sentence Embedding – Google AI Blog https://ai.googleblog.com/2020/08/language-agnostic-bert-sentence.html 35 comments
- Minority Voices 'Filtered' Out of Google Natural Language Processing Models - Unite.AI https://www.unite.ai/minority-voices-filtered-out-of-google-natural-language-processing-models/ 34 comments
- GitHub - openvenues/libpostal: A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data. https://github.com/openvenues/libpostal 32 comments
- Lookism in TikTok. Intro | by Enryu | Sep, 2022 | Medium https://medium.com/@enryu9000/lookism-in-tiktok-3def0f20cf78 31 comments
- Meta unveils a new large language model that can run on a single GPU [Updated] | Ars Technica https://arstechnica.com/information-technology/2023/02/chatgpt-on-your-pc-meta-unveils-new-ai-model-that-can-run-on-a-single-gpu/ 28 comments
- GitHub - smicallef/spiderfoot: SpiderFoot automates OSINT for threat intelligence and mapping your attack surface. https://github.com/smicallef/spiderfoot 26 comments
- A look at search engines with their own indexes - Seirdy https://seirdy.one/2021/03/10/search-engines-with-own-indexes.html 22 comments
Would you like to stay up to date with Web Development? Checkout Web Development Weekly.
Related searches:
Search whole site: site:commoncrawl.org
Search title: Common Crawl
See how to search.