Hacker News
- The Stack: 3 TB of permissively licensed source code in 30 programming languages https://huggingface.co/datasets/bigcode/the-stack 5 comments
Linking pages
- The K8s YAML dataset | Substratus https://www.substratus.ai/blog/k8s-yaml-dataset 3 comments
- AI Fundamentals: Datasets 101 - Latent Space https://www.latent.space/p/datasets-101 1 comment
- GitHub - wsxiaoys/awesome-ai-coding https://github.com/wsxiaoys/awesome-ai-coding 0 comments
- GitHub - huggingface/huggingface-vscode: Code completion VSCode extension for OSS models https://github.com/huggingface/huggingface-vscode 0 comments
- Dolma, OLMo, and the Future of Open-Source LLMs https://cameronrwolfe.substack.com/p/dolma-olmo-and-the-future-of-open 0 comments
- GitHub - lmmlzn/Awesome-LLMs-Datasets: Summarize existing representative LLMs text datasets. https://github.com/lmmlzn/Awesome-LLMs-Datasets 0 comments
- GitHub - huggingface/llm-vscode: LLM powered development for VSCode https://github.com/huggingface/llm-vscode 0 comments
- GitHub - hiyouga/LLaMA-Factory: A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024) https://github.com/hiyouga/LLaMA-Factory 0 comments
Related searches:
Search whole site: site:huggingface.co
Search title: bigcode/the-stack · Datasets at Hugging Face
See how to search.