Hacker News
- Show HN: Curator – an open-source library for synthetic data generation https://github.com/bespokelabsai/curator 6 comments
- Spent an hour coding and got a neat improvement in accuracy with a 14x cheaper model. Distillation is underrated https://github.com/bespokelabsai/curator 0 comments programming
Linking pages
- Scaling up Open Reasoning with OpenThinker-32B | Open Thoughts https://www.open-thoughts.ai/blog/scale 1 comment
- GitHub - mlfoundations/evalchemy: Automatic evals for LLMs https://github.com/mlfoundations/evalchemy 0 comments
- What DeepSeek means for the world - by Mahesh https://madiator.substack.com/p/what-deepseek-means-for-the-world 0 comments
Linked pages
- tqdm documentation https://tqdm.github.io/ 162 comments
- GitHub - Textualize/rich: Rich is a Python library for rich text and beautiful formatting in the terminal. https://github.com/Textualize/rich 136 comments
- https://platform.openai.com/docs/guides/batch 1 comment
- GitHub - open-thoughts/open-thoughts: Fully open data curation for reasoning models https://github.com/open-thoughts/open-thoughts 1 comment
- bespokelabs/Bespoke-Stratos-17k · Datasets at Hugging Face https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - bespokelabsai/curator: Synthetic data curation for post-training and structured data extraction
See how to search.