Hacker News
- Cleanlab https://github.com/cleanlab/cleanlab 2 comments python
- [P] Announcing cleanlab 2.0: Automatically Find Errors in ML Datasets https://github.com/cleanlab/cleanlab 12 comments machinelearning
Linking pages
- GitHub - cleanlab/cleanvision: Automatically find issues in image datasets and practice data-centric computer vision https://github.com/cleanlab/cleanvision 15 comments
- Handling Mislabeled Tabular Data to Improve Your XGBoost Model | by Chris Mauck | Jan, 2023 | Towards AI https://pub.towardsai.net/handling-mislabeled-tabular-data-to-improve-your-xgboost-model-fbe051f4a6a6 1 comment
- GitHub - daytonaio/ai-enablement-stack: A Community-Driven Mapping of AI Development Tools https://github.com/daytonaio/ai-enablement-stack 1 comment
- GitHub - r0f1/datascience: Curated list of Python resources for data science. https://github.com/r0f1/datascience 0 comments
- GitHub - kelvins/awesome-mlops: A curated list of awesome MLOps tools https://github.com/kelvins/awesome-mlops 0 comments
- GitHub - koaning/doubtlab: Doubt your data, find bad labels. https://github.com/koaning/doubtlab 0 comments
- Researchers Release Cleanlab 2.0: An Open-Source Python Framework For Machine Learning And Analytics With Messy, Real-World Data - MarkTechPost https://www.marktechpost.com/2022/04/25/researchers-release-cleanlab-2-0-an-open-source-python-framework-for-machine-learning-and-analytics-with-messy-real-world-data/ 0 comments
- GitHub - academic/awesome-datascience: An awesome Data Science repository to learn and apply for real world problems. https://github.com/bulutyazilim/awesome-datascience 0 comments
- GitHub - trackawesomelist/trackawesomelist: Track 500+ Awesome List Updates, Track it - not just star it! https://github.com/trackawesomelist/trackawesomelist 0 comments
- GitHub - vihar/awesome-oss-saas: A collection of open-source saas tools https://github.com/vihar/awesome-oss-saas 0 comments
- GitHub - Renumics/awesome-open-data-centric-ai: Curated list of open source tooling for data-centric AI on unstructured data. https://github.com/Renumics/awesome-open-data-centric-ai 0 comments
Linked pages
- Introduction to Data-Centric AI https://dcai.csail.mit.edu/ 31 comments
- [2103.14749] Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks https://arxiv.org/abs/2103.14749 4 comments
- Label Errors https://labelerrors.com/ 3 comments
- Most AI & Analytics are impaired by data issues. Now AI can help you fix them. https://cleanlab.ai/blog/data-centric-ai/ 2 comments
- An Introduction to Confident Learning: Finding and Learning with Label Errors in Datasets https://l7.curtisnorthcutt.com/confident-learning 1 comment
- [2207.03061] Back to the Basics: Revisiting Out-of-Distribution Detection Baselines https://arxiv.org/abs/2207.03061 0 comments