Linking pages
- GitHub - lancedb/lance: Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming.. https://github.com/lancedb/lance 21 comments
- GitHub - eto-ai/lance: Blazing fast exploration and analysis of computer vision data using SQL and DuckDB, backed by an Apache-Arrow compatible data format https://github.com/eto-ai/lance 1 comment
- GitHub - r0f1/datascience: Curated list of Python resources for data science. https://github.com/r0f1/datascience 0 comments
- GitHub - ml-tooling/best-of-ml-python: 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly. https://github.com/ml-tooling/best-of-ml-python 0 comments
- GitHub - kelvins/awesome-mlops: A curated list of awesome MLOps tools https://github.com/kelvins/awesome-mlops 0 comments
- GitHub - ray-project/xgboost_ray: Distributed XGBoost on Ray https://github.com/ray-project/xgboost_ray 0 comments
- Gathering and Using Big Data from Public APIs for Data Science - The GitHub Popularity Project https://www.kamwithk.com/big-data-from-public-apis-for-data-science-the-github-popularity-project 0 comments
- GitHub - rom1504/img2dataset: Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine. https://github.com/rom1504/img2dataset 0 comments