- Apache Parquet Rust implementation (parquet-rs) Release 0.1.0 https://parquet.apache.org/ 16 comments rust
Linking pages
- Analyzing multi-gigabyte JSON files locally | thenybble.de https://thenybble.de/posts/json-analysis/ 321 comments
- GitHub - jhuangtw/xg2xg: by ex-googlers, for ex-googlers - a lookup table of similar tech & services https://github.com/jhuangtw-dev/xg2xg 240 comments
- Announcing PartiQL: One query language for all your data | AWS Open Source Blog https://aws.amazon.com/blogs/opensource/announcing-partiql-one-query-language-for-all-your-data/ 94 comments
- GitHub - akullpp/awesome-java: A curated list of awesome frameworks, libraries and software for the Java programming language. https://github.com/akullpp/awesome-java 90 comments
- Amazon’s Exabyte-Scale Migration from Apache Spark to Ray on Amazon EC2 | AWS Open Source Blog https://aws.amazon.com/blogs/opensource/amazons-exabyte-scale-migration-from-apache-spark-to-ray-on-amazon-ec2/ 90 comments
- Introduction | Parseable https://www.parseable.io/docs/introduction 89 comments
- GitHub - Hafthor/zsvutil: ZSV Utility for converting csv/tsv to/from zip-separated-values https://github.com/Hafthor/zsvutil 69 comments
- pandas/install.rst at cca50247f3953b55cb1cfe36852af362723452c5 · TomAugspurger/pandas · GitHub https://github.com/tomaugspurger/pandas/blob/cca50247f3953b55cb1cfe36852af362723452c5/doc/source/install.rst#plan-for-dropping-python-27 56 comments
- Amazon Redshift Spectrum – Exabyte-Scale In-Place Queries of S3 Data | AWS News Blog https://aws.amazon.com/blogs/aws/amazon-redshift-spectrum-exabyte-scale-in-place-queries-of-s3-data/ 54 comments
- Fastest Way to Read Excel in Python | Haki Benita https://hakibenita.com/fast-excel-python 48 comments
- The ultimate guide to Pandas’ read_csv() function | by Finn Andersen | Mar, 2023 | Medium https://medium.com/@finndersen/the-ultimate-guide-to-pandas-read-csv-function-5377874e27d5 40 comments
- Building an open data pipeline in 2024 - by Dan Goldin https://blog.twingdata.com/p/building-an-open-data-pipeline-in 32 comments
- GitHub - jqnatividad/qsv: CSVs sliced, diced & analyzed. https://github.com/jqnatividad/qsv 31 comments
- Druid: A Real-time Analytical Data Store https://www.micahlerner.com/2022/05/15/druid-a-real-time-analytical-data-store.html 30 comments
- How to analyse 100 GB of data on your laptop with Python | by Jovan Veljanoski | Towards Data Science https://towardsdatascience.com/how-to-analyse-100s-of-gbs-of-data-on-your-laptop-with-python-f83363dda94 26 comments
- Big Data file formats - Blog | luminousmen https://luminousmen.com/post/big-data-file-formats 24 comments
- Announcing AWS Glue DataBrew – A Visual Data Preparation Tool That Helps You Clean and Normalize Data Faster | AWS News Blog https://aws.amazon.com/blogs/aws/announcing-aws-glue-databrew-a-visual-data-preparation-tool-that-helps-you-clean-and-normalize-data-faster/ 19 comments
- Finding a standard dataset format for machine learning | OpenML Blog https://openml.github.io/blog/openml/data/2020/03/23/Finding-a-standard-dataset-format-for-machine-learning.html 17 comments
- GitHub - sushrut141/pg_analytica: Postgres extension that speeds up analytics queries by upto 90% https://github.com/sushrut141/pg_analytica 17 comments
- Apache Arrow, Parquet, Flight and Their Ecosystem are a Game Changer for OLAP | InfluxData https://www.influxdata.com/blog/apache-arrow-parquet-flight-and-their-ecosystem-are-a-game-changer-for-olap/ 14 comments