Hacker News
- Documentation for the JSON Lines text file format https://jsonlines.org/ 88 comments
- JSON Lines http://jsonlines.org/examples/ 48 comments
- JSON lines http://jsonlines.org/examples/ 86 comments programming
Linking pages
- LSP could have been better https://matklad.github.io/2023/10/12/lsp-could-have-been-better.html 260 comments
- Advanced Web Scraping: Bypassing "403 Forbidden," captchas, and more | sangaline.com http://sangaline.com/post/advanced-web-scraping-tutorial/ 238 comments
- GitHub - kellyjonbrazil/jc: CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts. https://github.com/kellyjonbrazil/jc 197 comments
- How To Finetune GPT Like Large Language Models on a Custom Dataset - Lightning AI https://lightning.ai/pages/blog/how-to-finetune-gpt-like-large-language-models-on-a-custom-dataset/ 122 comments
- GitHub - mikeroyal/Self-Hosting-Guide: Self-Hosting Guide. Learn all about locally hosting (on premises & private web servers) and managing software applications by yourself or your organization. Including Cloud, LLMs, WireGuard, Automation, Home Assistant, and Networking. https://github.com/mikeroyal/Self-Hosting-Guide 108 comments
- GitHub - dbohdan/structured-text-tools: A list of command line tools for manipulating structured text data https://github.com/dbohdan/structured-text-tools 106 comments
- GitHub - EntilZha/PyFunctional: Python library for creating data pipelines with chain functional programming https://github.com/EntilZha/PyFunctional 97 comments
- GitHub - tailscale/golink: A private shortlink service for tailnets https://github.com/tailscale/golink 90 comments
- GitHub - clovaai/donut: Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 https://github.com/clovaai/donut 90 comments
- GitHub - EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. https://github.com/EleutherAI/gpt-neox 67 comments
- GitHub - tidwall/gjson: Get JSON values quickly - JSON parser for Go https://github.com/tidwall/gjson/blob/master/readme.md 66 comments
- Reducing Memory Usage in Ruby | Tenderlove Making https://tenderlovemaking.com/2018/01/23/reducing-memory-usage-in-ruby.html 63 comments
- Parsing 18 billion JSON lines with Go | by Roffe | ITNEXT https://medium.com/@roffe/parsing-18-billion-lines-json-with-go-738be6ee5ed2?amp%3Bsk=0a57d3811168ab4d48c37387f69bb92c&source=friends_link 55 comments
- GitHub - tidwall/jj: JSON Stream Editor (command line utility) https://github.com/tidwall/jj 48 comments
- 2.0 · asciinema blog http://blog.asciinema.org/post/two-point-o/ 46 comments
- GitHub - cicada-lang/whereabouts: Logic programming with JSON http://github.com/cicada-lang/cicada-whereabouts 44 comments
- GitHub - yujiosaka/headless-chrome-crawler: Distributed crawler powered by Headless Chrome https://github.com/yujiosaka/headless-chrome-crawler 33 comments
- GitHub - jqnatividad/qsv: CSVs sliced, diced & analyzed. https://github.com/jqnatividad/qsv 31 comments
- GitHub - asg017/sqlite-lines: A SQLite extension for reading large files line-by-line (NDJSON, logs, txt, etc.) https://github.com/asg017/sqlite-lines 29 comments
- GitHub - tidwall/gjson.rs: Get JSON values quickly - JSON parser for Rust https://github.com/tidwall/gjson.rs 24 comments