Hacker News
- Extracting tabular data from U.S. Senators' scanned-in personal finance reports https://github.com/dannguyen/abbyy-finereader-ocr-senate 6 comments
Linking pages
Linked pages
- ImageMagick – Convert, Edit, or Compose Digital Images http://www.imagemagick.org/script/index.php 111 comments
- GitHub - tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository) https://github.com/tesseract-ocr/tesseract 81 comments
- Tabula: Extract Tables from PDFs http://tabula.technology/ 55 comments
- Poppler https://poppler.freedesktop.org/ 17 comments
- Office of the Clerk, U.S. House of Representatives http://clerk.house.gov/public_disc/financial-search.aspx 15 comments
- Vision AI | Cloud Vision API | Google Cloud https://cloud.google.com/vision/ 2 comments
- Heart of Nerd Darkness: Why Updating Dollars for Docs Was So Difficult — ProPublica http://www.propublica.org/nerds/item/heart-of-nerd-darkness-why-dollars-for-docs-was-so-difficult 0 comments