Hacker News
- Global population datasets underrepresent rural population https://www.nature.com/articles/s41467-025-56906-7 98 comments
- What are the best tools for web scraping and analysis of natural language to populate a dataset? None 6 comments datasets
- TIL that we have a datasets subreddit. Please give this some love so that we can populate it. http://www.reddit.com/r/datasets/ 23 comments programming
- New model predicts characteristics of terror attacks with 90% accuracy. The model was populated with the specifics of 150,000 attacks carried out between 1970-2015. The framework analyzes relations among the dataset, tracing patterns and ultimately predicting the characteristics of future attacks. http://www.upi.com/science_news/2017/03/02/new-model-predicts-characteristics-of-terror-attacks-with-90-percent-accuracy/8271488478622/?utm_source=sec&utm_campaign=sl&utm_medium=1 4 comments science
- factbook.json 2020 Update - 260+ Public Domain (Free) World Country Profiles / Datasets (incl. Population, Internet Users, etc.) https://github.com/factbook/factbook.json 3 comments javascript
- At present, 1.8 billion people are "drought-stricken" globally—representing nearly one out of every four of the global population of 8 billion—according to the UNCCD analysis, which sampled international disaster datasets from 101 countries https://phys.org/news/2023-12-people-drought-stricken.html#google_vignette 4 comments environment
- factbook gem & factbook.json 2020 Update - 260+ Public Domain (Free) World Country Profiles / Datasets (incl. Population, Internet Users, etc.) https://github.com/factbook/factbook 8 comments ruby
- Suicide risk more than quadruples for people with cancer. After analyzing the data of 8 million patients, the researchers found that 13,311 of the patients in the dataset -- 0.15 percent -- died by suicide, more than four times the risk of the general population https://news.psu.edu/story/553921/2019/01/14/research/suicide-risk-more-quadruples-people-cancer 4 comments science