Hacker News
- Multiple AI companies bypassing web standard to scrape publisher sites https://www.reuters.com/technology/artificial-intelligence/multiple-ai-companies-bypassing-web-standard-scrape-publisher-sites-licensing-2024-06-21/ 24 comments
- Multiple AI companies bypassing web standard to scrape publisher sites, licensing firm says https://www.reuters.com/technology/artificial-intelligence/multiple-ai-companies-bypassing-web-standard-scrape-publisher-sites-licensing-2024-06-21/ 4 comments technology
Linking pages
- Perplexity’s grand theft AI - The Verge https://www.theverge.com/2024/6/27/24187405/perplexity-ai-twitter-lie-plagiarism 90 comments
- AI companies are reportedly still scraping websites despite protocols meant to block them https://www.engadget.com/ai-companies-are-reportedly-still-scraping-websites-despite-protocols-meant-to-block-them-132308524.html 51 comments
- Several AI companies said to be ignoring robots dot txt exclusion, scraping content without permission: report | Tom's Hardware https://www.tomshardware.com/tech-industry/artificial-intelligence/several-ai-companies-said-to-be-ignoring-robots-dot-txt-exclusion-scraping-content-without-permission-report 29 comments
- OpenAI, Anthropic Ignore Rule That Prevents Bots Scraping Web Content - Business Insider https://www.businessinsider.com/openai-anthropic-ai-ignore-rule-scraping-web-contect-robotstxt 28 comments
- Robots.txt Won't Save You—Ryan Bagley https://rb.ax/blog/robots.txt-wont-save-you/ 1 comment
- Scrape like a pro... but not like an AI company https://substack.thewebscraping.club/p/do-not-scrape-like-ai-companies 1 comment
- Condé Nast has reportedly accused AI search startup Perplexity of plagiarism https://www.engadget.com/conde-nast-has-reportedly-accused-ai-search-startup-perplexity-of-plagiarism-191639677.html 0 comments
Related searches:
Search whole site: site:www.reuters.com
Search title: Multiple AI companies bypassing web standard to scrape publisher sites
See how to search.