Artificial intelligence/training: Difference between revisions
Added archive URLs for 12 citation(s) using CRWCitationBot |
Add archival link for citation [1] |
||
| Line 78: | Line 78: | ||
On 15 June 2024, an investigation by Apple blog MacStories found that Perplexity does not follow its own documented policies when accessing content the user requests from the web. In their testing, the scraper pretended to be Chrome 111 running on Windows 10, connecting from an IP address not found in Perplexity's publicly-listed IP address ranges.<ref>{{Cite web |last=Knight |first=Robb |date=15 Jun 2024 |title=Perplexity AI Is Lying about Their User Agent |url=https://rknight.me/blog/perplexity-ai-is-lying-about-its-user-agent/ |url-status=live |archive-url=http://web.archive.org/web/20260123072701/https://rknight.me/blog/perplexity-ai-is-lying-about-its-user-agent/ |archive-date=23 Jan 2026}}</ref> MacStories' findings were confirmed by a WIRED investigation.<ref>{{Cite web |last=Mehrotra |first=Dhruv |last2=Marchman |first2=Tim |date=19 Jun 2024 |title=Perplexity Is a Bullshit Machine |url=https://www.wired.com/story/perplexity-is-a-bullshit-machine/ |website=WIRED |url-status=live |archive-url=http://web.archive.org/web/20260201173126/https://www.wired.com/story/perplexity-is-a-bullshit-machine/ |archive-date=1 Feb 2026}}</ref> Perplexity responded by removing its list of IP addresses. | On 15 June 2024, an investigation by Apple blog MacStories found that Perplexity does not follow its own documented policies when accessing content the user requests from the web. In their testing, the scraper pretended to be Chrome 111 running on Windows 10, connecting from an IP address not found in Perplexity's publicly-listed IP address ranges.<ref>{{Cite web |last=Knight |first=Robb |date=15 Jun 2024 |title=Perplexity AI Is Lying about Their User Agent |url=https://rknight.me/blog/perplexity-ai-is-lying-about-its-user-agent/ |url-status=live |archive-url=http://web.archive.org/web/20260123072701/https://rknight.me/blog/perplexity-ai-is-lying-about-its-user-agent/ |archive-date=23 Jan 2026}}</ref> MacStories' findings were confirmed by a WIRED investigation.<ref>{{Cite web |last=Mehrotra |first=Dhruv |last2=Marchman |first2=Tim |date=19 Jun 2024 |title=Perplexity Is a Bullshit Machine |url=https://www.wired.com/story/perplexity-is-a-bullshit-machine/ |website=WIRED |url-status=live |archive-url=http://web.archive.org/web/20260201173126/https://www.wired.com/story/perplexity-is-a-bullshit-machine/ |archive-date=1 Feb 2026}}</ref> Perplexity responded by removing its list of IP addresses. | ||
On 27 June 2024, [[Amazon]] announced an investigation into Perplexity AI, suggesting the behavior may be considered abusive under Amazon Web Services terms of service:<ref name="perplexity-aws">{{Cite web |last=Mehrotra |first=Dhruv |date=27 Jun 2024 |title=Amazon Is Investigating Perplexity Over Claims of Scraping Abuse |url=https://www.wired.com/story/aws-perplexity-bot-scraping-investigation/ |website=WIRED}}</ref> | On 27 June 2024, [[Amazon]] announced an investigation into Perplexity AI, suggesting the behavior may be considered abusive under Amazon Web Services terms of service:<ref name="perplexity-aws">{{Cite web |last=Mehrotra |first=Dhruv |date=27 Jun 2024 |title=Amazon Is Investigating Perplexity Over Claims of Scraping Abuse |url=https://www.wired.com/story/aws-perplexity-bot-scraping-investigation/ |website=WIRED |url-status=live |archive-url=https://web.archive.org/web/20260202160655/https://www.wired.com/story/aws-perplexity-bot-scraping-investigation/ |archive-date=2 Feb 2026}}</ref> | ||
<blockquote> | <blockquote> | ||