Kai (talk | contribs)
m Corrected information
Kai (talk | contribs)
m Added archived pages to citations


===Archived website removal===
The Archive accepts DMCA takedown requests from website owners who no longer want their sites archived,<ref>{{Cite news |last=Bixenspan |first=David |date=2018-11-28 |title=When the Internet Archive Forgets |url=https://gizmodo.com/when-the-internet-archive-forgets-1830462131 |url-status=live |archive-url=https://web.archive.org/web/20250805030527/https://gizmodo.com/when-the-internet-archive-forgets-1830462131 |archive-date=2025-08-05 |access-date=2025-08-31 |work=[[Gizmodo]]}}</ref> which causes certain archived sites to become inaccessible.


The Internet Archive ''used'' to hide material covered by robots.txt restrictions, but this changed on April 17, 2017.<ref>{{Cite web |last=Graham |first=Mark |date=2017-04-17 |title=Robots.txt meant for search engines don’t work well for web archives |url=https://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/ |url-status=live |archive-url=https://web.archive.org/web/20170417131508/http://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/ |archive-date=2017-04-17 |access-date=2025-08-31 |website=Internet Archive}}</ref>


===Removal of noindex function on uploaded items===
In 2023, the Internet Archive reportedly removed users' ability to use the noindex function, which had hidden items from its internal search engine, and made items whose noindex value was true appear in search results. The decision was criticized on the grounds that it may jeopardize users' rights, including privacy. When confronted about it, Jason Scott, a staff member of the Internet Archive, reportedly responded with the following:<ref>{{Cite web |date=2023-07-22 |title=The removal of "noindex" from the Internet Archive, and associated risks. |url=https://old.reddit.com/r/DataHoarder/comments/156s7di/the_removal_of_noindex_from_the_internet_archive/ |url-status=live |archive-url=https://web.archive.org/web/20241214121917/https://old.reddit.com/r/DataHoarder/comments/156s7di/the_removal_of_noindex_from_the_internet_archive/ |archive-date=2024-12-14 |access-date=2025-10-28 |website=Reddit}}</ref><ref>{{Cite web |date=2023-06-06 |title=Internet Archive Ish |url=https://old.reddit.com/r/NoStupidQuestions/comments/142nm9h/internet_archive_ish/ |url-status=live |archive-url=https://web.archive.org/web/20241215072041/https://old.reddit.com/r/NoStupidQuestions/comments/142nm9h/internet_archive_ish/ |archive-date=2024-12-15 |access-date=2025-10-28 |website=Reddit}}</ref>


<code>''There is no bug or mistake in removing no-index settings for many Internet Archive items in the Community collection.''
<code>''There is no bug or mistake in removing no-index settings for many Internet Archive items in the Community collection.''