Internet Archive: Difference between revisions

moved stubnotice
No edit summary
 
(One intermediate revision by one other user not shown)
Line 21: Line 21:
These uploads cannot be viewed by logged-out users and cannot be downloaded by anyone except the admins, making any of these pieces of content inaccessible.<ref>{{Cite web |title=Internet Archive Forums: Log In Required, after logging in. |url=https://archive.org/post/1092552/log-in-required-after-logging-in |url-status=live |archive-url=https://archive.ph/fFVg6 |archive-date=2025-08-16 |access-date=2025-08-16 |website=[[Internet Archive]]}}</ref>
These uploads cannot be viewed by logged-out users and cannot be downloaded by anyone except the admins, making any of these pieces of content inaccessible.<ref>{{Cite web |title=Internet Archive Forums: Log In Required, after logging in. |url=https://archive.org/post/1092552/log-in-required-after-logging-in |url-status=live |archive-url=https://archive.ph/fFVg6 |archive-date=2025-08-16 |access-date=2025-08-16 |website=[[Internet Archive]]}}</ref>


===Archived Website Censorship===
===Archived website removal===
The Internet Archive retroactively removes or hides material covered by robots.txt restrictions.{{Citation needed}}<!--(They may have stopped doing this - also don't confuse this for the [https://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/ criteria for actually saving/archiving a page], I am talking about end user access to saved/archived content).-->
The Archive accepts DMCA takedown requests of websites whose owners no longer want their sites archived<ref>{{Cite news |last=Bixenspan |first=David |date=2018-11-28 |title=When the Internet Archive Forgets |url=https://gizmodo.com/when-the-internet-archive-forgets-1830462131 |url-status=live |access-date=2025-08-31 |work=[[Gizmodo]]}}</ref> causing certain sites to be inaccessible.


===Data Breaches (2012-2024)===
The Internet Archive ''used'' to hide material covered by robots.txt restrictions but that was changed on April 17, 2017.<ref>{{Cite web |last=Graham |first=Mark |date=2017-04-17 |title=Robots.txt meant for search engines don’t work well for web archives |url=https://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/ |url-status=live |archive-url=https://web.archive.org/web/20170417131508/http://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/ |archive-date=2017-04-17 |access-date=2025-08-31 |website=Internet Archive}}</ref>
On May 19, 2017, The Archive's Development Manager Katie Barrett made a blog post detailing that anyone who had created their account before 2012 had to change their password as the site had been breached with user's public information and lightly encrypted passwords being leaked.<ref>{{Cite web |last=Barrett |first=Katie |date=2017-05-19 |title=Re: User account breach {{!}} Internet Archive Blogs |url=https://blog.archive.org/2017/05/19/re-user-account-breach/ |url-status=live |archive-url=https://web.archive.org/web/20250520030556/https://blog.archive.org/2017/05/19/re-user-account-breach/ |archive-date=2025-05-20 |access-date=2025-08-16 |website=[[Internet Archive]]}}</ref>
===Data breaches (2012-2024)===
On May 19, 2017, The Archive's Development Manager made a blog post detailing that anyone who had created their account before 2012 had to change their password as the site had been breached with user's public information and lightly encrypted passwords being leaked.<ref>{{Cite web |last=Barrett |first=Katie |date=2017-05-19 |title=Re: User account breach {{!}} Internet Archive Blogs |url=https://blog.archive.org/2017/05/19/re-user-account-breach/ |url-status=live |archive-url=https://web.archive.org/web/20250520030556/https://blog.archive.org/2017/05/19/re-user-account-breach/ |archive-date=2025-05-20 |access-date=2025-08-16 |website=[[Internet Archive]]}}</ref>


On October 9, 2024, users on the Internet Archive got pop-ups that the website had been hacked with notifications appearing from the perpetrators at around 9PM CST,<ref>{{Cite web |date=2024-10-09 |title=Dark Web Informer on X |url=https://x.com/DarkWebInformer/status/1844123206413943274 |url-status=live |archive-url=https://ghostarchive.org/archive/ADnLW |archive-date=2024-10-12 |access-date=2025-08-16 |website=[[Twitter]]}}</ref> and an hour later Troy Hunt of HaveIBeenPwned confirmed the breach.<ref>{{Cite web |last=Hunt |first=Troy |date=2024-10-09 |title=Troy Hunt on X: "Hi folks, yes, I'm aware of this. I've been in communication with the Internet Archive over the last few days re the data breach, didn't know the site was defaced until people started flagging it with me just now. More soon." / X |url=https://x.com/troyhunt/status/1844136762727448644 |url-status=live |archive-url=https://ghostarchive.org/archive/R8bRB |archive-date=2024-08-10 |access-date=2025-08-16 |website=[[Twitter]]}}</ref>
On October 9, 2024, users on the Internet Archive got pop-ups that the website had been hacked with notifications appearing from the perpetrators at around 9PM CST,<ref>{{Cite web |date=2024-10-09 |title=Dark Web Informer on X |url=https://x.com/DarkWebInformer/status/1844123206413943274 |url-status=live |archive-url=https://ghostarchive.org/archive/ADnLW |archive-date=2024-10-12 |access-date=2025-08-16 |website=[[Twitter]]}}</ref> and an hour later Troy Hunt of HaveIBeenPwned confirmed the breach.<ref>{{Cite web |last=Hunt |first=Troy |date=2024-10-09 |title=Troy Hunt on X: "Hi folks, yes, I'm aware of this. I've been in communication with the Internet Archive over the last few days re the data breach, didn't know the site was defaced until people started flagging it with me just now. More soon." / X |url=https://x.com/troyhunt/status/1844136762727448644 |url-status=live |archive-url=https://ghostarchive.org/archive/R8bRB |archive-date=2024-08-10 |access-date=2025-08-16 |website=[[Twitter]]}}</ref>