Internet Archive: Difference between revisions

From Consumer Rights Wiki
Kai (talk | contribs)
Added Incidents Section with Sources
Kai (talk | contribs)
m References: Fixed References
Line 16: Line 16:
{{StubNotice}}
{{StubNotice}}


== Consumer-impact summary ==
==Consumer-impact summary==
[TBA]
[TBA]


== Incidents ==
==Incidents==


=== Login-only items for legally dubious content (2016-present) ===
===Login-only items for legally dubious content (2016-present)===
On January 13th of 2016 Hank Bromley (hank_b) of the Internet Archive made a collection for uploads considered legally dubious and only viewable with an account. [https://archive.org/details/loggedin?tab=about]
On January 13th of 2016 Hank Bromley (hank_b) of the Internet Archive made a collection for uploads considered legally dubious and only viewable with an account. <ref>{{Cite web |title=Download & Streaming : Log In Required : Internet Archive |url=https://archive.org/details/loggedin?tab=about |url-status=live |archive-url=https://archive.ph/rSKdG |archive-date=2025-08-16 |access-date=2025-08-16 |website=[[Internet Archive]]}}</ref>


These uploads cannot be viewed by users logged out and cannot be downloaded by anyone outside of the admins making any of these pieces of content inaccessible. [https://archive.org/post/1092552/log-in-required-after-logging-in]
These uploads cannot be viewed by users logged out and cannot be downloaded by anyone outside of the admins making any of these pieces of content inaccessible. <ref>{{Cite web |title=Internet Archive Forums: Log In Required, after logging in. |url=https://archive.org/post/1092552/log-in-required-after-logging-in |url-status=live |archive-url=https://archive.ph/fFVg6 |archive-date=2025-08-16 |access-date=2025-08-16 |website=[[Internet Archive]]}}</ref>


=== Archived Website Censorship ===
===Archived Website Censorship===
IA Retroactively removes (hides) material covered by robots.txt restrictions (They may have stopped doing this - also don't confuse this for the [https://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/ criteria for actually saving/archiving a page], I am talking about end user access to saved/archived content).
IA Retroactively removes (hides) material covered by robots.txt restrictions (They may have stopped doing this - also don't confuse this for the [https://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/ criteria for actually saving/archiving a page], I am talking about end user access to saved/archived content).


=== Data Breaches (<2012-2024) ===
===Data Breaches (2012-2024)===
On May 19th of 2017 The Archive's Development Manager Katie Barrett made a blog post detailing that anyone who'd made their accounts ''before 2012'' had to change their passwords as the site had been breached with user's public information and lightly encrypted passwords being leaked. [https://blog.archive.org/2017/05/19/re-user-account-breach/]
On May 19th of 2017 The Archive's Development Manager Katie Barrett made a blog post detailing that anyone who'd made their accounts ''before 2012'' had to change their passwords as the site had been breached with user's public information and lightly encrypted passwords being leaked. <ref>{{Cite web |last=Barrett |first=Katie |date=2017-05-19 |title=Re: User account breach {{!}} Internet Archive Blogs |url=https://blog.archive.org/2017/05/19/re-user-account-breach/ |url-status=live |archive-url=https://web.archive.org/web/20250520030556/https://blog.archive.org/2017/05/19/re-user-account-breach/ |archive-date=2025-05-20 |access-date=2025-08-16 |website=[[Internet Archive]]}}</ref>


On October 9th of 2024 users on the Internet Archive get pop-ups that the website had been hacked with notifications appearing from the perpetrators at around 9PM CST [https://xcancel.com/usetraceix/status/1844121916573090027][https://xcancel.com/DarkWebInformer/status/1844123206413943274][https://xcancel.com/jwdomb/status/1844123760720548040], and an hour later Troy Hunt of HaveIBeenPwned would confirm the breach. [https://xcancel.com/troyhunt/status/1844136762727448644]
On October 9th of 2024 users on the Internet Archive get pop-ups that the website had been hacked with notifications appearing from the perpetrators at around 9PM CST <ref>{{Cite web |date=2024-10-09 |title=Dark Web Informer on X: "🚨🚨🚨The Wayback Machine looks like it has been compromised.


Around 31 Million Users were affected with their User IDs, Emails, Encrypted Passwords and Usernames being leaked in it. [https://www.tomsguide.com/computing/online-security/31-million-users-impacted-by-internet-archive-data-breach-what-we-know][https://www.pcmag.com/news/hacker-defaces-internet-archive-claims-it-suffered-a-breach][https://blog.archive.org/2024/10/21/internet-archive-services-update-2024-10-21/][https://www.theverge.com/2024/10/14/24269741/internet-archive-online-read-only-data-breach-outage]
web[.]archive[.]org https://t.co/MvPshUrs7i" / X |url=https://x.com/DarkWebInformer/status/1844123206413943274 |url-status=live |archive-url=https://ghostarchive.org/archive/ADnLW |archive-date=2024-10-12 |access-date=2025-08-16 |website=[[Twitter]]}}</ref>, and an hour later Troy Hunt of HaveIBeenPwned would confirm the breach. <ref>{{Cite web |last=Hunt |first=Troy |date=2024-10-09 |title=Troy Hunt on X: "Hi folks, yes, I'm aware of this. I've been in communication with the Internet Archive over the last few days re the data breach, didn't know the site was defaced until people started flagging it with me just now. More soon." / X |url=https://x.com/troyhunt/status/1844136762727448644 |url-status=live |archive-url=https://ghostarchive.org/archive/R8bRB |archive-date=2024-08-10 |access-date=2025-08-16 |website=[[Twitter]]}}</ref>


== References ==
Around 31 Million Users were affected with their User IDs, Emails, Encrypted Passwords and Usernames being leaked in it. <ref>{{Cite news |last=LeClair |first=Dave |date=2024-10-11 |title="31 million users impacted by Internet Archive data breach — what we know" |url=https://www.tomsguide.com/computing/online-security/31-million-users-impacted-by-internet-archive-data-breach-what-we-know |url-status=live |archive-url=https://web.archive.org/web/20241109231711/https://www.tomsguide.com/computing/online-security/31-million-users-impacted-by-internet-archive-data-breach-what-we-know |archive-date=2024-11-09 |access-date=2025-08-16 |work=Tom's Guide}}</ref>
 
==References==
<references />

Revision as of 19:47, 16 August 2025

Internet Archive
Basic information
Founded 1996
Legal structure Private
Industry Digital Library
Official website https://archive.org/

The Internet Archive is an American non-profit digital library founded in 1996 to provide free "universal access to all knowledge" and preserve digital history.


https://archive.org/

https://web.archive.org/


Article Status Notice: This Article is a stub


This article is underdeveloped, and needs additional work to meet the wiki's Content Guidelines and be in line with our Mission Statement for comprehensive coverage of consumer protection issues.

Consumer-impact summary

[TBA]

Incidents

Login-only items for legally dubious content (2016-present)

On January 13, 2016, Hank Bromley (hank_b) of the Internet Archive created a collection for uploads considered legally dubious; items in it are viewable only with an account. [1]

These uploads cannot be viewed by logged-out users and cannot be downloaded by anyone other than the admins, which leaves the content effectively inaccessible. [2]

Archived Website Censorship

The Internet Archive retroactively removes (hides) archived material covered by robots.txt restrictions, although it may have stopped doing this. This concerns end-user access to content that has already been saved or archived, not the criteria for saving or archiving a page in the first place.

Data Breaches (2012-2024)

On May 19, 2017, the Archive's Development Manager, Katie Barrett, published a blog post stating that anyone who had created an account before 2012 had to change their password, as the site had been breached and users' public information and lightly encrypted passwords had been leaked. [3]

On October 9, 2024, users of the Internet Archive received pop-ups from the perpetrators stating that the website had been hacked; the notifications appeared at around 9 PM CST. [4] An hour later, Troy Hunt of HaveIBeenPwned confirmed the breach. [5]

Around 31 million users were affected; their user IDs, email addresses, encrypted passwords, and usernames were leaked. [6]

References

  1. "Download & Streaming : Log In Required : Internet Archive". Internet Archive. Archived from the original on 2025-08-16. Retrieved 2025-08-16.
  2. "Internet Archive Forums: Log In Required, after logging in". Internet Archive. Archived from the original on 2025-08-16. Retrieved 2025-08-16.
  3. Barrett, Katie (2017-05-19). "Re: User account breach | Internet Archive Blogs". Internet Archive. Archived from the original on 2025-05-20. Retrieved 2025-08-16.
  4. "Dark Web Informer on X: "🚨🚨🚨The Wayback Machine looks like it has been compromised. web[.]archive[.]org https://t.co/MvPshUrs7i" / X". Twitter. 2024-10-09. Archived from the original on 2024-10-12. Retrieved 2025-08-16.
  5. Hunt, Troy (2024-10-09). "Troy Hunt on X: "Hi folks, yes, I'm aware of this. I've been in communication with the Internet Archive over the last few days re the data breach, didn't know the site was defaced until people started flagging it with me just now. More soon." / X". Twitter. Archived from the original on 2024-08-10. Retrieved 2025-08-16.
  6. LeClair, Dave (2024-10-11). "31 million users impacted by Internet Archive data breach — what we know". Tom's Guide. Archived from the original on 2024-11-09. Retrieved 2025-08-16.