Jump to content

Internet Archive: Difference between revisions

From Consumer Rights Wiki
Rain (talk | contribs)
Added introduction.
Kai (talk | contribs)
Added Incidents Section with Sources
Line 16: Line 16:
{{StubNotice}}
{{StubNotice}}


==Censorship==
== Consumer-impact summary ==
Topics:
[TBA]


Login-only items for legally dubious content,
== Incidents ==


Obeys removal requests by site owners sometimes of entire domains,
=== Login-only items for legally dubious content (2016-present) ===
On January 13th of 2016 Hank Bromley (hank_b) of the Internet Archive made a collection for uploads considered legally dubious and only viewable with an account. [https://archive.org/details/loggedin?tab=about]


Retroactively removes (hides) material covered by robots.txt restrictions (They may have stopped doing this - also don't confuse this for the [https://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/ criteria for actually saving/archiving a page], I am talking about end user access to saved/archived content)
These uploads cannot be viewed by users logged out and cannot be downloaded by anyone outside of the admins making any of these pieces of content inaccessible. [https://archive.org/post/1092552/log-in-required-after-logging-in]
 
=== Archived Website Censorship ===
IA Retroactively removes (hides) material covered by robots.txt restrictions (They may have stopped doing this - also don't confuse this for the [https://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/ criteria for actually saving/archiving a page], I am talking about end user access to saved/archived content).
 
=== Data Breaches (<2012-2024) ===
On May 19th of 2017 The Archive's Development Manager Katie Barrett made a blog post detailing that anyone who'd made their accounts ''before 2012'' had to change their passwords as the site had been breached with user's public information and lightly encrypted passwords being leaked. [https://blog.archive.org/2017/05/19/re-user-account-breach/]
 
On October 9th of 2024 users on the Internet Archive get pop-ups that the website had been hacked with notifications appearing from the perpetrators at around 9PM CST [https://xcancel.com/usetraceix/status/1844121916573090027][https://xcancel.com/DarkWebInformer/status/1844123206413943274][https://xcancel.com/jwdomb/status/1844123760720548040], and an hour later Troy Hunt of HaveIBeenPwned would confirm the breach. [https://xcancel.com/troyhunt/status/1844136762727448644]
 
Around 31 Million Users were affected with their User IDs, Emails, Encrypted Passwords and Usernames being leaked in it. [https://www.tomsguide.com/computing/online-security/31-million-users-impacted-by-internet-archive-data-breach-what-we-know][https://www.pcmag.com/news/hacker-defaces-internet-archive-claims-it-suffered-a-breach][https://blog.archive.org/2024/10/21/internet-archive-services-update-2024-10-21/][https://www.theverge.com/2024/10/14/24269741/internet-archive-online-read-only-data-breach-outage]
 
== References ==

Revision as of 19:22, 16 August 2025

Internet Archive
Basic information
Founded 1996
Legal structure Private
Industry Digital Library
Official website https://archive.org/

The Internet Archive is an American non-profit digital library founded in 1996 to provide free "universal access to all knowledge" and preserve digital history.


https://archive.org/

https://web.archive.org/


Article Status Notice: This Article is a stub


This article is underdeveloped, and needs additional work to meet the wiki's Content Guidelines and be in line with our Mission Statement for comprehensive coverage of consumer protection issues. Learn more ▼

Consumer-impact summary

[TBA]

Incidents

Login-only items for legally dubious content (2016-present)

On January 13th of 2016 Hank Bromley (hank_b) of the Internet Archive made a collection for uploads considered legally dubious and only viewable with an account. [1]

These uploads cannot be viewed by users logged out and cannot be downloaded by anyone outside of the admins making any of these pieces of content inaccessible. [2]

Archived Website Censorship

IA Retroactively removes (hides) material covered by robots.txt restrictions (They may have stopped doing this - also don't confuse this for the criteria for actually saving/archiving a page, I am talking about end user access to saved/archived content).

Data Breaches (<2012-2024)

On May 19th of 2017 The Archive's Development Manager Katie Barrett made a blog post detailing that anyone who'd made their accounts before 2012 had to change their passwords as the site had been breached with user's public information and lightly encrypted passwords being leaked. [3]

On October 9th of 2024 users on the Internet Archive get pop-ups that the website had been hacked with notifications appearing from the perpetrators at around 9PM CST [4][5][6], and an hour later Troy Hunt of HaveIBeenPwned would confirm the breach. [7]

Around 31 Million Users were affected with their User IDs, Emails, Encrypted Passwords and Usernames being leaked in it. [8][9][10][11]

References