Generative AI: Difference between revisions
No edit summary |
TasmanianRex (talk | contribs) |
||
(12 intermediate revisions by 7 users not shown) | |||
Line 2: | Line 2: | ||
<!-- ADMIN COMMENT: This article is fine as a collection of consumer-facing GenAI issues and incidents, however we need to avoid straying into territory relating to workers rights, the ethics of web-scraping for training data, or other similar concerns. This wiki is to be strictly focussed on consumer affairs! If a company changes the terms of a contract to force users into letting their private data be used for AI training, that is relevant. If a company has laid off 500 workers to replace them with a chatbot, the treatment of those workers is not relevant to this wiki (although consumer-facing issues caused by the use of the chatbot might be!).--> | <!-- ADMIN COMMENT: This article is fine as a collection of consumer-facing GenAI issues and incidents, however we need to avoid straying into territory relating to workers rights, the ethics of web-scraping for training data, or other similar concerns. This wiki is to be strictly focussed on consumer affairs! If a company changes the terms of a contract to force users into letting their private data be used for AI training, that is relevant. If a company has laid off 500 workers to replace them with a chatbot, the treatment of those workers is not relevant to this wiki (although consumer-facing issues caused by the use of the chatbot might be!).--> | ||
Generative AI, also referred to as GenAI or simply AI, is a program whose existence is to generate pieces of media based | '''Generative AI''', also referred to as GenAI or simply AI, is a program whose existence is to generate pieces of media based on a simple prompt (e.g. "How long do I heat popcorn for in the microwave?" or "bowl of buttery popcorn, realistic, artstation, pretty") with various and random results. GenAI over its currently short existence being accessible to the public has garnered large amounts of concern across the various fields it has been applied to. <!-- I used to help operate a Kialo discussion covering Generative AI, that discussion may be beneficial for reference as a way to further flesh this page out. Just please take note that most claims are a few years old and may not be accurate, so please fact-check any statements from there before mentioning anything anti-consumer here | ||
https://www.kialo.com/is-ai-art-theft-60905 --> | https://www.kialo.com/is-ai-art-theft-60905 --> | ||
== General | ==General controversies surrounding generative AI== | ||
{| class="wikitable" | {| class="wikitable" | ||
|+ | |+ | ||
Line 18: | Line 18: | ||
|- | |- | ||
|Replacing skilled workers with AI | |Replacing skilled workers with AI | ||
|Due to its generalized nature, jobs across fields from digital art to writing and programming have had experienced staff replaced by lesser-paid (and often lesser-experienced) employees who would be tasked to use generative tools to do their work. | |Due to its generalized nature, jobs across fields from digital art to writing and programming have had experienced staff replaced by lesser-paid (and often lesser-experienced) employees who would be tasked to use generative tools to do their work. To remain relevant to the wiki's purpose, the usage leads to the detriment of product quality for consumers, such as representatives replaced with chatbots, or products being sold by companies use poorly-generated content that may harm the consumer.<ref>{{Cite web |last=Grady |first=Constance |date=29 Apr 2024 |title=The AI grift that can literally poison you |url=https://www.vox.com/24141648/ai-ebook-grift-mushroom-foraging-mycological-society |url-status=live |access-date=31 Mar 2025 |website=Vox}}</ref><!-- Reference included more to represent what is intended --> | ||
| | | | ||
|- | |- | ||
Line 26: | Line 26: | ||
|} | |} | ||
== Specific | ==Specific controversies involving generative AI== | ||
=== Reddit | ===Reddit - Training AI on user-generated content=== | ||
In late 2024, Reddit announced the release of 'Reddit Answers' | In late 2024, [[Reddit]] announced the release of 'Reddit Answers,' a large language model (LLM) that was publicly stated<ref>{{Cite web |title=Reddit Answers (Currently in Beta) |url=https://support.reddithelp.com/hc/en-us/articles/32026729424916-Reddit-Answers-Currently-in-Beta |url-status=live |access-date=31 Mar 2025 |website=[[Reddit]]}}</ref> to use content created by users to train the tool, without requiring prior consent or prior public notice. <!-- Needs further coverage here --> | ||
=== DeviantArt DreamUp <!-- Considering the over 2 year long history that continues to have new drama stir from this, we should look into eventually making a dedicated article focused on DreamUp --> === | ===DeviantArt - DreamUp<!-- Considering the over 2 year long history that continues to have new drama stir from this, we should look into eventually making a dedicated article focused on DreamUp -->=== | ||
While more speculative, it is reasonable for users to assume<ref>https://www.reddit.com/r/AI_Generator_Guide/comments/167sbit/what_i_think_about_deviantarts_ai_choices/</ref> that when DeviantArt initially automatically opted all users into allowing their work to be training data for generative AI<ref>https://www.deviantart.com/izzy-paw/journal/deviantart-s-AI-art-program-and-how-to-opt-out-936581886</ref><ref>https://en.digitalreport.com.tr/deviantart-dreamup-how-to-use-ai-opt-out/</ref> | While more speculative, it is reasonable for users to assume<ref>{{Cite web |title=What I think about DeviantArt's AI Choices |url=https://www.reddit.com/r/AI_Generator_Guide/comments/167sbit/what_i_think_about_deviantarts_ai_choices/ |url-status=live |access-date=31 Mar 2025 |website=[[Reddit]]}}</ref> that when [[DeviantArt]] initially automatically opted all users into allowing their work to be training data for generative AI,<ref>{{Cite web |date=11 Nov 2022 |title=deviantart's AI art program (and how to opt out) |url=https://www.deviantart.com/izzy-paw/journal/deviantart-s-AI-art-program-and-how-to-opt-out-936581886 |url-status=live |access-date=31 Mar 2025 |website=Deviant Art}}</ref><ref>{{Cite web |last=Erdine |first=Önder |date=16 Nov 2022 |title=DeviantArt DreamUp is the latest contender for the AI art crown |url=https://en.digitalreport.com.tr/deviantart-dreamup-how-to-use-ai-opt-out/ |url-status=live |access-date=31 Mar 2025 |website=Digital Report}}</ref> that all content uploaded to DeviantArt was used as training data for their DreamUp tool, however according to statements from DeviantArt CEO Moti Levy,<ref>{{Cite web |last=Robertston |first=Adi |date=15 Nov 2022 |title=How DeviantArt is navigating the AI art minefield |url=https://www.theverge.com/2022/11/15/23449036/deviantart-ai-art-dreamup-training-data-controversy |url-status=live |access-date=31 Mar 2025 |website=The Verge}}</ref> DeviantArt did not plan or intend to train their tool based on user-generated works and that any user-generated works that were used in their model, were introduced by StabilityAI. Regardless, the introduction of DreamUp to the art sharing platform has both stirred controversy on the platform,<ref>{{Cite web |last=Edwards |first=Benj |date=11 Nov 2022 |title=DeviantArt upsets artists with its new AI art generator, DreamUp [Updated] |url=https://arstechnica.com/information-technology/2022/11/deviantart-upsets-artists-with-its-new-ai-art-generator-dreamup/ |url-status=live |access-date=31 Mar 2025 |website=[[ArsTechnica]]}}</ref> and also fractured the platform into 2 parties,<ref>{{Cite web |last=Duchess Celestia |date=13 Nov 2022 |title=DeviantART Just Betrayed Its Whole Community. (DreamUp AI Controversy) {{!}}{{!}} SPEEDPAINT + COMMENTARY |url=https://www.youtube.com/watch?v=IGj_3OhMrAU |url-status=live |access-date=31 Mar 2025 |website=[[YouTube]]}}</ref> those for generative AI (typically those who hold newer accounts) and those against (typically users who have existed on the platform for far longer.) Due to the introduction of DreamUp, the platform has been cluttered by AI generated images, and staff have historically, frequently, and intentionally featured multiple users who exclusively upload GenAI content<ref>{{Cite web |date=22 Jul 2024 |title=DeviantArt Seller: StygianAI |url=https://www.deviantart.com/team/art/DeviantArt-Seller-StygianAI-1077776294 |access-date=31 Mar 2025 |website=Deviant Art}}</ref><ref>{{Cite web |date=9 Oct 2024 |title=Create on DeviantArt: VeilAI |url=https://www.deviantart.com/team/art/Create-on-DeviantArt-VeilAI-1108146133 |url-status=live |access-date=31 Mar 2025 |website=Deviant Art}}</ref><ref>{{Cite web |date=15 Jul 2024 |title=DeviantArt Seller: ExeFelix |url=https://www.deviantart.com/team/art/DeviantArt-Seller-ExeFelix-1075192370 |url-status=live |access-date=31 Mar 2025 |website=Deviant Art}}</ref> or post content that uses generative content as a base,<ref>{{Cite web |date=9 Oct 2024 |title=Create on DeviantArt: AKoukis |url=https://www.deviantart.com/team/art/Create-on-DeviantArt-AKoukis-1108151629 |url-status=live |access-date=31 Mar 2025 |website=Deviant Art}}</ref> with a majority of featured creators being ones who nearly or exclusively upload AI generated content.<!-- I was scrolling through their gallery and most featured artist posts were about AI creators, I stopped my search when I reached posts that released before the generative AI controversies on the platform occurred, which had a rough stopping point of around Q4 2022. | ||
https://www.deviantart.com/team/gallery --><!-- Due to my close familiarity with the situation, yes, I developed this section a lot more than initially planned. --> | https://www.deviantart.com/team/gallery --><!-- Due to my close familiarity with the situation, yes, I developed this section a lot more than initially planned. --> | ||
===LAION-5b training database=== | |||
Many users have had their content scraped by LAION to power their training database, and the only way they can opt out is via a third party<ref>[https://haveibeentrained.com/ Have I Been Trained?]</ref>. | |||
=== | ====<big>StackOverflow - Overflow AI training data based on user generated content</big>==== | ||
In Late July, 2023, [[Overflow AI]] was released by [[StackOverflow]]. The content used to train this AI was built off of questions and answers left by the StackOverflow community.<ref>{{Cite news |last=StackOverflow |date=Jul 27, 2023 |title=Announcing OverflowAI |url=https://stackoverflow.blog/2023/07/27/announcing-overflowai/ |access-date=Mar 30, 2025 |work=StackOverflow blog}}</ref> This action essentially subverts an existing policy from [[StackOverflow]], where users cannot generate AI responses for answers.<ref>{{Cite web |last=StackOverflow |title=Generative AI Policy |url=https://stackoverflow.com/help/gen-ai-policy |access-date=Mar 30, 2025 |website=StackOverflow help center}}</ref> The only way users currently have the capability of not having their content scraped for Overflow AI is to manually delete all posts and topics before deleting your account, however the effectiveness is questionable considering their [[Get Out Clause]],<ref>{{Cite web |last=StackOverflow |title=How do I delete all my contributions? |url=https://stackoverflow.com/help/delete-content |access-date=Mar 30, 2025 |website=StackOverflow help center}}</ref> which allows them to retrieve content that was deleted. | |||
== References == | ===Microsoft - GitHub CoPilot trained on free user repositories=== | ||
Labeled on the FAQ for [[GitHub CoPilot]],<ref>{{Cite web |last=CoPilot |title=FAQ |url=https://copilot.github.trust.page/faq?s=v2qe7voltpwtv2usl4ikhs#ip-and-open-source |access-date=Mar 30, 2025 |website=github}}</ref> users who pay for either a ''Pro'' or ''Enterprise'' tier plan do not have their repositories (''repos'') scanned for the purposes of training CoPilot. This can be considered a form of [[racketeering]], as consumers are forced into paying if they wish to not have their content be indirectly profited off of by [[Microsoft: Family 365 subscripcion forced upsell|Microsoft]]. There are theories that private repos may not be used for training purposes,{{Citation needed}}<!-- Mentioned in previous version of this section --> but it is unable to be verified at this time. Users on this platform have shown some backlash since this has eroded trust in [[GitHub]].<ref>{{Cite web |last=Dlindmark |date=Feb 23, 2025 |title=Does GitHub Copilot use any code from individual users to train GitHub's model (or any successor model)? #152229 |url=https://github.com/orgs/community/discussions/152229 |access-date=Mar 30, 2025 |website=GitHub}}</ref><!-- Possibly move this topic into a main page as forms of racketeering is pretty serious --> | |||
==References== | |||
<references /> | <references /> |
Latest revision as of 08:01, 31 March 2025
❗Article Status Notice: This Article is a stub
This article is underdeveloped, and needs additional work to meet the wiki's Content Guidelines and be in line with our Mission Statement for comprehensive coverage of consumer protection issues. Issues may include:
- This article needs to be expanded to provide meaningful information
- This article requires additional verifiable evidence to demonstrate systemic impact
- More documentation is needed to establish how this reflects broader consumer protection concerns
- The connection between individual incidents and company-wide practices needs to be better established
- The article is simply too short, and lacks sufficient content
How You Can Help:
- Add documented examples with verifiable sources
- Provide evidence of similar incidents affecting other consumers
- Include relevant company policies or communications that demonstrate systemic practices
- Link to credible reporting that covers these issues
- Flesh out the article with relevant information
This notice will be removed once the article is sufficiently developed. Once you believe the article is ready to have its notice removed, visit the Discord (join here) and post to the #appeals
channel, or mention its status on the article's talk page.
Generative AI, also referred to as GenAI or simply AI, is a program whose existence is to generate pieces of media based on a simple prompt (e.g. "How long do I heat popcorn for in the microwave?" or "bowl of buttery popcorn, realistic, artstation, pretty") with various and random results. GenAI over its currently short existence being accessible to the public has garnered large amounts of concern across the various fields it has been applied to.
General controversies surrounding generative AI[edit | edit source]
Controversy | Brief Description | Related Article(s)/Section(s) |
---|---|---|
Training data collected without consent | Various platforms have scraped data ranging within the petabytes concerning content created by users and potentially owned by companies, without first obtaining an adequate license to use this data. This has gone so far as to not even request consent or even notifying users in advance that their content was used to train AI-powered tools. | |
Replacing skilled workers with AI | Due to its generalized nature, jobs across fields from digital art to writing and programming have had experienced staff replaced by lesser-paid (and often lesser-experienced) employees who would be tasked to use generative tools to do their work. To remain relevant to the wiki's purpose, the usage leads to the detriment of product quality for consumers, such as representatives replaced with chatbots, or products being sold by companies use poorly-generated content that may harm the consumer.[1] | |
Specific controversies involving generative AI[edit | edit source]
Reddit - Training AI on user-generated content[edit | edit source]
In late 2024, Reddit announced the release of 'Reddit Answers,' a large language model (LLM) that was publicly stated[2] to use content created by users to train the tool, without requiring prior consent or prior public notice.
DeviantArt - DreamUp[edit | edit source]
While more speculative, it is reasonable for users to assume[3] that when DeviantArt initially automatically opted all users into allowing their work to be training data for generative AI,[4][5] that all content uploaded to DeviantArt was used as training data for their DreamUp tool, however according to statements from DeviantArt CEO Moti Levy,[6] DeviantArt did not plan or intend to train their tool based on user-generated works and that any user-generated works that were used in their model, were introduced by StabilityAI. Regardless, the introduction of DreamUp to the art sharing platform has both stirred controversy on the platform,[7] and also fractured the platform into 2 parties,[8] those for generative AI (typically those who hold newer accounts) and those against (typically users who have existed on the platform for far longer.) Due to the introduction of DreamUp, the platform has been cluttered by AI generated images, and staff have historically, frequently, and intentionally featured multiple users who exclusively upload GenAI content[9][10][11] or post content that uses generative content as a base,[12] with a majority of featured creators being ones who nearly or exclusively upload AI generated content.
LAION-5b training database[edit | edit source]
Many users have had their content scraped by LAION to power their training database, and the only way they can opt out is via a third party[13].
StackOverflow - Overflow AI training data based on user generated content[edit | edit source]
In Late July, 2023, Overflow AI was released by StackOverflow. The content used to train this AI was built off of questions and answers left by the StackOverflow community.[14] This action essentially subverts an existing policy from StackOverflow, where users cannot generate AI responses for answers.[15] The only way users currently have the capability of not having their content scraped for Overflow AI is to manually delete all posts and topics before deleting your account, however the effectiveness is questionable considering their Get Out Clause,[16] which allows them to retrieve content that was deleted.
Microsoft - GitHub CoPilot trained on free user repositories[edit | edit source]
Labeled on the FAQ for GitHub CoPilot,[17] users who pay for either a Pro or Enterprise tier plan do not have their repositories (repos) scanned for the purposes of training CoPilot. This can be considered a form of racketeering, as consumers are forced into paying if they wish to not have their content be indirectly profited off of by Microsoft. There are theories that private repos may not be used for training purposes,[citation needed] but it is unable to be verified at this time. Users on this platform have shown some backlash since this has eroded trust in GitHub.[18]
References[edit | edit source]
- ↑ Grady, Constance (29 Apr 2024). "The AI grift that can literally poison you". Vox. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ "Reddit Answers (Currently in Beta)". Reddit. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ "What I think about DeviantArt's AI Choices". Reddit. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ "deviantart's AI art program (and how to opt out)". Deviant Art. 11 Nov 2022. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ Erdine, Önder (16 Nov 2022). "DeviantArt DreamUp is the latest contender for the AI art crown". Digital Report. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ Robertston, Adi (15 Nov 2022). "How DeviantArt is navigating the AI art minefield". The Verge. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ Edwards, Benj (11 Nov 2022). "DeviantArt upsets artists with its new AI art generator, DreamUp [Updated]". ArsTechnica. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ Duchess Celestia (13 Nov 2022). "DeviantART Just Betrayed Its Whole Community. (DreamUp AI Controversy) || SPEEDPAINT + COMMENTARY". YouTube. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ "DeviantArt Seller: StygianAI". Deviant Art. 22 Jul 2024. Retrieved 31 Mar 2025.
- ↑ "Create on DeviantArt: VeilAI". Deviant Art. 9 Oct 2024. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ "DeviantArt Seller: ExeFelix". Deviant Art. 15 Jul 2024. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ "Create on DeviantArt: AKoukis". Deviant Art. 9 Oct 2024. Retrieved 31 Mar 2025.
{{cite web}}
: CS1 maint: url-status (link) - ↑ Have I Been Trained?
- ↑ StackOverflow (Jul 27, 2023). "Announcing OverflowAI". StackOverflow blog. Retrieved Mar 30, 2025.
- ↑ StackOverflow. "Generative AI Policy". StackOverflow help center. Retrieved Mar 30, 2025.
- ↑ StackOverflow. "How do I delete all my contributions?". StackOverflow help center. Retrieved Mar 30, 2025.
- ↑ CoPilot. "FAQ". github. Retrieved Mar 30, 2025.
- ↑ Dlindmark (Feb 23, 2025). "Does GitHub Copilot use any code from individual users to train GitHub's model (or any successor model)? #152229". GitHub. Retrieved Mar 30, 2025.