Generative artificial intelligence: Difference between revisions
m →Major media platforms with controversies relating to Generative AI: weird formatting bug |
m Added Related Articles to the new table (oops!) |
||
| Line 35: | Line 35: | ||
|'''Racketeering'''; scraping all users' public code repositories without prior notice nor consent for a large language model ('''GitHub CoPilot'''), then locking the ability to opt-out behind a subscription service | |'''Racketeering'''; scraping all users' public code repositories without prior notice nor consent for a large language model ('''GitHub CoPilot'''), then locking the ability to opt-out behind a subscription service | ||
|Labeled on the FAQ for GitHub CoPilot,<ref>{{Cite web |last=CoPilot |title=FAQ |url=https://copilot.github.trust.page/faq?s=v2qe7voltpwtv2usl4ikhs#ip-and-open-source |access-date=Mar 30, 2025 |website=github}}</ref> users who pay for either a ''Pro'' or ''Enterprise'' tier plan do not have their repositories (''repos'') scanned for the purposes of training CoPilot. This can be considered a form of [[racketeering]], as consumers are forced into paying if they wish to not have their content be indirectly profited off of by [[Microsoft: Family 365 subscripcion forced upsell|Microsoft]]. There are theories that private repos may not be used for training purposes,{{Citation needed}}<!-- Mentioned in previous version of this section --> but it is unable to be verified at this time. Users have shown backlash, and the ordeal has eroded public trust in [[GitHub]].<ref>{{Cite web |last=Dlindmark |date=Feb 23, 2025 |title=Does GitHub Copilot use any code from individual users to train GitHub's model (or any successor model)? #152229 |url=https://github.com/orgs/community/discussions/152229 |access-date=Mar 30, 2025 |website=GitHub}}</ref> | |Labeled on the FAQ for GitHub CoPilot,<ref>{{Cite web |last=CoPilot |title=FAQ |url=https://copilot.github.trust.page/faq?s=v2qe7voltpwtv2usl4ikhs#ip-and-open-source |access-date=Mar 30, 2025 |website=github}}</ref> users who pay for either a ''Pro'' or ''Enterprise'' tier plan do not have their repositories (''repos'') scanned for the purposes of training CoPilot. This can be considered a form of [[racketeering]], as consumers are forced into paying if they wish to not have their content be indirectly profited off of by [[Microsoft: Family 365 subscripcion forced upsell|Microsoft]]. There are theories that private repos may not be used for training purposes,{{Citation needed}}<!-- Mentioned in previous version of this section --> but it is unable to be verified at this time. Users have shown backlash, and the ordeal has eroded public trust in [[GitHub]].<ref>{{Cite web |last=Dlindmark |date=Feb 23, 2025 |title=Does GitHub Copilot use any code from individual users to train GitHub's model (or any successor model)? #152229 |url=https://github.com/orgs/community/discussions/152229 |access-date=Mar 30, 2025 |website=GitHub}}</ref> | ||
| | |[[GitHub]] | ||
|- | |- | ||
|'''DeviantArt''' | |'''DeviantArt''' | ||
| Line 46: | Line 46: | ||
|Training on all active users' data without prior notice nor explicit consent for a proprietary large language and image generation model ('''Grok'''); leaking over 370,000 private chats with Grok over Google search results without prior notice nor user consent | |Training on all active users' data without prior notice nor explicit consent for a proprietary large language and image generation model ('''Grok'''); leaking over 370,000 private chats with Grok over Google search results without prior notice nor user consent | ||
|Sometime in mid 2024, [[X Corp]]'s platform of the same moniker quietly changed its Terms of Service to allow all user-generated content submitted through it to be used to "Train artificial intelligence models, whether generative or another type", specifically with a permanent license for said content granted to the corporation and without any form of monetary compensation.<ref name=":0">{{Cite web |date=2026-01-05 |title=X Terms of Service |url=https://x.com/en/tos#:~:text=You%20agree%20that%20this%20license%20includes%20the%20right%20for%20us%20to%20%28i%29%20analyze%20text%20and%20other%20information%20you%20provide%20and%20to%20otherwise%20provide%2C%20promote%2C%20and%20improve%20the%20Services%2C%20including%2C%20for%20example%2C%20for%20use%20with%20and%20training%20of%20our%20machine%20learning%20and%20artificial%20intelligence%20models%2C%20whether%20generative%20or%20another%20type%3B%20and%20%28ii%29%20to%20make%20Content%20submitted%20to%20or%20through%20the%20Services%20available%20to%20other%20companies%2C%20organizations%20or%20individuals%2C%20including%2C%20for%20example%2C%20for%20improving%20the%20Services%20and%20the%20syndication%2C%20broadcast%2C%20distribution%2C%20repost%2C%20promotion%20or%20publication%20of%20such%20Content%20on%20other%20media%20and%20services%2C%20subject%20to%20our%20terms%20and%20conditions%20for%20such%20Content%20use%2E%20Such%20additional%20uses%20by%20us%2C%20or%20other%20companies%2C%20organizations%20or%20individuals%2C%20is%20made%20with%20no%20compensation%20paid%20to%20you%20with%20respect%20to%20the%20Content%20that%20you%20submit%2C%20post%2C%20transmit%20or%20otherwise%20make%20available%20through%20the%20Services%20as%20the%20use%20of%20the%20Services%20by%20you%20is%20hereby%20agreed%20as%20being%20sufficient%20compensation%20for%20the%20Content%20and%20grant%20of%20rights%20herein%2E |access-date=2026-01-05 |website=X}}</ref> Though there remains an option to opt-out in the platform's privacy settings, users were opted-in by default.<ref>{{Cite web |last=Cohen |first=Jason |date=2025-08-08 |title=Your Posts On X Are Being Used To Train Grok AI. Here's How To Stop It |url=https://www.pcmag.com/how-to/your-tweets-x-posts-train-elon-musk-grok-ai-how-to-stop-it-opt-out |url-status=live |archive-url=https://web.archive.org/web/20251103053227/https://www.pcmag.com/how-to/your-tweets-x-posts-train-elon-musk-grok-ai-how-to-stop-it-opt-out |archive-date=2025-11-03 |access-date=2026-01-05 |website=PC Magazine}}</ref> The following is speculative, as X Corp and sister corporation xAI have been almost entirely silent about the specifics of their methodology, but it's reasonable to assume that all users' available Content (as defined in the platform's Terms of Service)<ref name=":0" /> was scraped and introduced to xAI (and possibly other third-parties') datasets when the change was established. In addition, users' private conversations with the chatbot, which are shared through saved links, were/are accessible to search engines like Google and showed up for the first time in publicly accessible search results in August 2025.<ref>{{Cite web |last=Martin |first=Iain |date=2025-08-20 |title=Elon Musk’s xAI Published Hundreds Of Thousands Of Grok Chatbot Conversations |url=https://www.forbes.com/sites/iainmartin/2025/08/20/elon-musks-xai-published-hundreds-of-thousands-of-grok-chatbot-conversations/ |url-status=live |archive-url=https://web.archive.org/web/20250820105050/https://www.forbes.com/sites/iainmartin/2025/08/20/elon-musks-xai-published-hundreds-of-thousands-of-grok-chatbot-conversations/ |archive-date=2025-08-20 |access-date=2026-01-05 |website=Forbes}}</ref> | |Sometime in mid 2024, [[X Corp]]'s platform of the same moniker quietly changed its Terms of Service to allow all user-generated content submitted through it to be used to "Train artificial intelligence models, whether generative or another type", specifically with a permanent license for said content granted to the corporation and without any form of monetary compensation.<ref name=":0">{{Cite web |date=2026-01-05 |title=X Terms of Service |url=https://x.com/en/tos#:~:text=You%20agree%20that%20this%20license%20includes%20the%20right%20for%20us%20to%20%28i%29%20analyze%20text%20and%20other%20information%20you%20provide%20and%20to%20otherwise%20provide%2C%20promote%2C%20and%20improve%20the%20Services%2C%20including%2C%20for%20example%2C%20for%20use%20with%20and%20training%20of%20our%20machine%20learning%20and%20artificial%20intelligence%20models%2C%20whether%20generative%20or%20another%20type%3B%20and%20%28ii%29%20to%20make%20Content%20submitted%20to%20or%20through%20the%20Services%20available%20to%20other%20companies%2C%20organizations%20or%20individuals%2C%20including%2C%20for%20example%2C%20for%20improving%20the%20Services%20and%20the%20syndication%2C%20broadcast%2C%20distribution%2C%20repost%2C%20promotion%20or%20publication%20of%20such%20Content%20on%20other%20media%20and%20services%2C%20subject%20to%20our%20terms%20and%20conditions%20for%20such%20Content%20use%2E%20Such%20additional%20uses%20by%20us%2C%20or%20other%20companies%2C%20organizations%20or%20individuals%2C%20is%20made%20with%20no%20compensation%20paid%20to%20you%20with%20respect%20to%20the%20Content%20that%20you%20submit%2C%20post%2C%20transmit%20or%20otherwise%20make%20available%20through%20the%20Services%20as%20the%20use%20of%20the%20Services%20by%20you%20is%20hereby%20agreed%20as%20being%20sufficient%20compensation%20for%20the%20Content%20and%20grant%20of%20rights%20herein%2E |access-date=2026-01-05 |website=X}}</ref> Though there remains an option to opt-out in the platform's privacy settings, users were opted-in by default.<ref>{{Cite web |last=Cohen |first=Jason |date=2025-08-08 |title=Your Posts On X Are Being Used To Train Grok AI. Here's How To Stop It |url=https://www.pcmag.com/how-to/your-tweets-x-posts-train-elon-musk-grok-ai-how-to-stop-it-opt-out |url-status=live |archive-url=https://web.archive.org/web/20251103053227/https://www.pcmag.com/how-to/your-tweets-x-posts-train-elon-musk-grok-ai-how-to-stop-it-opt-out |archive-date=2025-11-03 |access-date=2026-01-05 |website=PC Magazine}}</ref> The following is speculative, as X Corp and sister corporation xAI have been almost entirely silent about the specifics of their methodology, but it's reasonable to assume that all users' available Content (as defined in the platform's Terms of Service)<ref name=":0" /> was scraped and introduced to xAI (and possibly other third-parties') datasets when the change was established. In addition, users' private conversations with the chatbot, which are shared through saved links, were/are accessible to search engines like Google and showed up for the first time in publicly accessible search results in August 2025.<ref>{{Cite web |last=Martin |first=Iain |date=2025-08-20 |title=Elon Musk’s xAI Published Hundreds Of Thousands Of Grok Chatbot Conversations |url=https://www.forbes.com/sites/iainmartin/2025/08/20/elon-musks-xai-published-hundreds-of-thousands-of-grok-chatbot-conversations/ |url-status=live |archive-url=https://web.archive.org/web/20250820105050/https://www.forbes.com/sites/iainmartin/2025/08/20/elon-musks-xai-published-hundreds-of-thousands-of-grok-chatbot-conversations/ |archive-date=2025-08-20 |access-date=2026-01-05 |website=Forbes}}</ref> | ||
| | |[[X Corp]] | ||
|- | |- | ||
|'''Stack Overflow''' | |'''Stack Overflow''' | ||
| Line 56: | Line 56: | ||
|Training on all users' content without prior notice nor explicit consent for a large language model ('''Reddit Answers''') | |Training on all users' content without prior notice nor explicit consent for a large language model ('''Reddit Answers''') | ||
|In late 2024, [[Reddit]] announced the release of 'Reddit Answers,' a large language model (LLM) that was publicly stated<ref>{{Cite web |title=Reddit Answers (Currently in Beta) |url=https://support.reddithelp.com/hc/en-us/articles/32026729424916-Reddit-Answers-Currently-in-Beta |url-status=live |access-date=31 Mar 2025 |website=[[Reddit]]}}</ref> to use content created by users to train the tool, without requiring prior consent or prior public notice. Reddit's Terms of Service explicitly allows the platform to scrape user-generated content for the purpose of training "AI and machine learning models",<ref>{{Cite web |first= |date=May 29, 2025 |title=User Agreement |url=https://redditinc.com/policies/user-agreement |url-status=live |archive-url=https://web.archive.org/web/20260101023854/https://redditinc.com/policies/user-agreement |archive-date=2026-01-01 |access-date=2026-01-05 |website=[[Reddit]]}}</ref> though this is likely limited to Reddit's proprietary dataset; it has legally discouraged third-party services from scraping the platform as per the Reddit Public Content Policy.<ref>{{Cite web |date= |title=Public Content Policy |url=https://support.reddithelp.com/hc/en-us/articles/26410290525844-Public-Content-Policy |url-status=live |archive-url=https://web.archive.org/web/20251230012504/https://support.reddithelp.com/hc/en-us/articles/26410290525844-Public-Content-Policy |archive-date=30 Dec 2025 |access-date=2025-01-05 |website=[[Reddit]]}}</ref> | |In late 2024, [[Reddit]] announced the release of 'Reddit Answers,' a large language model (LLM) that was publicly stated<ref>{{Cite web |title=Reddit Answers (Currently in Beta) |url=https://support.reddithelp.com/hc/en-us/articles/32026729424916-Reddit-Answers-Currently-in-Beta |url-status=live |access-date=31 Mar 2025 |website=[[Reddit]]}}</ref> to use content created by users to train the tool, without requiring prior consent or prior public notice. Reddit's Terms of Service explicitly allows the platform to scrape user-generated content for the purpose of training "AI and machine learning models",<ref>{{Cite web |first= |date=May 29, 2025 |title=User Agreement |url=https://redditinc.com/policies/user-agreement |url-status=live |archive-url=https://web.archive.org/web/20260101023854/https://redditinc.com/policies/user-agreement |archive-date=2026-01-01 |access-date=2026-01-05 |website=[[Reddit]]}}</ref> though this is likely limited to Reddit's proprietary dataset; it has legally discouraged third-party services from scraping the platform as per the Reddit Public Content Policy.<ref>{{Cite web |date= |title=Public Content Policy |url=https://support.reddithelp.com/hc/en-us/articles/26410290525844-Public-Content-Policy |url-status=live |archive-url=https://web.archive.org/web/20251230012504/https://support.reddithelp.com/hc/en-us/articles/26410290525844-Public-Content-Policy |archive-date=30 Dec 2025 |access-date=2025-01-05 |website=[[Reddit]]}}</ref> | ||
| | |[[Reddit]] | ||
|} | |} | ||