Artificial intelligence: Difference between revisions
Removed working that falsely suggested AI companies at large ignore copyright law. Several recent cases show the principal act of training on copyrighted data is fair use, such as this one: https://www.theguardian.com/technology/2025/jun/26/meta-wins-ai-copyright-lawsuit-as-us-judge-rules-against-authors Other cases are pending or have resolved in a similar fashion. Using pirated material to train is one thing, but content available on the open Internet can be used for fair use purposes. |
|||
(5 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
{{ToneWarning}} | {{Irrelevant}}{{ToneWarning}} | ||
'''Artificial intelligence''' (AI) is a field of computer science producing | '''Artificial intelligence''' (AI) is a field of computer science producing systems that aim to solve problems which humans solve by using intelligence. So far, no AI solutions are intelligent. AI is not a new concept - it has been of interest as early as the 1950s. AI is a catch-all, it encompasses many areas and techniques, so merely saying that something uses AI tells one little about it. | ||
[[wikipedia: | Since the November 2022 launch of [[ChatGPT]], [[wikipedia:Large language model|large language model]] (LLM) chatbots have been a main focus of industry, with tens of billions of dollars in funding allocated to producing more popular LLMs. Also a significant focus are [[wikipedia:Text-to-image model|text-to-image models]], which "draw" an image using written prompt, and [[wikipedia:Text-to-video model|text-to-video models]], which extend the text-to-image concept across several smooth video frames. | ||
The current well-funded, | [[wikipedia:Generative artificial intelligence|Generative artificial intelligence]] models are trained through vast amounts of existing human-generated content. Using the example of an LLM, by gathering statistics on patterns of words that people use, the model can generate sequences of words that seem similar to what a person might have written. LLM do not understand anything, they can not reason. Everything they generate is just a randomly modulated pattern of tokens. People reading sequences of tokens sometimes see things they think of as being true. Sequences which do not make sense to the reader, or which are false are called [[wikipedia:Hallucination (artificial intelligence)|hallucination]]. LLM are typically trained to produce output which is pleasing to people, therefore they often produce output which seems confidently-written, and use patterns which praise the user (sycophancy). | ||
LLM are a glorified autocomplete. People are used to dealing with people, and many overestimate the abilities of things that exhibit complex, person like patterns. Promoters of “AI” systems take advantage of this tendency, using suggestive names (like “reasoning,” and “learning”) and grand claims (“PHD level”), which make it harder for people to understand these systems. | |||
From November 2022 to 2025, venture capitalists and companies threw hundreds of billions into AI, but received minimal returns. When companies seek returns, consumers can expect that products may be orphaned, services may be reduced, customer data to be sold or repurposed, costs to rise, and companies to reduce staff or fail. Historically, AI has had brief periods of intense hype, followed by disillusionment, and “AI winters.” | |||
The current well-funded, industry of artificial intelligence tools has resulted in rampant unethical use of content. Startups intending to produce AI services have been scraping the internet for content to train future models at a fast pace, and members of the field are concerned that they are approaching the limit of publicly-available content to train from.<ref>{{Cite web |last=Tremayne-Pengelly |first=Alexandra |date=16 Dec 2024 |title=Ilya Sutskever Warns A.I. Is Running Out of Data—Here’s What Will Happen Next |url=https://observer.com/2024/12/openai-cofounder-ilya-sutskever-ai-data-peak/ |website=Observer}}</ref> | |||
==Unethical website scraping== | ==Unethical website scraping== | ||
Line 97: | Line 103: | ||
In some cases, these AI models can also be hijacked for malicious purposes. Demonstrated from the usage of Comet ([[Perplexity]]), users can run arbitrary prompts to the browser's built-in AI assistant via hiding text in the HTML comments, non-visible webpage text, or simple comments on a webpage.<ref>{{Cite web |date=Aug 20, 2025 |title=Tweet from Brave |url=https://xcancel.com/brave/status/1958152314914508893#m |access-date=Aug 24, 2025 |website=X (formerly [[Twitter]])}}</ref> These arbitrary prompts can then be abused to hijack sensitive information, or worse, break into high-value accounts, such as for banking or game libraries.<ref>{{Cite web |date=Aug 23, 2025 |title=Tweet from zack (in SF) |url=https://xcancel.com/zack_overflow/status/1959308058200551721 |access-date=Aug 24, 2025 |website=X (formerly [[Twitter]])}}</ref> | In some cases, these AI models can also be hijacked for malicious purposes. Demonstrated from the usage of Comet ([[Perplexity]]), users can run arbitrary prompts to the browser's built-in AI assistant via hiding text in the HTML comments, non-visible webpage text, or simple comments on a webpage.<ref>{{Cite web |date=Aug 20, 2025 |title=Tweet from Brave |url=https://xcancel.com/brave/status/1958152314914508893#m |access-date=Aug 24, 2025 |website=X (formerly [[Twitter]])}}</ref> These arbitrary prompts can then be abused to hijack sensitive information, or worse, break into high-value accounts, such as for banking or game libraries.<ref>{{Cite web |date=Aug 23, 2025 |title=Tweet from zack (in SF) |url=https://xcancel.com/zack_overflow/status/1959308058200551721 |access-date=Aug 24, 2025 |website=X (formerly [[Twitter]])}}</ref> | ||
== Further reading == | |||
* [[Automatic Content Recognition]] | |||
* [[Palantir]] | |||
* [[Meta]] | |||
* [[Yandex]] | |||
* [[TikTok & AI-powered Ad Tracking]] | |||
* [[Flock License Plate Readers]] | |||
* [[Ring]] | |||
* [[Waymo]] | |||
* [[Google]] | |||
==References== | ==References== |