Bananabot (talk | contribs)
Added archive URLs for 4 citation(s) using CRWCitationBot
Line 9: Line 9:
}}
}}


'''{{wplink|OpenAI}}'''<ref>[https://openai.com/ OpenAI Landing Page]</ref> is an American [[Artificial intelligence]] (AI) focused company. Founded in December 2015, OpenAI is best known their [[ChatGPT]] chatbot, also known for the Generative Pre-Trained Transformer (GPT) family of large language models, the DALL-E series of text-to-image models, and Sora, a text-to-video model. With a reported revenue of $10B in FY2025 <ref>{{Cite web |last= |date=10 Jun 2025 |title=OpenAI's annualized revenue hits $10 billion, up from $5.5 billion in December 2024 |url=https://www.reuters.com/business/media-telecom/openais-annualized-revenue-hits-10-billion-up-55-billion-december-2024-2025-06-09/ |url-status=live |archive-url=https://web.archive.org/web/20250610051847/https://www.reuters.com/business/media-telecom/openais-annualized-revenue-hits-10-billion-up-55-billion-december-2024-2025-06-09/ |archive-date=2025-06-10 |access-date=2025-09-22 |website=Reuters}}</ref> and approximately 5.5B visitors per month<ref>[https://www.semrush.com/website/chatgpt.com/overview/ ChatGPT monthly traffic] </ref>, OpenAI has positioned itself has a leader in the Generative AI industry.  <!-- This article is a work of progress as of 8/13/25, Feel free to edit it to your heart's content, of course. This is my first article on a site like this. -->
'''{{wplink|OpenAI}}'''<ref>[https://openai.com/ OpenAI Landing Page] ([http://web.archive.org/web/20260217225454/https://openai.com/ Archived])</ref> is an American [[Artificial intelligence]] (AI) focused company. Founded in December 2015, OpenAI is best known their [[ChatGPT]] chatbot, also known for the Generative Pre-Trained Transformer (GPT) family of large language models, the DALL-E series of text-to-image models, and Sora, a text-to-video model. With a reported revenue of $10B in FY2025 <ref>{{Cite web |last= |date=10 Jun 2025 |title=OpenAI's annualized revenue hits $10 billion, up from $5.5 billion in December 2024 |url=https://www.reuters.com/business/media-telecom/openais-annualized-revenue-hits-10-billion-up-55-billion-december-2024-2025-06-09/ |url-status=live |archive-url=https://web.archive.org/web/20250610051847/https://www.reuters.com/business/media-telecom/openais-annualized-revenue-hits-10-billion-up-55-billion-december-2024-2025-06-09/ |archive-date=2025-06-10 |access-date=2025-09-22 |website=Reuters}}</ref> and approximately 5.5B visitors per month<ref>[https://www.semrush.com/website/chatgpt.com/overview/ ChatGPT monthly traffic] ([http://web.archive.org/web/20260114194134/https://www.semrush.com/website/chatgpt.com/overview/ Archived])</ref>, OpenAI has positioned itself has a leader in the Generative AI industry.  <!-- This article is a work of progress as of 8/13/25, Feel free to edit it to your heart's content, of course. This is my first article on a site like this. -->
==Consumer-impact summary==
==Consumer-impact summary==
{{Ph-C-CIS}}
{{Ph-C-CIS}}
Line 26: Line 26:


===Web Crawlers ignoring robots.txt (2025)===
===Web Crawlers ignoring robots.txt (2025)===
In 2025, Jonathan Bailey from PlagiarismToday posted an article going into how ChatGPTs web crawlers were ignoring the sites Robots.txt file.<ref>https://www.plagiarismtoday.com/2025/07/23/chatgpt-ignores-robots-txt-rehashes-my-column/</ref> PlaigarismToday had blocked OpenAI's web crawlers in August of 2023, yet the latest ChatGPT model at the time provided data from articles that were posted the day before on the website, even though OpenAI wasn't supposed to be scraping these web pages. This can be problematic for smaller websites, due to OpenAI's aggressive approach to web crawling, with their crawlers reportedly in a single week sending in more than 29 thousand requests to a wiki known as The Cutting Room Floor.
In 2025, Jonathan Bailey from PlagiarismToday posted an article going into how ChatGPTs web crawlers were ignoring the sites Robots.txt file.<ref>https://www.plagiarismtoday.com/2025/07/23/chatgpt-ignores-robots-txt-rehashes-my-column/ ([http://web.archive.org/web/20260106080839/https://www.plagiarismtoday.com/2025/07/23/chatgpt-ignores-robots-txt-rehashes-my-column/ Archived])</ref> PlaigarismToday had blocked OpenAI's web crawlers in August of 2023, yet the latest ChatGPT model at the time provided data from articles that were posted the day before on the website, even though OpenAI wasn't supposed to be scraping these web pages. This can be problematic for smaller websites, due to OpenAI's aggressive approach to web crawling, with their crawlers reportedly in a single week sending in more than 29 thousand requests to a wiki known as The Cutting Room Floor.


===ChatGPT Atlas and prompt-injection vulnerability (2025)===
===ChatGPT Atlas and prompt-injection vulnerability (2025)===
In 2025, Brave posted an article about vulnerabilities that have agentic web browsers, such as ChatGPT Atlas, that consists of adding hidden malicious prompts in files, text or another media. Those prompts, combined with weak safeguards of the AI agents, can make them to expose and leak sensitive data of the user.<ref>https://owasp.org/www-community/attacks/PromptInjection</ref>
In 2025, Brave posted an article about vulnerabilities that have agentic web browsers, such as ChatGPT Atlas, that consists of adding hidden malicious prompts in files, text or another media. Those prompts, combined with weak safeguards of the AI agents, can make them to expose and leak sensitive data of the user.<ref>https://owasp.org/www-community/attacks/PromptInjection ([http://web.archive.org/web/20260210124436/https://owasp.org/www-community/attacks/PromptInjection Archived])</ref>


==Products==
==Products==