Azaydan (talk | contribs)
Corrections about the Artificial FP64 Limitation on Consumer GPUs
Azaydan (talk | contribs)
Added some sources and corrected some imprecision
Line 21: Line 21:
In Nvidia's terms of service regarding accessing their website, under "Informal Resolution" users are required to agree to resolve legal disputes with Nvidia by arbitration from Judicial Arbitration and Mediation Services (JAMS).<ref>{{Cite web |title=NVIDIA Legal Notices |url=https://www.nvidia.com/en-gb/about-nvidia/terms-of-service/ |access-date=2025-06-19 |website=NVIDIA |language=en-gb |archive-url=http://web.archive.org/web/20260221043438/https://www.nvidia.com/en-gb/about-nvidia/terms-of-service/ |archive-date=21 Feb 2026}}</ref>
In Nvidia's terms of service regarding accessing their website, under "Informal Resolution" users are required to agree to resolve legal disputes with Nvidia by arbitration from Judicial Arbitration and Mediation Services (JAMS).<ref>{{Cite web |title=NVIDIA Legal Notices |url=https://www.nvidia.com/en-gb/about-nvidia/terms-of-service/ |access-date=2025-06-19 |website=NVIDIA |language=en-gb |archive-url=http://web.archive.org/web/20260221043438/https://www.nvidia.com/en-gb/about-nvidia/terms-of-service/ |archive-date=21 Feb 2026}}</ref>


===Artificial FP64 Limitation on Consumer GPUs (''2012 - 2014'')===
===Artificial FP64 Restriction on Kepler Consumer GPUs (''2013 - 2014'')===
On Kepler-era GK110 graphics cards, Nvidia used a driver-imposed clock rate restriction to artificially limit double precision floating point (FP64) compute performance on consumer variants, despite the hardware being fully capable. The GTX 780 and GTX 780 Ti share the GK110 die with the Tesla K40, which delivers FP64 at 1/3 the rate of FP32. On the consumer cards, the driver deliberately clocked the FP64 units at 1/8 of the chip's base rate, yielding a 1/24 ratio.<ref>{{Cite web |last=Angelini |first=Chris |date=2013-11-07 |title=Nvidia GeForce GTX 780 Ti Review: GK110, Fully Unlocked |url=https://www.tomshardware.com/reviews/geforce-gtx-780-ti-review-benchmarks,3663.html |url-status=live |access-date=2026-03-02 |website=Tom's Hardware}}</ref> The GTX Titan, also based on GK110, shipped with a driver control panel toggle that could unlock the full 1/3 FP64 rate at the cost of disabling GPU Boost, confirming the limitation was a software restriction rather than a hardware one.<ref>{{Cite web |last=Angelini |first=Chris |date=2013-02-19 |title=Compute Performance And Striking A Balance - Nvidia GeForce GTX Titan 6 GB: GK110 On A Gaming Card |url=https://www.tomshardware.com/reviews/geforce-gtx-titan-gk110-review,3438-3.html |url-status=live |access-date=2026-03-02 |website=Tom's Hardware}}</ref> From the Maxwell architecture onward, Nvidia switched to physically reducing the number of FP64 cores on consumer dies, ending the driver restriction but continuing hardware-level segmentation between consumer and enterprise products.
Nvidia's Kepler-generation GK110 GPU contains 64 double precision (FP64) CUDA cores per SMX block on all variants of the chip, including consumer GeForce cards, the GTX Titan, and the enterprise Tesla K40.<ref>{{Cite web |last=Angelini |first=Chris |date=2013-02-19 |title=GK110: The True Tank |url=https://www.tomshardware.com/reviews/geforce-gtx-titan-gk110-review,3438-2.html |url-status=live |access-date=2026-03-02 |website=Tom's Hardware}}</ref> On the GTX 780 and GTX 780 Ti, Nvidia forced the FP64 units to run at one eighth of the GPU's clock rate, yielding an effective FP64 throughput of 1/24 that of FP32, despite the hardware being fully present.<ref>{{Cite web |last=Angelini |first=Chris |date=2013-02-19 |title=Compute Performance And Striking A Balance |url=https://www.tomshardware.com/reviews/geforce-gtx-titan-gk110-review,3438-3.html |url-status=live |access-date=2026-03-02 |website=Tom's Hardware}}</ref> The GeForce GTX Titan, also based on GK110, shipped with a toggle in the Nvidia control panel under "CUDA - Double precision" that restored the full 1/3 FP64 rate at the cost of disabling GPU Boost, confirming the limitation on other GK110 consumer cards was a deliberate restriction rather than a hardware incapability.<ref>{{Cite web |last=Angelini |first=Chris |date=2017-05-25 |title=Nvidia Titan Xp 12GB Review |url=https://www.tomshardware.com/reviews/nvidia-titan-xp,5066-14.html |url-status=live |access-date=2026-03-02 |website=Tom's Hardware}}</ref> The Tesla K40, built on the same GK110 die, delivered the full 1/3 FP64 rate and retailed at several times the price of consumer equivalents. Beginning with the Maxwell architecture in 2014, Nvidia dropped native FP64 throughput to 1/32 of FP32 by physically reducing the number of FP64 cores on consumer dies, ending the firmware restriction but continuing the segmentation through hardware design.<ref>{{Cite web |last=Smith |first=Ryan |date=2015-03-17 |title=GM200: All Graphics, Hold The Double Precision |url=https://www.anandtech.com/show/9059/the-nvidia-geforce-gtx-titan-x-review/2 |url-status=live |access-date=2026-03-02 |website=AnandTech}}</ref>


===Stagnation of Consumer GPU Offerings===
===Stagnation of Consumer GPU Offerings===