Nvidia: Difference between revisions
Corrections about the Artificial FP64 Limitation on Consumer GPUs |
Added some sources and corrected some imprecision |
||
| Line 21: | Line 21: | ||
In Nvidia's terms of service regarding accessing their website, under "Informal Resolution" users are required to agree to resolve legal disputes with Nvidia by arbitration from Judicial Arbitration and Mediation Services (JAMS).<ref>{{Cite web |title=NVIDIA Legal Notices |url=https://www.nvidia.com/en-gb/about-nvidia/terms-of-service/ |access-date=2025-06-19 |website=NVIDIA |language=en-gb |archive-url=http://web.archive.org/web/20260221043438/https://www.nvidia.com/en-gb/about-nvidia/terms-of-service/ |archive-date=21 Feb 2026}}</ref> | In Nvidia's terms of service regarding accessing their website, under "Informal Resolution" users are required to agree to resolve legal disputes with Nvidia by arbitration from Judicial Arbitration and Mediation Services (JAMS).<ref>{{Cite web |title=NVIDIA Legal Notices |url=https://www.nvidia.com/en-gb/about-nvidia/terms-of-service/ |access-date=2025-06-19 |website=NVIDIA |language=en-gb |archive-url=http://web.archive.org/web/20260221043438/https://www.nvidia.com/en-gb/about-nvidia/terms-of-service/ |archive-date=21 Feb 2026}}</ref> | ||
===Artificial FP64 | ===Artificial FP64 Restriction on Kepler Consumer GPUs (''2013 - 2014'')=== | ||
Nvidia's Kepler-generation GK110 GPU contains 64 double precision (FP64) CUDA cores per SMX block on all variants of the chip, including consumer GeForce cards, the GTX Titan, and the enterprise Tesla K40.<ref>{{Cite web |last=Angelini |first=Chris |date=2013-02-19 |title=GK110: The True Tank |url=https://www.tomshardware.com/reviews/geforce-gtx-titan-gk110-review,3438-2.html |url-status=live |access-date=2026-03-02 |website=Tom's Hardware}}</ref> On the GTX 780 and GTX 780 Ti, Nvidia forced the FP64 units to run at one eighth of the GPU's clock rate, yielding an effective FP64 throughput of 1/24 that of FP32, despite the hardware being fully present.<ref>{{Cite web |last=Angelini |first=Chris |date=2013-02-19 |title=Compute Performance And Striking A Balance |url=https://www.tomshardware.com/reviews/geforce-gtx-titan-gk110-review,3438-3.html |url-status=live |access-date=2026-03-02 |website=Tom's Hardware}}</ref> The GeForce GTX Titan, also based on GK110, shipped with a toggle in the Nvidia control panel under "CUDA - Double precision" that restored the full 1/3 FP64 rate at the cost of disabling GPU Boost, confirming the limitation on other GK110 consumer cards was a deliberate restriction rather than a hardware incapability.<ref>{{Cite web |last=Angelini |first=Chris |date=2017-05-25 |title=Nvidia Titan Xp 12GB Review |url=https://www.tomshardware.com/reviews/nvidia-titan-xp,5066-14.html |url-status=live |access-date=2026-03-02 |website=Tom's Hardware}}</ref> The Tesla K40, built on the same GK110 die, delivered the full 1/3 FP64 rate and retailed at several times the price of consumer equivalents. Beginning with the Maxwell architecture in 2014, Nvidia dropped native FP64 throughput to 1/32 of FP32 by physically reducing the number of FP64 cores on consumer dies, ending the firmware restriction but continuing the segmentation through hardware design.<ref>{{Cite web |last=Smith |first=Ryan |date=2015-03-17 |title=GM200: All Graphics, Hold The Double Precision |url=https://www.anandtech.com/show/9059/the-nvidia-geforce-gtx-titan-x-review/2 |url-status=live |access-date=2026-03-02 |website=AnandTech}}</ref> | |||
===Stagnation of Consumer GPU Offerings=== | ===Stagnation of Consumer GPU Offerings=== | ||