A list of puns related to "CPU Cache"
Context: Gaming performance
Test system: R9 5900X, 4GHz fixed, 6 cores, SMT on, DDR4-3733 C14, Win 11, overclocked RX 6800 XT, Adrenalin 22.1.2
Conclusion: 1 CCD 6-0 >= 2 CCDs 3-0 (except FC 6)
Criticism: The test is not isolated; the higher inter-core latency also affects performance.
Note: Of course it's not an isolated test, but that's not really a problem. From a logical point of view you can still answer the question: no, it doesn't benefit from 2x32MB of L3, because the extra cache can't even compensate for the impact of the higher inter-core latencies.
What do you think?
Will the 5800X3D be the only CPU from AMD with 3D V-Cache?
AMD only announced this one model at CES, which worries me a bit. Admittedly, there were leaks last June about a 5900X3D prototype with 92MB of L3 cache, but since then there has been no more information about this CPU model.
Hello!
Ran into the "Pro Tools ran out of CPU power. Turn off Native plugins" Error. This has never happened before. On my old DAW (Mixcraft Pro 9) I've had sessions 3 times as large with more intensive plug ins in use, never running into an issue. I've also ran larger sessions than this one on Pro Tools (v.2021.7) with no issues... My PC has 32GB of RAM, and about a TB of free space on my SSD that's being used. So far I've mixed a few guitar tracks, bass, and a full drumset. I was going to throw plugins on a mandolin bus when this error has occurred. I don't know what the solution is... I'm using a mix of Waves, iZotope, Native, and Slate plugs. I still have to mix a bunch more tracks... Any solution would be appreciated! I'm still "new" to Pro Tools in regards to it not being the DAW I've used through most of my sound engineering life.
EDIT: New discovery: the error occurs even when ALL PLUGINS are disabled. I opened the session while holding Shift, which disables all plugins, in hopes of isolating the plugin causing the issue, and it still occurs on attempted playback! This is a glitch, apparently... Help!
EDIT: Currently running a system memory diagnostic to see if my RAM is working correctly. Again, I think this is a glitch, since the error occurs even with all plug-ins disabled.
EDIT: So the buffer size was set to 1024 samples. I switched interfaces to rule out the Helix, and the same error occurs with my M-Audio Fast Track, also at 1024 samples (the highest it goes). Still no solution yet! ;(
EDIT: Adjusting the "Ignore Errors" setting has caused PT to crash.
Are there any gains going from Zen 2 to Zen 3, and possibly Zen 3D?
I could only find this comparison, but the difference seems large: https://www.youtube.com/watch?v=s9l_8ASVyhA
I was implementing direct-mapped, LRU, and n-way set associative caches in C++ and found that a direct-mapped cache is inherently parallelizable and can easily reach 2.4 billion lookups per second. Even the slower n-way set associative cache surpassed 70 million lookups per second. If software can do this, why not hardware?
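For a sense of why the direct-mapped version is so fast: the lookup is just a masked array index with no search loop. Here is a minimal sketch of my own (an illustration under the stated assumptions, not the poster's actual code):

```cpp
// Minimal sketch of a direct-mapped software cache for integer keys: the slot
// is a masked array index, no search loop, which is why it is so easy to make
// fast and to parallelize. Assumptions (mine, not the poster's code): the slot
// count is a power of two and the key UINT64_MAX is never used (empty marker).
#include <cstdint>
#include <iostream>
#include <vector>

struct DirectMappedCache {
    explicit DirectMappedCache(std::size_t size)
        : mask_(size - 1), tags_(size, UINT64_MAX), values_(size, 0) {}

    // Returns true on a hit and writes the cached value into 'out'.
    bool get(std::uint64_t key, std::uint64_t& out) const {
        const std::size_t slot = static_cast<std::size_t>(key & mask_);
        if (tags_[slot] == key) { out = values_[slot]; return true; }
        return false;
    }

    // Insert/overwrite: a conflicting key simply evicts the previous one.
    void put(std::uint64_t key, std::uint64_t value) {
        const std::size_t slot = static_cast<std::size_t>(key & mask_);
        tags_[slot] = key;
        values_[slot] = value;
    }

private:
    std::uint64_t mask_;
    std::vector<std::uint64_t> tags_;
    std::vector<std::uint64_t> values_;
};

int main() {
    DirectMappedCache cache(1 << 16);   // 65536 slots
    cache.put(42, 4242);
    std::uint64_t v = 0;
    std::cout << (cache.get(42, v) ? v : 0) << "\n";   // prints 4242
}
```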
Writing to a variable in thread 1 and reading it from thread 2 at just L1 latency, instead of locking for ~1000 nanoseconds or fiddling with atomics. Just writing and reading. Imagine the good old concurrent queue doing 5 billion pushes and pops per second. Concurrent tree access for a game scene graph. Concurrent anything would be blazingly scalable, if only the L1 cache had enough sets (if it's n-way set associative) or tags (if it's direct mapped), like 64 or 128.
Also, when not multithreading, all that bandwidth would be available to a single thread for some really good handling of async I/O for web sites, or maybe just to boost out-of-order execution. I really wouldn't care if Excel loads a bit late; it's not a Pentium 2 after all. There must be a way of hiding the extra latency behind something else, like doing more L1 accesses at a time or working on more instructions at a time, maybe not as massively parallel as a GPU.
If it's not possible or cheap, then why don't they add hardware pipes that connect cores directly? Something like an assembly instruction that sends data directly to another core. GPUs have shared (local) memory that can pass data between all pipelines in a multiprocessor. GPU designers even optimize atomic functions in hardware so that many atomics run in parallel. Even if CPU atomics just worked like the GPU's, lockless concurrent data structures would get a decent boost.
What about stacking cores in the third dimension, just above the L1, to shorten the path? Would it work? Maybe it's not "just a few more cycles" as I guessed. But is it possible, even at a higher price per chip?
What about putting carbon nanotubes (or open-ended tubes/pipes) between stacked cores and pumping some cooling gas or liquid through them?
What about routing power "wirelessly"? Is it possible to set up a standing wave / resonance between the stacks and feed the transistors with electromagnetic waves?
If aligning the stacks is a problem, can we carve the stacks out of a single crystalline structure that somehow works as transistors, capacitors, etc., with just some extra atoms for capacitance and so on? This may have gone too far into fantasy, but buying a computer (IBM personal c
Swapped my old CPU to a 5600X on Windows 11 recently and just wanted to know if this bug is still present or if I don't need to reinstall my OS.
Imaginary scenario in assembler:
- enable exclusive 2-dimensional (x,y indexed) cache usage
- map the cache to accesses from a given array pointer + range
- start image processing without tag conflicts
- disable the 2D cache, continue using the L1 data cache with zero contention/thrashing
I tested this with a software cache, and the 2D direct-mapped cache had a much better cache-hit pattern (and ratio) than the 1D one; a rough sketch of the indexing idea follows the test links below:
Pattern test (with user-input):
https://www.youtube.com/watch?v=4scCNJx5rds
Performance test (with compile-time-known pattern):
https://www.youtube.com/watch?v=Ox82H2aboIk
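To make the 2D direct-mapped idea above concrete, here is a minimal sketch of my own (an illustration, not the code used in the videos), assuming power-of-two slot dimensions: the slot index is built from the low bits of x and y separately, so a rectangular tile of neighbouring pixels maps to distinct slots instead of aliasing through a flat y*width+x key.

```cpp
// Sketch of a 2D direct-mapped software cache for (x, y) pixel keys.
// Assumptions (mine): widthSlots and heightSlots are powers of two, and the
// coordinate UINT32_MAX is never used (it serves as the "empty slot" marker).
#include <cstdint>
#include <vector>

struct DirectMapped2D {
    DirectMapped2D(std::size_t widthSlots, std::size_t heightSlots)
        : wMask_(widthSlots - 1), hMask_(heightSlots - 1),
          keyX_(widthSlots * heightSlots, UINT32_MAX),
          keyY_(widthSlots * heightSlots, UINT32_MAX),
          value_(widthSlots * heightSlots, 0) {}

    // Hit test: the slot is chosen from the low bits of x and y separately.
    bool get(std::uint32_t x, std::uint32_t y, std::uint32_t& out) const {
        const std::size_t slot = (y & hMask_) * (wMask_ + 1) + (x & wMask_);
        if (keyX_[slot] == x && keyY_[slot] == y) { out = value_[slot]; return true; }
        return false;
    }

    // Insert/overwrite: a conflicting (x, y) simply evicts the old entry.
    void put(std::uint32_t x, std::uint32_t y, std::uint32_t v) {
        const std::size_t slot = (y & hMask_) * (wMask_ + 1) + (x & wMask_);
        keyX_[slot] = x;
        keyY_[slot] = y;
        value_[slot] = v;
    }

private:
    std::size_t wMask_, hMask_;
    std::vector<std::uint32_t> keyX_, keyY_, value_;
};
```

With a flat key, vertically adjacent pixels are separated by the image stride, which can alias onto the same slots for unlucky image widths; splitting the index per axis sidesteps that for tiled access patterns.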
If CPUs can have wide SIMD units and big L2 caches, why don't designers add some extra logic to boost image processing / matrix multiplication and compete with GPUs? This would add one more cache type (instruction cache + data cache + image cache) to separate the bandwidths from each other, so the normal data/instruction caches wouldn't be polluted by a big image.
Just curious as to what overclocking enthusiasts think about overclocking and which approach they found more fun/valuable. Speed or memory?
You know who loves cache memory: AMD.
You know who loves core speed: Intel.
Yes, I know both matter, but one has to matter more at this point in time.
Look at this article and contemplate what 1.5 GB of L3 cache means! Insane beast mode!
https://wccftech.com/dual-amd-epyc-7773x-flagship-milan-x-cpus-benchmarked-over-1-5-gb-of-total-cpu-cache-on-a-single-server-platform/
Simply put, Intel has nothing close, and this is Milan-X on 6nm Zen 3+ ... cache chiplets for EPYC Genoa too.
Maybe Intel should IPO its x86 and GPU business as a separate company, as they're doing with Mobileye, and remain a fab-only business? Of course Intel's fabs are a mess, so they have nothing really...
Edit: multi-threaded read-only mode benchmark complete (https://github.com/tugrul512bit/LruClockCache/wiki/How-To-Do-Multithreading-With-a-Read-Only-Multi-Level-Cache): 892M get calls per second (https://i.snipboard.io/DwS2Wg.jpg)
So I multiplexed the input of an LRU-approximation cache, which was only doing 50M lookups per second, by adding a direct-mapped cache (which uses a power-of-2 sized tag array to map integer keys) in front of the LRU. The compiler was not optimizing the modulo operator, so I made it work with
tag = key & (size - 1);
and it became much faster than the LRU, but had somewhat worse "hit ratio per cache size" characteristics, even when using prime numbers as the tag-array length.
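The composition described above, sketched under my own simplifications (integer keys only, and the slower LRU layer abstracted behind a miss callback; the actual implementations are in the repo headers linked below):

```cpp
// Sketch of a direct-mapped "front" cache multiplexing a slower backing cache.
// Assumptions (mine): frontSize is a power of two and Key is an integer type;
// on a miss, the callback (e.g. the LRU-approximation cache's get()) is asked.
#include <cstdint>
#include <functional>
#include <vector>

template <typename Key, typename Value>
struct TwoLevelCache {
    using MissFn = std::function<Value(Key)>;

    TwoLevelCache(std::size_t frontSize, MissFn onMiss)
        : mask_(frontSize - 1), valid_(frontSize, false),
          keys_(frontSize), values_(frontSize), onMiss_(std::move(onMiss)) {}

    Value get(Key key) {
        const std::size_t slot = static_cast<std::size_t>(key & mask_);  // tag = key & (size-1)
        if (valid_[slot] && keys_[slot] == key) return values_[slot];    // fast-path hit
        Value v = onMiss_(key);        // fall through to the slower, higher-hit-ratio layer
        valid_[slot] = true;           // fill the direct-mapped front cache
        keys_[slot] = key;
        values_[slot] = v;
        return v;
    }

private:
    std::size_t mask_;
    std::vector<bool> valid_;
    std::vector<Key> keys_;
    std::vector<Value> values_;
    MissFn onMiss_;
};

// Hypothetical usage, assuming an LRU layer exposing a get() like the one benchmarked above:
//   TwoLevelCache<std::uint64_t, int> cache(1 << 16,
//       [&](std::uint64_t k) { return lru.get(k); });
```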
Anyway, if anyone wants to benchmark a newer system like a Ryzen or a 10th-generation Intel CPU, I would be happy to see the results and put together a comparison table.
Here is benchmarking program source code: https://github.com/tugrul512bit/LruClockCache/wiki/Multi-Level-Cache-Benchmark-With-Gaussian-Blur-Computation-(Integer-Key-Only)
Here are include files:
https://github.com/tugrul512bit/LruClockCache/blob/main/integer_key_specialization/CacheThreader.h
https://github.com/tugrul512bit/LruClockCache/blob/main/integer_key_specialization/CpuBenchmarker.h
https://github.com/tugrul512bit/LruClockCache/blob/main/integer_key_specialization/DirectMappedCache.h
https://github.com/tugrul512bit/LruClockCache/blob/main/LruClockCache.h
The program outputs one image file per iteration (each iteration allocates bigger L1, L2, and LLC caches) so you can see whether it really works.
I also made a tiled-computing optimization, but for some reason it did not favor L1. I guess the ~2.5 nanosecond latency wasn't enough to hide the latency of some other operations?
The LLC, which uses a guard lock for thread safety, doesn't have multiple L2/L1 caches in front of it yet. I'm thinking about an efficient way of invalidating the caches when an L1 set (write) method is called, so it is single-threaded for now. Maybe std::atomic is a good way to deal with invalidation of arbitrary tags in L2 and L1. One atomic variable per tag may take too much memory, but it should be parallelizable by the CPU's L3, I guess?
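One possible shape for that per-tag std::atomic idea, sketched under my own assumptions (this is not code from the repo): the shared level bumps an atomic version counter per slot on every write, and each private L1/L2 copy remembers which version it was filled from, refetching when the counter has moved on.

```cpp
// Sketch of per-slot version-based invalidation between a shared level and
// private per-thread copies. Assumptions (mine): all writes to the shared
// level go through sharedWrite(), and a read that is occasionally one write
// late is acceptable (it self-corrects on the next lookup).
#include <atomic>
#include <cstdint>
#include <vector>

struct SharedSlots {
    explicit SharedSlots(std::size_t n) : version(n), value(n) {}
    std::vector<std::atomic<std::uint64_t>> version;  // bumped on every write
    std::vector<std::atomic<std::uint64_t>> value;
};

void sharedWrite(SharedSlots& s, std::size_t slot, std::uint64_t v) {
    s.value[slot].store(v, std::memory_order_release);
    s.version[slot].fetch_add(1, std::memory_order_release);
}

struct PrivateSlot {
    std::uint64_t version = 0;  // version this private copy was filled from
    std::uint64_t value = 0;
};

std::uint64_t privateRead(SharedSlots& s, std::size_t slot, PrivateSlot& mine) {
    const std::uint64_t ver = s.version[slot].load(std::memory_order_acquire);
    if (ver != mine.version) {      // shared copy changed: our value may be stale
        mine.value = s.value[slot].load(std::memory_order_acquire);
        mine.version = ver;         // conservative; at worst we refetch once more
    }
    return mine.value;
}
```

One 8-byte counter per slot is exactly the memory cost the paragraph worries about; sharing one counter across several slots would trade memory for some extra false invalidations.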
CLARIFICATION: I tried both an update and a clean install, VBS is off, and all test results are from the latest Windows 11 Dev build / latest Windows 10 stable build. I do know benchmarks don't necessarily mean anything, but gaming performance is noticeably worse.
VBS is not causing the problem (like I said, it's off); the main issue is the terrible performance in low-wattage modes (not only in games but also on the desktop, etc.).
I don't care that you guys turn off VBS because of the performance loss; I disabled it because virtualization blocks access to the registers that software writes to in order to achieve OC/UV.
Also, the reason I'm not doing OC/UV in the BIOS is that Lenovo's new stupid motherboard design doesn't allow any chance of BIOS recovery (like shorting the BIOS jumper); once you set a bad value or misclick something, boom, it's dead and won't POST.
I do agree that this title might be misleading, but the results are interesting and could possibly explain the stuttering I faced in those more CPU-demanding games...
I'm on the latest Dev version, which supposedly should not have any problem with the L3 cache, buuuut......
Previously, I could run CPU-bound games like LoL at 2560x1600, highest graphics, at 165FPS even in team fights and while changing the camera view randomly (in quiet mode); GTA V could run at the same resolution, very high graphics, at 110FPS (in performance mode).
That performance has been gone since a Windows 11 update.
I hope Microsoft can fix it, but it looks like nobody else is really noticing the same issue as mine...
[Performance mode 90W CPU](https://preview.redd.it/jy97jafh53t71.png)
Hi,
Does anyone have a solution for how to fix this error?
Thanks!
https://preview.redd.it/w2f59n1ejm381.png?width=770&format=png&auto=webp&s=07861a6f5c369ff1f8068643c4f8049cdb1bbdc8
Technology YouTubers are praising the M1's "unified memory" that is integrated directly into the CPU package, thus saving the time the CPU would spend communicating with a separate RAM module over the bus provided by the motherboard. But come to think of it, x86 CPUs have L1/L2/L3 cache memory inside the CPU. I know those caches are NOT shared with the GPU, but if you only look at CPU operations, aren't they basically the same as the M1's memory?