By RPi foundation https://magpi.raspberrypi.org/articles/ ... benchmarks,

By Tom's hardware https://www.tomshardware.com/news/raspb ... 8gb-tested,

It is quite strange that the throughout of the memory of is only 30% of the theoratical value. I wonder if there is any structural design that limits the performance of the memory such as the memory controller, the bus or the cache.
While digging into this I found some clue.
I found somebody ran AIDA64 bandwidth benchmark on RPi4 with "windows on arm" system(https://www.bilibili.com/video/BV1Kh411Z7to) and the results are quite interesting, the AIDA64 results are as follows

Though the x86 program recompiled to ARM may affect the accuacy, the memory bandwidth is totally identical to the results from Rpi foundation and tomshardware and I assume these results are reliable.
The interesting point is that the L2 cache didin't perform much better than the memory on bandwidth which is not observed in other A72 systems https://www.anandtech.com/show/11088/hi ... nd-power/3 and I think the shared inclusive L2 cache may throttle the bandwidth of the memory.
OC L2 cache may help achieve better bandwidth if possible. There could also be some other limitations but neither RPi foundation nor Broadcomm disclosed more information about the detailed design or the bus topology inside the chip so it is hard to dig these things.