View Single Post
Old 06-04-2008, 02:15   #7101
dagon1978
Senior Member
 
L'Avatar di dagon1978
 
Iscritto dal: Mar 2004
Messaggi: 1464
a proposito di cache, qualcuno aveva già dato un'occhiata a questo articolo?
non mi pare sia stato già postato, se fosse così me ne scuso

l'analisi mi sembra molto approfondita, se bjt2 o altri vogliono dargli un occhio... io onestamente non ci capito molto di quanto scritto, però riporto le conslusioni:

The architecture of new AMD Phenom processors (AMD K10) reviewed in this article is quite successful. Its main advantage over the previous architecture (AMD Athlon 64, K8) is its balanced design on all levels. For example, the L1 D-Cache bus is expanded to 256 bit (2x128) agrees well with 128-bit FP arithmetic units, providing equal bandwidth for reading data using 128-bit SSE/SSE2 multimedia instructions (32 bytes/cycle). And the 128-bit FP units per se can yield a noticeable performance gain in most synthetic and real applications working with this data type (e.g. scientific programs, 3D Modeling, computer games, etc). In our simple case (reading data with 128-bit SSE/SSE2 instructions), their execution speed has grown fourfold.

Expanded to 128 bit in one direction, the L1-L2 D-Cache bus (which organizational type is still in question) eliminates a bottleneck in the data exchange process between a processor core and system memory (it's hard to imagine that the L1-L2 bus is as wide as the L2-memory controller bus in AMD K8 processors!) A great advantage of the new architecture is its efficient hardware prefetch on all levels (L2, L3 Caches, and system memory). As we have shown in our separate review, it can show its true worth in case of multi-core access to system memory, reaching up to 85% of performance potential of high speed dual-channel DDR2 memory.

What concerns L3 Cache in its integrated memory controller shared for all processor cores, this cache level is not an apparent advantage of new processors. According to our single-core access tests, bandwidth of this cache level is not much higher than bandwidth of system memory in the same conditions (8.9 GB/s versus 6.4 GB/s), which has to do with high bandwidth of dual-channel DDR2 memory. L3 Cache has better latencies, but only for random memory access (in other cases latencies of all levels are effectively "camouflaged" by hardware prefetch). So its advantages can be more noticeable in applications that randomly access data stored in different areas of memory (for example, databases).
__________________
| QUAD-FX | 2xAMD AthlonFX-70 @ 2.86Ghz | Asus L1N64-SLI WS | G-Skill DDR2-800 4x2Gb | Nvidia QuadroFX 3450 | Corsair HX Modular 620W |
[MI] Vendo RAM PC133: 2x256Mb (registered); PC100: 64Mb; PC2100S: 2x256Mb; PC2700S: 256Mb; HardDisk 2,5": Fujitsu 40Gb
CLICCA QUI PER DETTAGLI
dagon1978 è offline