i did a exeperiment - profiling a array-access program, and found that DATA_CACHE_REFILLS_FROM_SYSTEM was smaller than array-size/cache-line-size. i guess hardware prefetch improve the cache hit ratio. but i want a memory access profiling information,including not only normal accesses but also prefetchs.

because i'm using a dual core opteron which tow core share memory controller, MEM_PAGE_ACCESS is also not accurate.