Originally posted by fairydreaming
So, just write the data into a stack-based buffer and then use REP MOVSB to copy it where you want. Stack-based local variables are virtually guaranteed to be in L1, so populating the buffer shouldn't cause any RFO (read-for-ownership) traffic to DRAM. It might seem inefficient to populate a local buffer before copying it to DRAM, but I think it's worth the trouble if your system is memory-bottlenecked, assuming REP MOVSB can indeed avoid RFO on the destination.
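Here's a rough sketch of what I mean, in C with GCC/Clang inline asm. The names `copy_rep_movsb`, `fill_via_stack` and `CHUNK`, the 4 KiB chunk size, and the byte pattern are all made up for illustration. It only shows the shape of the approach (populate a small stack buffer, then copy it out with REP MOVSB); it does not prove that RFO is actually avoided, since that depends on the CPU, so measure before relying on it. On non-x86-64 targets it just falls back to `memcpy`.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Copy n bytes with REP MOVSB on x86-64 (GCC/Clang extended asm).
 * RDI = destination, RSI = source, RCX = byte count.
 * The ABI guarantees the direction flag is clear on function entry,
 * so the copy runs forward. Other targets fall back to memcpy. */
static void copy_rep_movsb(void *dst, const void *src, size_t n)
{
#if defined(__x86_64__) && (defined(__GNUC__) || defined(__clang__))
    __asm__ volatile("rep movsb"
                     : "+D"(dst), "+S"(src), "+c"(n)
                     :
                     : "memory");
#else
    memcpy(dst, src, n);
#endif
}

/* Assumed staging size: small enough to stay resident in L1. */
enum { CHUNK = 4096 };

/* Hypothetical producer: fills dst with a byte pattern, one chunk at a
 * time. Each chunk is generated into a stack buffer first and only
 * then copied to its final location. */
static void fill_via_stack(uint8_t *dst, size_t total, uint8_t seed)
{
    uint8_t staging[CHUNK];
    size_t done = 0;

    assert(dst != NULL || total == 0);

    while (done < total) {
        size_t n = (total - done < CHUNK) ? (total - done) : CHUNK;

        /* Populate the stack buffer; these stores should hit L1. */
        for (size_t i = 0; i < n; i++)
            staging[i] = (uint8_t)(seed + done + i);

        /* Copy the finished chunk to the (possibly DRAM-backed) target. */
        copy_rep_movsb(dst + done, staging, n);
        done += n;
    }
}
```

You'd call it as `fill_via_stack(big_buffer, size, 0)`, where `big_buffer` is whatever large destination you're producing into. Whether the chunk size should be bigger or smaller is something I'd tune by benchmarking.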