Originally posted by vladpetric
View Post
Announcement
Collapse
No announcement yet.
Linux Kernel Orphans Itanium Support, Linus Torvalds Acknowledges Its Death
Collapse
X
-
- Likes 5
-
Originally posted by vladpetric View PostIs it really annoying to decode instructions when they are arbitrary size, in bytes? Yes. Is it doable though? Yes. (and I've talked to designers of x86 as well).
Originally posted by vladpetric View PostI think you're also forgetting about the instruction trace caches in modern designs, which for a lot of codes get decoding off the critical path.
- Likes 1
Comment
-
Originally posted by coder View PostThe problem is that is scales very poorly, as you try to decode more instructions in parallel. This is because the start of each instruction depends on the length of the previous ones.
u-op caches are good for relatively tight code, but there are lots of examples where the time spent is too spread out to get much benefit from them.
- Likes 1
Comment
-
Originally posted by coder View PostThe problem is that is scales very poorly, as you try to decode more instructions in parallel. This is because the start of each instruction depends on the length of the previous ones.
u-op caches are good for relatively tight code, but there are lots of examples where the time spent is too spread out to get much benefit from them.
- Likes 2
Comment
-
Originally posted by vladpetric View PostPlease show me a decode-bound workload.
BTW, why reply to the same message twice?
- Likes 1
Comment
-
Originally posted by vladpetric View PostDo you need additional resources to implement the ISA? Yes. Do those matter? Well, it depends. On a mobile chip they might, from a power consumption perspective (and we're really not in the 1900s anymore with transistor budgets ... ). On a desktop/server chip? Absolutely not.
Also, die area matters, too. Look at the Ampere Altra, which packs 80 cores in much less silicon than AMD populates with 64, and will soon even go to 128!
- Likes 3
Comment
-
Originally posted by coder View PostI don't understand why you think power doesn't matter for desktop or server. Power is clearly a limiting factor for Intel (and some of AMD's top-binned chips) and matters a great deal for datacenter customers.
Also, die area matters, too. Look at the Ampere Altra, which packs 80 cores in much less silicon than AMD populates with 64, and will soon even go to 128!
- Likes 4
Comment
-
Originally posted by coder View PostI'm guessing some JIT code probably tends towards that end of the spectrum. Maybe also some video compression cases, where I seem to recall that x264 was having problems fitting some of their loops in L1 instruction cache.
BTW, why reply to the same message twice?
What's the big deal?
- Likes 1
Comment
-
Originally posted by coder View Postthey might come to rediscover IA64's unrealized potential.
Comment
Comment