Ampere Altra Performance Shows It Can Compete With - Or Even Outperform - AMD EPYC & Intel Xeon
Collapse
X
-
Originally posted by PerformanceExpert View PostThese results were measured using stock installs and settings, thus relevant to what most people would see. You can always tweak and tune further but that was not the goal of this comparison.
Comment
-
-
Originally posted by pal666 View Postwhat word is difficult for you?What compatibility did it not retain?x86_64 also didn't retain full binary compatibility, they are even on this metric
Originally posted by pal666 View Postcan x86 code be successfully run in x86-64 mode? no.
Comment
-
-
Originally posted by PerformanceExpert View PostHowever the standard benchmark is too simplistic and overstating the benefit of GPUs. Fujitsu's goal was to make programming and optimization easier and enable much higher efficiencies on the many HPC codes that don't work well on GPUs.
Comment
-
-
Originally posted by coder View PostTechnically true, but now it just feels like you're trying to move the goalposts. So, what is it you're really after? I thought you just wanted a way to code for ARM other than on a cellphone or Pi.
Comment
-
-
Originally posted by coder View PostIt did actually win a majority, but some of the benchmarks it won are variations of each other.
The thing that often arouses my suspicion is how the benchmarks featured in these articles are chosen. PTS has thousands of test cases, yet there are only about 25 or so that feature here.
I don't know what will be a better method tho. ( Isn't there a "g factor" or something, that correlates with the test results? Like between different mental tasks. Idk, I am dumb in statistics, and also never learnt it. )
Comment
-
-
Originally posted by coder View PostWhat's weird about this claim that "Arm scales higher than x86" is that it's so filled with caveats. For one thing, Altra has separate NUMA domains, which AMD specifically rejected, in their 7002 generation. So, if you really need a lot of cores because you have a heavy workload that requires lots of communication, then symmetric is probably still the way to go. However, if you're just making a cloud/density play, and plan to partition up with lots of VMs or containers that each fit in a single NUMA domain, then this NUMA approach is better.
Comment
-
-
Originally posted by tuxd3v View PostThe reality is that ARM64 exists because AMD helped ARM to create ARM64, in the time when AMD was thinking ingoing ARM..
The ARM and the AMD ISAs couldn't be more different; 64bits must the only both have in common because one is RISC and the other is CISC, one is a 3-operand ISA and the other isn't, one is a load+op ISA and the other isn't, one is a new ISA created from scratch and the other is an extension to former ISAs, one has weak memory model and the other has strong model,...
Comment
-
-
Originally posted by olivier View PostDespite those impressive results, I'm a bit concerned about the fragmentation coming in the server world… x86 "super compatible" is a double edged sword: bad for efficiency but major for compatibility and having no questions on choosing your hardware or upgrading it.
More details on that potential fragmentation on the market here: https://medium.com/@olivier.lambert/...d-abddca7a6268
ARM has publicly released the Server Base Boot Requirements (SBBR) Specification, a follow-on companion to the Server Base System Architecture (SBSA) specification. The SBBR defines the ARMv8 platform firmware abstractions necessary for OS deployment...
Originally posted by cb88 View PostIf you think modern ARM has any semblance to a "clean" ISA you have no idea what you are talking about... you can't build a clean performant ISA, you have to make concessions for code density and performance.
Comment
-
-
Originally posted by coder View PostYeah, but also no.
Fujitsu made a deliberate decision to build a CPU that would fill both roles. In my opinion, it's fair to judge its efficiency on standard benchmarks, because efficiency has real world consequences, and that's what you're measuring.
Originally posted by Weasel View PostYeah it's smaller because it's slower on the same process node.
Moreover, TX2 was so fast as the best x86 despite using a worse 16nm node. And TX3 using 7nm is faster than any x86 processor. In fact the link shows performance benchmarks comparing TX3 with Rome and with Cascade Lake.
Comment
-
Comment