
Apple iPad 2 As Fast As The Cray-2 Super Computer

Published on 16 September 2012 09:15 AM EDT
Written by Michael Larabel in Hardware

A university research director has shown that Apple's iPad 2 is as fast as the Cray-2 vector super-computer that Cray Research introduced in the 1980s. With some work on the software, the iPad 2's benchmark result is quite impressive.

At the IEEE High Performance Extreme Computing (HPEC) conference in Massachusetts this week, Piotr Luszczek, who serves as Research Director at the University of Tennessee, talked about the ARM landscape and embedded LINPACK benchmarking. The BoF presentation was entitled "Anatomy of a Globally Recursive Embedded LINPACK Benchmark."

The presentation's abstract was:
"We present a complete bottom-up implementation of an embedded LINPACK benchmark on iPad 2. We use a novel formulation of a recursive LU factorization that is recursive and parallel at the global scope. We believe our new algorithm presents an alternative to existing linear algebra parallelization techniques such as master-worker and DAG-based approaches. We show an assembly API that allows us a much higher level of abstraction and provides rapid code development within the confines of mobile device SDK. We use performance modeling to help with the limitation of the device and the limited access to device from the development environment not geared for HPC application tuning."

Luszczek has uploaded his slides to the UTK web-site, but what's interesting for the common Phoronix reader is just his Apple iPad 2 test results.
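
For readers unfamiliar with the term, the sketch below illustrates the general idea of a recursive LU factorization in plain Python. It is not Luszczek's globally recursive, parallel algorithm: it is a simplified serial version without pivoting, so it assumes a matrix (such as a diagonally dominant one) whose leading blocks are all non-singular. The function names are made up for this illustration.

```python
def matmul(a, b):
    """Plain triple-loop product of two list-of-lists matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def solve_lower_unit(l, b):
    """Solve L X = B for X, where L is unit lower triangular."""
    x = [row[:] for row in b]
    for i in range(len(l)):
        for k in range(i):
            for j in range(len(b[0])):
                x[i][j] -= l[i][k] * x[k][j]
    return x


def solve_upper_right(u, b):
    """Solve X U = B for X, where U is upper triangular."""
    x = [row[:] for row in b]
    for r in range(len(b)):
        for j in range(len(u)):
            for k in range(j):
                x[r][j] -= x[r][k] * u[k][j]
            x[r][j] /= u[j][j]
    return x


def recursive_lu(a):
    """Return (L, U) with A = L U by splitting A into 2x2 blocks.

    Assumption: no pivoting is done, so every leading block must be
    non-singular (true for diagonally dominant matrices).
    """
    n = len(a)
    if n == 1:
        return [[1.0]], [[float(a[0][0])]]
    h = n // 2
    a11 = [row[:h] for row in a[:h]]
    a12 = [row[h:] for row in a[:h]]
    a21 = [row[:h] for row in a[h:]]
    a22 = [row[h:] for row in a[h:]]

    l11, u11 = recursive_lu(a11)           # factor the top-left block
    u12 = solve_lower_unit(l11, a12)       # L11 U12 = A12
    l21 = solve_upper_right(u11, a21)      # L21 U11 = A21
    prod = matmul(l21, u12)
    schur = [[a22[i][j] - prod[i][j] for j in range(n - h)]
             for i in range(n - h)]        # Schur complement
    l22, u22 = recursive_lu(schur)         # recurse on what is left

    l = [l11[i] + [0.0] * (n - h) for i in range(h)]
    l += [l21[i] + l22[i] for i in range(n - h)]
    u = [u11[i] + u12[i] for i in range(h)]
    u += [[0.0] * h + u22[i] for i in range(n - h)]
    return l, u


if __name__ == "__main__":
    lower, upper = recursive_lu([[4, 3, 2], [2, 5, 1], [1, 2, 6]])
    print(lower)
    print(upper)
```

Multiplying the returned factors back together should reproduce the input matrix up to rounding error. A production LINPACK code would add pivoting and blocked, tuned kernels.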

The researcher mentioned that Apple introduced the Accelerate Framework in iOS 4 and was to include an optimized ATLAS (Automatically Tuned Linear Algebra Software) library inside it, just as ATLAS ships inside the Mac OS X framework. However, this framework didn't really work and there is no ATLAS for the iPhone/iPod/iPad, so he took things a lot further on his own.

Through a long process, Piotr Luszczek was able to learn more about the iPad hardware itself through nano and micro benchmarking. After this, he was able to create an optimized algorithm by writing a Python script that generates various assembly routines and tests each one to find the most efficient. A few ARM tweaks also got tossed into ATLAS BLAS.
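
The generate-and-test approach described above can be illustrated with a small, hypothetical sketch. Instead of ARM assembly, it generates Python source for a dot product at several loop-unrolling factors, times each variant, and keeps the fastest. The kernel, the unroll factors, and all function names are assumptions made for this illustration, not details taken from the slides.

```python
import timeit


def generate_dot_kernel(unroll):
    """Return Python source for a dot product unrolled `unroll` times."""
    terms = " + ".join(f"x[i + {k}] * y[i + {k}]" for k in range(unroll))
    lines = [
        f"def dot_u{unroll}(x, y):",
        "    n = len(x)",
        "    acc = 0.0",
        f"    main = n - n % {unroll}",
        f"    for i in range(0, main, {unroll}):",
        f"        acc += {terms}",
        "    for i in range(main, n):",   # leftover elements
        "        acc += x[i] * y[i]",
        "    return acc",
    ]
    return "\n".join(lines)


def build_kernel(unroll):
    """Compile the generated source and return the resulting function."""
    namespace = {}
    exec(generate_dot_kernel(unroll), namespace)
    return namespace[f"dot_u{unroll}"]


def autotune(unroll_factors, n=1000, repeats=5):
    """Time every generated variant and return (best_factor, timings)."""
    x = [float(i % 7) for i in range(n)]
    y = [float(i % 5) for i in range(n)]
    timings = {}
    for factor in unroll_factors:
        kernel = build_kernel(factor)
        runs = timeit.repeat(lambda: kernel(x, y), number=20, repeat=repeats)
        timings[factor] = min(runs)   # best-of-N reduces timing noise
    best = min(timings, key=timings.get)
    return best, timings


if __name__ == "__main__":
    best, timings = autotune([1, 2, 4, 8])
    print("fastest unroll factor:", best)
    for factor, seconds in sorted(timings.items()):
        print(factor, seconds)
```

Which variant wins depends on the machine and interpreter, which is the point of the technique: the search, not the programmer, picks the routine that runs best on the target hardware.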

When benchmarking the Apple iPad 2, Luszczek achieved 4 GFLOPS per Watt on the ARM SoC (measured at the chip level). As the chart below shows, he found the iPad 2 to be as fast as the Cray-2 super-computer from the 1980s. The Cray-2 was the fastest super-computer in the world when it premiered. Luszczek also found that the original iPad was about as fast as the Cray-1 super-computer. The latest iPad, meanwhile, is just a small performance bump over the iPad 2.
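
As background on how figures such as GFLOPS and GFLOPS per Watt are typically derived, here is a minimal sketch. It assumes the classic LINPACK operation count of 2n^3/3 + 2n^2 floating-point operations for solving an n-by-n dense system. The problem size, run time, and power draw in the usage section are placeholder values, not measurements from the presentation.

```python
def linpack_flops(n):
    """Nominal operation count for solving an n x n dense linear system."""
    return (2.0 * n ** 3) / 3.0 + 2.0 * n ** 2


def gflops(n, seconds):
    """Billions of floating-point operations per second for one solve."""
    return linpack_flops(n) / seconds / 1e9


def gflops_per_watt(n, seconds, watts):
    """Energy efficiency: GFLOPS divided by average power draw in Watts."""
    return gflops(n, seconds) / watts


if __name__ == "__main__":
    # Placeholder inputs chosen only to exercise the functions.
    size, elapsed, power = 1000, 1.0, 0.5
    print(gflops(size, elapsed))
    print(gflops_per_watt(size, elapsed, power))
```

The rate is nominal: the operation count comes from the formula, not from counting the instructions the processor actually executed.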

[Slide: chart comparing iPad performance with historical super-computers]

One interesting tidbit is that it took Cray Research one decade to go from the Cray-1 to the Cray-2, while Apple went from the original iPad to the faster second-generation iPad in just two years.

It will still be a long, long time though before any tablet can be as fast as ASCI Red/White, Sequoia, or the modern Cray Jaguar that has held a LINPACK performance record.

The other interesting slide had some numbers on the power and performance of various devices.

[Slide: power and performance numbers for various devices]

While losing badly on raw performance, the ARM Cortex-A9 delivered the best performance-per-Watt, beating the AMD FireStream 9370, NVIDIA Fermi M2050, AMD Magny-Cours Opteron 6180SE, Intel Westmere Xeon E7-8870, and Intel Atom N570. The ARM Cortex-A9 had a performance/power efficiency figure of about 4, while the AMD FireStream and NVIDIA Fermi GPUs came in at about 2.3, the AMD Opteron at about 1, and the Intel Xeon and Atom at about 0.75 on the same scale.


Next up may be some fun with more low-cost ARM development boards. Piotr Luszczek is looking towards ARMv8/AArch64 with 64-bit "goodness", working on double-buffering and vectorization-friendly storage, and possibly using OpenGL ES with shaders or tapping into NVIDIA's CUDA to extract more performance from ARM hardware.

If you are interested in maxing out ARM performance, see the Phoronix 12-core ARM cluster and then the 96-core ARMv7 solar-powered super-computer that was assembled out at MIT earlier this summer. Coming up in a matter of weeks will be the long-awaited many-core Calxeda ARM benchmarks premiering on Phoronix.

About The Author
Michael Larabel is the principal author of Phoronix.com and founded the web-site in 2004 with a focus on enriching the Linux hardware experience and being the largest web-site devoted to Linux hardware reviews. The site is aimed particularly at products relevant to Linux gamers and enthusiasts, but it also commonly reviews servers/workstations and embedded Linux devices. Michael has written more than 10,000 articles covering the state of Linux hardware support, Linux performance, graphics hardware drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated testing software.