Announcement

Collapse
No announcement yet.

Axboe Achieves 8M IOPS Per-Core With Newest Linux Optimization Patches

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • RedEyed
    replied
    Originally posted by turboNOMAD View Post
    I think the question of MastaG is more like: do any of this optimizations even affect regular desktop use case? E.g. when loading a game on Steam, IO concurrency (queue depth) is very low, and existing software does not explicitly use the io_uring API. So the question is, will existing desktop applications (like Steam games) see any improvement at all, or the code paths being optimized are already taking negligible time/power (in desktop use case) before the optimizations?
    Regular desktop users use Windows, so it won't affect them

    Leave a comment:


  • bug77
    replied
    I looked at the article and all I saw was Schneider Weisse. I usually go for Tap 5, but I haven't seen it in quite a while

    Leave a comment:


  • caligula
    replied
    Originally posted by RedEyed View Post

    Potentially, any kind of optimisation is good for regular user.

    Even if it is not faster, it will use less power.
    Normal users don't use I/O that much. This is obvious when observing the decrease of the SSD cell lifespan every year. I also don't think e.g. SATA drive saturate even a single core in a 5900X or 11900K system.

    Leave a comment:


  • polarathene
    replied
    Originally posted by turboNOMAD View Post
    So the question is, will existing desktop applications (like Steam games) see any improvement at all, or the code paths being optimized are already taking negligible time/power (in desktop use case) before the optimizations?
    Not likely IOPS wise, since many NVMe IIRC (including PCIe 4.0 devices atm) only manage around 1M IOPS peak. The device being used here specializes in random IO more than sequential, and in these tests they're using more than one now as that device throughput maxed out around 5M IOPS, so they've been trying to max out how much IOPS a single core on their prosumer CPU can handle.

    For most of us that sort of workload isn't realistic, but just because we don't maximize throughput, we would still see the 2-3x perf improvement (referring to the improvements between 2.5M to 8M IOPS articles). AFAIK it's only IO_URING though, I don't know what actually supports that regarding typical desktop or gamer usage, perhaps if using a network share like Samba?

    On the plus side, we know it's considerably better than what preceded IO_URING, so any software that would stand to benefit and doesn't yet support IO_URING would probably consider the value of the support more worthwhile now

    TL;DR: If you would benefit, it'd be in latency reduction for I/O. Should be more noticeable with heavy random I/O tasks, provided they leverage IO_URING.

    Leave a comment:


  • Yttrium
    replied
    Originally posted by turboNOMAD View Post
    I think the question of MastaG is more like: do any of this optimizations even affect regular desktop use case? E.g. when loading a game on Steam, IO concurrency (queue depth) is very low, and existing software does not explicitly use the io_uring API. So the question is, will existing desktop applications (like Steam games) see any improvement at all, or the code paths being optimized are already taking negligible time/power (in desktop use case) before the optimizations?
    I dont think you would see any improvements. My disk IO is not anywhere near 8M at queue depth 128 and I can't think of any program I use that uses IO-uring anyway. Disk IO handling shouldn't use alot of power anyway so lower power states aren't applicable. But maybe with tommorows NVME drivers you'll be able to scale all the way up 8 million ops on your laptop.

    Leave a comment:


  • Yttrium
    replied
    Originally posted by blackshard View Post
    measuring throughput of an algorithm/api/whatever is sensible only when all the variables around stay the same.
    Those guys are all on different hardware and there is no comparison, so that's just pure numbers that actually tell nothing about the optimizations they do to the api...

    Paradoxically some kind of optimization for a setup may result in performance loss for another, where are the comparisons?
    It needs to be said that IO is inherently DISK LIMITED. No matter how many IO per core you can do, its limited to the disk speed you're attached to. Jens Axboe managed to do 8M IOPS on a single core, which is amazing because you would need less cores and operations to serve all disk bandwidth to your customers. Also, if you look at the pull request you'll see that the changes made are very general and not architecture specific so every architecture would see an improvement based on its capability. Most likely this would result in bigger cores and higher frequencies seeing more improvement but that's the case with all optimisations in general.

    Leave a comment:


  • turboNOMAD
    replied
    I think the question of MastaG is more like: do any of this optimizations even affect regular desktop use case? E.g. when loading a game on Steam, IO concurrency (queue depth) is very low, and existing software does not explicitly use the io_uring API. So the question is, will existing desktop applications (like Steam games) see any improvement at all, or the code paths being optimized are already taking negligible time/power (in desktop use case) before the optimizations?

    Leave a comment:


  • RedEyed
    replied
    Originally posted by MastaG View Post
    But could this also have a positive effect for a regular desktop user running a webbrowser and playing some games on Steam for example?
    Potentially, any kind of optimisation is good for regular user.

    Even if it is not faster, it will use less power.

    Leave a comment:


  • MastaG
    replied
    But could this also have a positive effect for a regular desktop user running a webbrowser and playing some games on Steam for example?

    Leave a comment:


  • George99
    replied
    Not only the same guy but also same hardware. Otherwise it would be pointless.

    Leave a comment:

Working...
X