10GbE Linux Networking Performance Between CentOS, Fedora, Clear Linux & Debian
Originally posted by ypnos
For proper utilization of 10GbE you need to do some work yourself:
- Increase the MTU to the hardware limit; 9000 is a good bet
- Use the maximum supported ring parameters (ethtool -g/-G)
- Set the number of channels to the number of CPU cores on the NIC's NUMA node (ethtool -l/-L)
- Pin the channel IRQs to those CPU cores on the NUMA node the NIC is connected to
- Also pin the application transmitting the data to CPU threads on that NUMA node
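A minimal sketch of those steps, assuming a NIC named eth0 whose local NUMA node is node 0 with cores 0-7 (all placeholder values; query the real limits with `ethtool -g`/`ethtool -l` and `ip -d link` first):

```shell
#!/bin/sh
IF=eth0   # placeholder interface name

# 1. Jumbo frames up to the hardware limit (9000 assumed supported)
ip link set dev "$IF" mtu 9000

# 2. Max out the ring buffers -- read the limits with `ethtool -g $IF` first
ethtool -G "$IF" rx 4096 tx 4096

# 3. One RSS channel per core on the NIC's NUMA node (8 cores assumed)
ethtool -L "$IF" combined 8

# 4. Pin each queue IRQ to one local core (stop irqbalance so it sticks)
systemctl stop irqbalance
core=0
for irq in $(awk -v d="$IF" '$NF ~ d {sub(":","",$1); print $1}' /proc/interrupts); do
    echo "$core" > "/proc/irq/$irq/smp_affinity_list"
    core=$((core + 1))
done

# 5. Pin the transmitting application to the same node, e.g. with iperf3:
numactl --cpunodebind=0 --membind=0 iperf3 -c 10.0.0.2
```

The commands need root and a real NIC, so treat them as a template rather than a copy-paste script.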
Originally posted by ypnos
On the RX side, pinning the application and IRQs to the NIC-adjacent NUMA node alone can be the difference between barely reaching 5-6 Gbps and maxing out the machine at ~200 Gbps.
... And yes, a dual-Xeon machine running Fedora Server 27 can passively monitor 16+ x 10 Gbps (or 4 x 40 Gbps) NICs with near-zero packet loss.
- Gilboa
oVirt-HV1: Intel S2600C0, 2xE5-2658V2, 128GB, 8x2TB, 4x480GB SSD, GTX1080 (to-VM), Dell U3219Q, U2415, U2412M.
oVirt-HV2: Intel S2400GP2, 2xE5-2448L, 120GB, 8x2TB, 4x480GB SSD, GTX730 (to-VM).
oVirt-HV3: Gigabyte B85M-HD3, E3-1245V3, 32GB, 4x1TB, 2x480GB SSD, GTX980 (to-VM).
Devel-2: Asus H110M-K, i5-6500, 16GB, 3x1TB + 128GB-SSD, F33.
Originally posted by fuzz
Are there ways to automate those settings? Otherwise it's useless for automated testing.
You can locate the NUMA node of the PCIe slot from lspci, compare that against the NUMA information from lscpu - this gives you the CPU cores closest to the NIC.
Now use ethtool to reduce the number of RSS queues to the number of cores on that NUMA node, and use the IRQ affinity interface (/proc/irq/*/smp_affinity) to assign one IRQ per CPU core.
Add some additional ethtool magic to configure the ring parameters, etc., and you should be done.
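As a sketch (not gilboa's actual script), those lookup steps could be wired together like this; the interface name is a placeholder, and the side-effecting ethtool/IRQ writes only run if the NIC actually exists:

```shell
#!/bin/sh
IF=${1:-enp1s0f0}   # placeholder NIC name

# Expand a kernel cpulist such as "0-3,8" into one core number per line.
expand_cpulist() {
    echo "$1" | tr ',' '\n' | while IFS=- read -r lo hi; do
        seq "$lo" "${hi:-$lo}"
    done
}

tune() {
    # NUMA node of the NIC's PCIe slot (lspci -vv shows the same "NUMA node:")
    node=$(cat "/sys/class/net/$IF/device/numa_node")

    # Cores on that node -- the same data lscpu reports per NUMA node
    cores=$(expand_cpulist "$(cat "/sys/devices/system/node/node$node/cpulist")")
    ncores=$(echo "$cores" | wc -l)

    # Reduce the RSS queues to the local core count ...
    ethtool -L "$IF" combined "$ncores"

    # ... then assign one queue IRQ per local core
    i=1
    for irq in $(awk -v d="$IF" '$NF ~ d {sub(":","",$1); print $1}' /proc/interrupts); do
        echo "$cores" | sed -n "${i}p" > "/proc/irq/$irq/smp_affinity_list"
        i=$((i + 1))
    done

    # Ring parameters etc. follow via further ethtool -G calls.
}

if [ -e "/sys/class/net/$IF" ]; then
    tune
fi
```

Note that irqbalance will undo the affinity writes unless it is stopped or told to ban those IRQs.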
- Gilboa
Originally posted by pegasus
Congrats on expanding into new benchmarking territory, but there are new dragons here. These numbers all seem way too low. I regularly max out 100Gbit on old Ivy Bridge storage nodes running CentOS 6, and that's with less than 30 minutes spent tuning them. 10Gbit today can be maxed out with a single core ...
There are many online. Broadcom have theirs, Intel have theirs, Mellanox have theirs ... They're mostly the same: tuning your TCP stack settings and congestion algorithms for LAN or WAN scenarios, pinning NIC interrupt processing threads to specific cores, enlarging the NIC queue lengths, making sure that offloads are enabled, etc. Mellanox even has a script that does all of that for you.
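For flavor, a few of the knobs such guides typically touch; the values here are illustrative placeholders, not recommendations from any particular vendor guide:

```shell
# TCP stack: raise socket buffer ceilings for high bandwidth-delay paths
sysctl -w net.core.rmem_max=67108864
sysctl -w net.core.wmem_max=67108864
sysctl -w net.ipv4.tcp_rmem="4096 87380 67108864"
sysctl -w net.ipv4.tcp_wmem="4096 65536 67108864"

# Congestion control: guides pick per LAN/WAN scenario (cubic, htcp, bbr, ...)
sysctl -w net.ipv4.tcp_congestion_control=cubic

# Larger NIC transmit queue, and confirm offloads are on (eth0 is a placeholder)
ip link set dev eth0 txqueuelen 10000
ethtool -K eth0 tso on gso on gro on
```

The sysctl changes need root and are lost on reboot unless persisted under /etc/sysctl.d/.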