Re: HTB and PRIO qdiscs introducing extra latency when output interface is saturated

Linux Advanced Routing and Traffic Control


 



I did a number of tests and there doesn't appear to be any noticeable
difference between using CPU and JIFFIES (HZ=1000) as the packet
scheduler clock source.

I didn't mention that the outgoing interface on the core router has two
IP addresses. One is VLAN tagged for the test network I'm running and
the other is not tagged, so the vast majority of outgoing traffic will
be tagged on the outgoing interface.


Question 1. When sending a packet on a tagged interface, the 4-byte
VLAN header is added between the source address and the EtherType
field. As far as I know the tag is added and removed in the device
driver, so it does not show up when using Ethereal or tcpdump. The
driver in use is the e1000, which supports VLANs. As a result the
minimum frame size for tagged packets is 68 bytes and the maximum is
1522 bytes.

When using MPU and overhead in HTB, does that mean that the overhead
should be increased to 42 (38 + 4 for the VLAN header) and the MPU to
88 (42 + 46)? There are both tagged and untagged frames on this
interface.
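
For what it's worth, my understanding of how the two parameters combine
(just a sketch, not the actual HTB kernel code, and wire_size() is a
name I made up) is that the charged size is the packet length plus
overhead, raised to the MPU if it comes out smaller:

#include <stdio.h>

/* hypothetical helper: how many bytes a packet is charged for,
 * given a fixed per-frame overhead and a minimum packet unit */
static unsigned int wire_size(unsigned int pkt_len,
                              unsigned int overhead,
                              unsigned int mpu)
{
        unsigned int size = pkt_len + overhead;

        return size < mpu ? mpu : size;
}

int main(void)
{
        /* presumably: untagged 14 eth hdr + 4 FCS + 8 preamble + 12 IFG = 38
         *             tagged   38 + 4 byte 802.1Q tag                  = 42 */
        unsigned int lens[] = { 40, 46, 576, 1500 };    /* IP packet sizes */
        unsigned int i;

        for (i = 0; i < sizeof(lens) / sizeof(lens[0]); i++)
                printf("ip=%4u  38/84 -> %4u   42/88 -> %4u\n",
                       lens[i],
                       wire_size(lens[i], 38, 84),
                       wire_size(lens[i], 42, 88));
        return 0;
}

If that reading is right, then on an interface carrying mostly tagged
frames 42/88 would be the closer fit, at the cost of slightly
overcharging the untagged ones.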


I have run tests using an MPU of 84 with an overhead of 38, and an MPU
of 88 with an overhead of 42, while increasing the ceil to 90, 95, 97
and 99 Mbit. There are spikes in the delay, and the higher I increase
the ceil, the higher the spikes are.

I have graphs of the tests I have done. The filename of each graph
should explain the settings that were used in the test: hysteresis set
to 0, CPU as the packet scheduler clock source, overhead and MPU (38
and 84 respectively) or VLAN overhead and MPU (42 and 88 respectively),
and then the ceil at either 90, 95 or 99 Mbit.


21957_hysteresis_CPU_mpu_overhead_95_ceil.png    
22762_hysteresis_CPU_vlan_mpu_overhead_90_ceil.png 
22875_hysteresis_CPU_vlan_mpu_overhead_99_ceil.png 
24135_hysteresis_CPU_mpu_overhead_90_ceil.png      
24143_hysteresis_CPU_vlan_mpu_overhead_95_ceil.png 
24262_hysteresis_CPU_mpu_overhead_99_ceil.png   

They can be found at

http://www.compsoc.nuigalway.ie/~jlynch/htb/

   

MPU and overhead seem to be used mainly in places where the frame size
is fixed. Does it make a difference to use them with Ethernet, where
the frame size is variable?
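
My rough reasoning for why I'd expect it still to matter (again only a
back-of-envelope sketch, reusing the 38/42 figures from above) is that
the fixed per-frame cost is a large fraction of the wire for small
frames and almost nothing for full-size ones, so with a mix that is
heavy on small packets the shaper would undercount noticeably without
it:

#include <stdio.h>

int main(void)
{
        /* percentage of the on-wire size taken up by the fixed
         * per-frame overhead, for a few IP packet sizes */
        unsigned int lens[] = { 46, 256, 576, 1500 };
        unsigned int i;

        for (i = 0; i < sizeof(lens) / sizeof(lens[0]); i++)
                printf("ip=%4u  untagged=%5.1f%%  tagged=%5.1f%%\n",
                       lens[i],
                       100.0 * 38 / (lens[i] + 38),
                       100.0 * 42 / (lens[i] + 42));
        return 0;
}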

When you said you could ceil at 99 Mbit OK, did you look at the max
delay? Did you notice spikes like those in my graphs? Do you have any
idea what could be causing these spikes?



Other observations: 

I was using values from /proc/net/dev to measure the throughput going
through the core router and I noticed that different network drivers
increment the byte counters differently. For a packet with the maximum
Ethernet payload of 1500 bytes, e1000 increments the byte counter by
1518 bytes, which includes the Ethernet header and trailer; 3c59x, on
the other hand, increments it by 1514 bytes, which does not include the
trailer. For VLAN tagged packets, e1000 increments the byte counter by
1522 bytes and 3c59x by 1518 bytes. Have you come across this before?
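
The differences seem to line up with which parts of the frame each
driver includes in the count; this little check (my interpretation of
the numbers only, I haven't gone through the driver sources) reproduces
the figures above:

#include <stdio.h>

#define ETH_HDR   14   /* dst MAC + src MAC + ethertype */
#define ETH_FCS    4   /* frame check sequence (trailer) */
#define VLAN_TAG   4   /* 802.1Q tag */

int main(void)
{
        unsigned int payload = 1500;   /* max Ethernet payload */

        printf("3c59x untagged: %u\n", payload + ETH_HDR);                       /* 1514 */
        printf("e1000 untagged: %u\n", payload + ETH_HDR + ETH_FCS);             /* 1518 */
        printf("3c59x tagged:   %u\n", payload + ETH_HDR + VLAN_TAG);            /* 1518 */
        printf("e1000 tagged:   %u\n", payload + ETH_HDR + ETH_FCS + VLAN_TAG);  /* 1522 */
        return 0;
}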

Also, the source of tc, specifically tc_util.c, refers to this page
http://physics.nist.gov/cuu/Units/binary.html for how rates are
specified. So I presume tc uses these units for rate, ceil, etc. This
doesn't seem to be mentioned anywhere.

static const struct rate_suffix {
        const char *name;
        double scale;
} suffixes[] = {
        { "bit",        1. },
        { "Kibit",      1024. },
        { "kbit",       1000. },
        { "mibit",      1024.*1024. },
        { "mbit",       1000000. },
        { "gibit",      1024.*1024.*1024. },
        { "gbit",       1000000000. },
        { "tibit",      1024.*1024.*1024.*1024. },
        { "tbit",       1000000000000. },
        { "Bps",        8. },
        { "KiBps",      8.*1024. },
        { "KBps",       8000. },
        { "MiBps",      8.*1024*1024. },
        { "MBps",       8000000. },
        { "GiBps",      8.*1024.*1024.*1024. },
        { "GBps",       8000000000. },
        { "TiBps",      8.*1024.*1024.*1024.*1024. },
        { "TBps",       8000000000000. },
        { NULL }
};
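
A simplified sketch of how I read that table being used (not a verbatim
quote of tc's parsing code; parse_rate_bps() is my own name, and it
assumes it is compiled together with the table above) - the practical
point being that "mbit" is decimal (10^6 bit/s) while "mibit" is binary
(2^20 bit/s):

#include <stdio.h>
#include <stdlib.h>
#include <strings.h>

static double parse_rate_bps(const char *str)
{
        char *end;
        double num = strtod(str, &end);
        const struct rate_suffix *s;

        if (end == str)
                return -1.;                     /* no number at all */
        if (*end == '\0')
                return num;                     /* bare number (simplification) */
        for (s = suffixes; s->name; s++)
                if (strcasecmp(s->name, end) == 0)
                        return num * s->scale;  /* scale converts to bit/s */
        return -1.;                             /* unknown suffix */
}

int main(void)
{
        printf("90mbit  = %.0f bit/s\n", parse_rate_bps("90mbit"));   /* 90000000 */
        printf("90mibit = %.0f bit/s\n", parse_rate_bps("90mibit"));  /* 94371840 */
        return 0;
}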


Jonathan



On Thu, 2005-08-11 at 17:36 +0100, Andy Furniss wrote: 
> Jonathan Lynch wrote:
> 
> > 
> > I don't understand what you mean when you say "if you could sample truly
> > randomly you would get a proper distribution". 
> > 
> > Also having the timers synchronized will allow for more accurate
> > measurements of the delay. I can't see how this would have an impact on
> > the pattern.
> 
> I mean it's possibly just to do with the test - if a 0ms - 1ms delay is 
> expected then you could see patterns arising, depending on how you 
> measure delay, clock drift or something.
> 
> Now I have two pings that do intervals < 1 sec - the inetutils GNU ping 
> guys implemented it for me :-), and I also have the iputils one, so I can 
> simulate a stream better.
> 
> While doing this I noticed that iputils ping actually gives lower 
> latency readings when sending many pps. Using tcpdump deltas I can see 
> the network latency is the same however many pps I do - it's just that 
> when measuring <1ms delays and doing many pps it seems that some code 
> gets cached (guess) and the reported delay changes as a result.
> 
> I mention that just to illustrate that measuring small delays can be 
> misleading and influenced by the exact nature of your setup.
> 
> 
> 
> > 
> > I have the default setting, which is jiffies. There is a comment in the
> > kernel config for the packet scheduler clock source which says of
> > Jiffies that "its resolution is too low for accurate shaping
> > except at very low speed". I will recompile the kernel and try the CPU
> > option tomorrow to see if there is any change.
> 
> Maybe not in the case of htb - I use CPU and see similar results, the 
> comment about accurate shaping was probably written when HZ=100, but I 
> suppose it will be better for something :-)
> 
> 
> 
> > 
> > I didn't realise such options existed for htb (mpu + overhead). These parameters are not mentioned in the man pages or in the htb manual. 
> > I presume I have to patch tc to get these features? 
> 
> 
> There is mention on the htb page - it was added as a patch so was not 
> designed in, which explains why burst doesn't use it.
> 
> You don't need to patch recent iproute2 - it's already in there.
> 
> Andy.

_______________________________________________
LARTC mailing list
LARTC@xxxxxxxxxxxxxxx
http://mailman.ds9a.nl/cgi-bin/mailman/listinfo/lartc
