Hi, I'm able to have fixed frequency with intel_pstate=disable intel_idle.max_cstate=0 processor.max_cstate=1 Intel(R) Xeon(R) Gold 6128 CPU @ 3.40GHz # cat /proc/cpuinfo |grep MHz cpu MHz : 3400.002 cpu MHz : 3399.994 cpu MHz : 3399.995 cpu MHz : 3399.994 cpu MHz : 3399.997 cpu MHz : 3399.998 cpu MHz : 3399.992 cpu MHz : 3399.989 cpu MHz : 3399.998 cpu MHz : 3399.994 cpu MHz : 3399.988 cpu MHz : 3399.987 cpu MHz : 3399.990 cpu MHz : 3399.990 cpu MHz : 3399.994 cpu MHz : 3399.996 cpu MHz : 3399.996 cpu MHz : 3399.985 cpu MHz : 3399.991 cpu MHz : 3399.981 cpu MHz : 3399.979 cpu MHz : 3399.993 cpu MHz : 3399.985 cpu MHz : 3399.985 ----- Mail original ----- De: "Wido den Hollander" <wido@xxxxxxxx> À: "Blair Bethwaite" <blair.bethwaite@xxxxxxxxx> Cc: "ceph-users" <ceph-users@xxxxxxxxxxxxxx>, "Nick Fisk" <nick@xxxxxxxxxx> Envoyé: Mercredi 16 Mai 2018 15:34:35 Objet: Re: Intel Xeon Scalable and CPU frequency scaling on NVMe/SSD Ceph OSDs On 05/16/2018 01:22 PM, Blair Bethwaite wrote: > On 15 May 2018 at 08:45, Wido den Hollander <wido@xxxxxxxx > <mailto:wido@xxxxxxxx>> wrote: > > > We've got some Skylake Ubuntu based hypervisors that we can look at to > > compare tomorrow... > > > > Awesome! > > > Ok, so results still inconclusive I'm afraid... > > The Ubuntu machines we're looking at (Dell R740s and C6420s running with > Performance BIOS power profile, which amongst other things disables > cstates and enables turbo) are currently running either a 4.13 or a 4.15 > HWE kernel - we needed 4.13 to support PERC10 and even get them booting > from local storage, then 4.15 to get around a prlimit bug that was > breaking Nova snapshots, so here we are. Where are you getting 4.16, > http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.16/ ? > Yes, that's where I got 4.16: 4.16.7-041607-generic I also tried 4.16.8, but that didn't change anything either. Server I am testing with are these: https://www.supermicro.nl/products/system/1U/1029/SYS-1029U-TN10RT.cfm > So interestingly in our case we seem to have no cpufreq driver loaded. > After installing linux-generic-tools (cause cpupower is supposed to > supersede cpufrequtils I think?): > > rr42-03:~$ uname -a > Linux rcgpudc1rr42-03 4.15.0-13-generic #14~16.04.1-Ubuntu SMP Sat Mar > 17 03:04:59 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux > > rr42-03:~$ cat /proc/cmdline > BOOT_IMAGE=/vmlinuz-4.15.0-13-generic root=/dev/mapper/vg00-root ro > intel_iommu=on iommu=pt intel_idle.max_cstate=0 processor.max_cstate=1 > I have those settings as well, intel_idle and processor.max_cstate. [ 1.776036] intel_idle: disabled That works, the CPUs stay in C0 or C1 according to i7z, but they are clocking down in Mhz, for example: processor : 23 vendor_id : GenuineIntel cpu family : 6 model : 85 model name : Intel(R) Xeon(R) Silver 4110 CPU @ 2.10GHz stepping : 4 microcode : 0x2000043 cpu MHz : 799.953 /sys/devices/system/cpu/intel_pstate/min_perf_pct is set to 100, but that setting doesn't seem to do anything. I'm running out of ideas :-) Wido > rr42-03:~$ lscpu > Architecture: x86_64 > CPU op-mode(s): 32-bit, 64-bit > Byte Order: Little Endian > CPU(s): 36 > On-line CPU(s) list: 0-35 > Thread(s) per core: 1 > Core(s) per socket: 18 > Socket(s): 2 > NUMA node(s): 2 > Vendor ID: GenuineIntel > CPU family: 6 > Model: 85 > Model name: Intel(R) Xeon(R) Gold 6150 CPU @ 2.70GHz > Stepping: 4 > CPU MHz: 3400.956 > BogoMIPS: 5401.45 > Virtualization: VT-x > L1d cache: 32K > L1i cache: 32K > L2 cache: 1024K > L3 cache: 25344K > NUMA node0 CPU(s): 0,2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34 > NUMA node1 CPU(s): 1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,33,35 > Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr > pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe > syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts > rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq > dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid > dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx > f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 > invpcid_single pti intel_ppin mba tpr_shadow vnmi flexpriority ept vpid > fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm mpx > rdt_a avx512f avx512dq rdseed adx smap clflushopt clwb intel_pt avx512cd > avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc > cqm_mbm_total cqm_mbm_local ibpb ibrs stibp dtherm ida arat pln pts pku > ospke > > rr42-03:~$ sudo cpupower frequency-info > analyzing CPU 0: > no or unknown cpufreq driver is active on this CPU > CPUs which run at the same hardware frequency: Not Available > CPUs which need to have their frequency coordinated by software: Not > Available > maximum transition latency: Cannot determine or is not supported. > Not Available > available cpufreq governors: Not Available > Unable to determine current policy > current CPU frequency: Unable to call hardware > current CPU frequency: Unable to call to kernel > boost state support: > Supported: yes > Active: yes > > > And of course there is nothing under sysfs (/sys/devices/system/cpu*). > But /proc/cpuinfo and cpupower-monitor show that we seem to be hitting > turbo freqs: > > rr42-03:~$ sudo cpupower monitor > |Nehalem || Mperf > PKG |CORE|CPU | C3 | C6 | PC3 | PC6 || C0 | Cx | Freq > 0| 0| 0| 0.00| 0.00| 0.00| 0.00|| 0.05| 99.95| 3391 > 0| 1| 4| 0.00| 0.00| 0.00| 0.00|| 0.02| 99.98| 3389 > 0| 2| 8| 0.00| 0.00| 0.00| 0.00|| 0.14| 99.86| 3067 > 0| 3| 6| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3385 > 0| 4| 2| 0.00| 0.00| 0.00| 0.00|| 0.09| 99.91| 3119 > 0| 8| 12| 0.00| 0.00| 0.00| 0.00|| 0.03| 99.97| 3312 > 0| 9| 16| 0.00| 0.00| 0.00| 0.00|| 0.11| 99.89| 3157 > 0| 10| 14| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3352 > 0| 11| 10| 0.00| 0.00| 0.00| 0.00|| 0.05| 99.95| 3390 > 0| 16| 20| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3387 > 0| 17| 24| 0.00| 0.00| 0.00| 0.00|| 0.22| 99.78| 3115 > 0| 18| 26| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3389 > 0| 19| 22| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3366 > 0| 20| 18| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3392 > 0| 24| 28| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3376 > 0| 25| 32| 0.00| 0.00| 0.00| 0.00|| 0.05| 99.95| 3390 > 0| 26| 34| 0.00| 0.00| 0.00| 0.00|| 0.03| 99.97| 3391 > 0| 27| 30| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3392 > 1| 0| 1| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3394 > 1| 1| 5| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3378 > 1| 2| 9| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3393 > 1| 3| 7| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3384 > 1| 4| 3| 0.00| 0.00| 0.00| 0.00|| 0.02| 99.98| 3391 > 1| 8| 13| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3390 > 1| 9| 17| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3391 > 1| 10| 15| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3360 > 1| 11| 11| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3393 > 1| 16| 21| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3397 > 1| 17| 25| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3391 > 1| 18| 27| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3376 > 1| 19| 23| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3334 > 1| 20| 19| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3387 > 1| 24| 29| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3377 > 1| 25| 33| 0.00| 0.00| 0.00| 0.00|| 0.01| 99.99| 3387 > 1| 26| 35| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3392 > 1| 27| 31| 0.00| 0.00| 0.00| 0.00|| 0.00|100.00| 3392 > > > On a similar node with the 4.13 kernel we get similar reports from > cpupower-monitor, but oddly on 4.13 /proc/cpuinfo shows all cores at > base 2700.000 (on 4.15 it updates). > > We can try 4.16 tomorrow. But I wonder why we are already seeing turbo > even at idle and you aren't... only thing I can think of is that it must > be because our cstates are disabled in BIOS, indeed when looking in > dmesg I see: > > [ 1.274325] intel_idle: disabled > > So it stands to reason that intel_idle.max_cstate=0 is doing nothing for > either of us. What do you see from intel_idle on 4.16? > > -- > Cheers, > ~Blairo _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com