FS#29850 - [linux] 3.3.5 - 3.4.2 Load average much higher double the power usage
Attached to Project:
Arch Linux
Opened by Jonas Jelten (TheJJ) - Friday, 11 May 2012, 21:58 GMT
Last edited by Tobias Powalowski (tpowa) - Monday, 05 November 2012, 15:01 GMT
Opened by Jonas Jelten (TheJJ) - Friday, 11 May 2012, 21:58 GMT
Last edited by Tobias Powalowski (tpowa) - Monday, 05 November 2012, 15:01 GMT
|
Details
Description:
Problem first described here: https://bbs.archlinux.org/viewtopic.php?pid=1100418 I recently noticed that my wattage and load average was much higher than before. This happens since kernel 3.3.5, my load average in idle is 0.91 0.75 0.68 just i3 and urxvt running. Was about 0.02 0.01 0.00 before the kernel update. Also my wattage has more than doubled, needed 7-9W before, now 15-25W (in idle). This also affects the battery runtime slightly (now 3h instead of ~9h). powertop(2), top, iotop, atop, xrestop show nothing special, i can post details if anyone wanted. I don't know for sure if it was the kernel update, but i've got 2 friends with different laptops, and both have the same issues (all 3 have intel cpus). Additional info: * Linux 3.3.5-1-ARCH #1 SMP PREEMPT Mon May 7 19:57:51 CEST 2012 x86_64 Intel(R) Core(TM) i5-2520M CPU @ 2.50GHz GenuineIntel GNU/Linux iostat: Linux 3.3.5-1-ARCH (jjpad) 05/11/2012 _x86_64_ (4 CPU) avg-cpu: %user %nice %system %iowait %steal %idle 2.65 0.00 0.83 0.04 0.00 96.47 Device: tps kB_read/s kB_wrtn/s kB_read kB_wrtn sda 14.90 139.40 71.44 404026 207054 free -m total used free shared buffers cached Mem: 7873 1209 6664 0 105 404 -/+ buffers/cache: 699 7173 Swap: 0 0 0 Now, writing the bug report with firefox: load average: 1.04, 0.82, 0.72 Steps to reproduce: upgrade to 3.3.5, and watch the system load. if more information is needed, i will post it. |
This task depends upon
What I did was this:
~ $ modinfo i915 | tail -n 16
depends: drm,drm_kms_helper,intel-gtt,i2c-core,video,button,i2c-algo-bit,intel-agp
//{...}
vermagic: 3.4.0-1-ARCH SMP preempt mod_unload modversions
//{...}
parm: i915_enable_rc6:Enable power-saving render C-state 6. Different stages can be selected via bitmask values (0 = disable; 1 = enable rc6; 2 = enable deep rc6; 4 = enable deepest rc6). For example, 3 would enable rc6 and deep rc6, and 7 would enable everything. default: -1 (use per-chip default) (int)
//{...}
This reveals some cool new i915 parameters, especially, that the enable_rc6 parameter CHANGED.
We intel igp users have set i915_enable_rc6=1 at the time it was essential, this now wastes energy.
When I changed it to
=========================================
i915_enable_rc6=7
=========================================
my laptop stopped heating the room with ~5-15W.
Ah, by the way, the energy problem was still persistant with stock 3.4-ARCH kernel and Intel KMS driver.
The high load is still not fixed.
Powertop Result: total idle, wifi connected, urxvt and i3 running
The battery reports a discharge rate of 9.03 W
The estimated remaining time is 379 minutes
Summary: 31.7 wakeups/second, 0.0 GPU ops/second, 0.0 VFS ops/sec and 0.4% CPU use
Power est. Usage Events/s Category Description
4.44 W 0.0 pkts/s Device Network interface: eth0 (e1000e)
2.93 W 66.7% Device Display backlight
554 mW 11.7 pkts/s Device Network interface: wlan0 (iwlwifi)
68.8 mW 267.3 µs/s 7.8 Interrupt [6] tasklet(softirq)
34.8 mW 80.8 µs/s 4.0 kWork ieee80211_iface_work
23.8 mW 432.0 µs/s 2.7 Process i3status
15.9 mW 11.5 µs/s 1.8 Timer clocksource_watchdog
15.0 mW 5.8 µs/s 1.7 Timer intel_gpu_idle_timer
14.1 mW 214.0 µs/s 1.6 Interrupt [9] acpi
12.3 mW 95.2 µs/s 1.4 Interrupt [1] timer(softirq)
8.82 mW 10.3 µs/s 1.0 kWork pci_pme_list_scan
7.50 mW 96.1 µs/s 0.9 Interrupt [42] i915
6.61 mW 84.3 µs/s 0.8 Process /usr/lib/upower/upowerd
6.17 mW 1.0 ms/s 0.7 Process /usr/bin/X -nolisten tcp vt07 -auth /var/run/slim.auth
etc.
Yesterday, I noticed the issue. I was around 1.20 1.14 0.98 just with firefox and mumble. Even if the system is in idle (programs disabled), the load is surprisingly high.
Switched to 3.3.4 this morning and it seems to be much better.
Just for your information, I'm running AMD + NVIDIA (Phenom II X4 955 BE + GTX460).
(a) this is not a problem of kernel using more resources, but rather an issue on how it reports their usage;
(b) I personally haven't noticed any increase in system temperature or decrease in battery life, so those of you who experience higher temps/battery consumption -- look somewhere else.
changing i915_enable_rc6=7 helped a lot.
with 3.3.7-1-ARCH ,32bits, on AMD x2 5000+,
i have an high load-average in idle.
testing/linux-3.4.1-1 fixes the ``problem'' for me... thanks for a quick update.
In idle with only few sshfs mounts (over wpa2 wifi) and open mupdf's load average is like this:
$ uptime
19:52:05 up 3:51, 1 user, load average: 0.02, 0.07, 0.06
After watching an html5 video on the firefox homepage:
$ uptime
19:59:40 up 3:59, 1 user, load average: 0.35, 0.45, 0.24
which is reasonable I guess.
Overall I think that now load averages are reported accurately, albeit of course differently than in versions <=3.3.5.
Kernel - Linux arch 3.4.1-1-ARCH #1 SMP PREEMPT Tue Jun 5 09:05:01 CEST 2012 x86_64 GNU/Linux
Kernel boot line - i915.i915_enable_rc6=1 i915.i915_enable_fbc=1 i915.lvds_downclock=1 drm.vblankoffdelay=1
Sadly, it's been pushed to stable/core with all terrible power problems.
Yes, read the launchpad report above.
> Sadly, it's been pushed to stable/core with all terrible power problems.
If there are power problems, they are "routine" (because drivers change, kernel's power consumption usually fluctuates as opposed to decreasing monotonically with increasing version) and not related to the loadavg readings.
Tobias Powalowski: why do you maintain the kernel package when you don't report bugs to the kernel developers?
Still getting the same issues like described above - high system load when idling, significantly reduced battery life and CPU temp increased by almost 10 degrees (when idling)
This happens BOTH on my Lenovo g560 and Asus EeePC 1201NL.
uname -a
Linux cri-arch 3.4.4-1-ARCH #1 SMP PREEMPT Sat Jun 23 10:53:18 CEST 2012 x86_64 GNU/Linux
lspci
00:00.0 Host bridge: Intel Corporation Core Processor DRAM Controller (rev 02)
00:01.0 PCI bridge: Intel Corporation Core Processor PCI Express x16 Root Port (rev 02)
00:16.0 Communication controller: Intel Corporation 5 Series/3400 Series Chipset HECI Controller (rev 06)
00:1a.0 USB controller: Intel Corporation 5 Series/3400 Series Chipset USB2 Enhanced Host Controller (rev 05)
00:1b.0 Audio device: Intel Corporation 5 Series/3400 Series Chipset High Definition Audio (rev 05)
00:1c.0 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 1 (rev 05)
00:1c.1 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 2 (rev 05)
00:1c.2 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 3 (rev 05)
00:1c.4 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 5 (rev 05)
00:1d.0 USB controller: Intel Corporation 5 Series/3400 Series Chipset USB2 Enhanced Host Controller (rev 05)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev a5)
00:1f.0 ISA bridge: Intel Corporation Mobile 5 Series Chipset LPC Interface Controller (rev 05)
00:1f.2 SATA controller: Intel Corporation 5 Series/3400 Series Chipset 4 port SATA AHCI Controller (rev 05)
00:1f.3 SMBus: Intel Corporation 5 Series/3400 Series Chipset SMBus Controller (rev 05)
00:1f.6 Signal processing controller: Intel Corporation 5 Series/3400 Series Chipset Thermal Subsystem (rev 05)
01:00.0 VGA compatible controller: NVIDIA Corporation GT218 [GeForce 310M] (rev a2)
01:00.1 Audio device: NVIDIA Corporation High Definition Audio Controller (rev a1)
06:00.0 Network controller: Broadcom Corporation BCM4313 802.11b/g/n Wireless LAN Controller (rev 01)
07:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8101E/RTL8102E PCI Express Fast Ethernet controller (rev 02)
ff:00.0 Host bridge: Intel Corporation Core Processor QuickPath Architecture Generic Non-core Registers (rev 02)
ff:00.1 Host bridge: Intel Corporation Core Processor QuickPath Architecture System Address Decoder (rev 02)
ff:02.0 Host bridge: Intel Corporation Core Processor QPI Link 0 (rev 02)
ff:02.1 Host bridge: Intel Corporation Core Processor QPI Physical 0 (rev 02)
ff:02.2 Host bridge: Intel Corporation Core Processor Reserved (rev 02)
ff:02.3 Host bridge: Intel Corporation Core Processor Reserved (rev 02)
yaourt -Q nvidia
extra/nvidia 302.17-1
more info available on request :)
So the load seems not only to depend on the kernel version > 3.3.5, but also on an unknown factor. But it may be possible that the scheduler has also been changed in the linux-lts version?
For now, I stay on 3.3.4.
https://lkml.org/lkml/2012/7/14/186
Temperature is still too much to be usable.
And this issue becomes really boring...
Now, my load seems to be better even if it's a bit different compared to 3.3.4.
It's still higher than before...
The CPU (SandyBridge i5-2520) is in C7 sleep in 99 % of the time. No 2d/3d load except urxvt.
Anyone already figured out what exactly is wasting the electricity?
My bets: PCIe or i915, but as users with AMD are also affected, i think it's some ACPI or PCIe ASPM issue.
By the way, Bill Gate's oppinion on ACPI once was this (1999): http://antitrust.slated.org/www.iowaconsumercase.org/011607/3000/PX03020.pdf
https://wiki.ubuntu.com/Kernel/PowerManagement/PowerSavingTweaks
Watching videos on Youtube + HTML5 and doing work on terminals, my load is around 1.2 last minute and 0.97 last five minutes.
My setup is minimal as hell (DWM), I really don't get it.
Right now (writing these lignes), I have 0.18 0.24 0.23 (compared to 0.01 0.03 0.05 on 3.3.4).
Please note:
(a) DWM/i3/openbox/... are NOT minimal. It's a myth. They are all very inefficient when it comes to window redrawing compared to more advanced window managers in major DEs.
(b) Commit 7490d0a4cfefa16f9d8ce636eb5b2e13d2432db3 in linux 3.4.6.
For example, I have right after logging to flyspray:
$ uptime
14:12:14 up 4:21, 1 user, load average: 0.16, 0.12, 0.08
and after typing this message
$ uptime
14:15:00 up 4:24, 1 user, load average: 0.01, 0.07, 0.06
(x220t, intel i5-2520M)
https://bbs.archlinux.org/viewtopic.php?pid=1144709