LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Desktop (https://www.linuxquestions.org/questions/linux-desktop-74/)
-   -   Wierd Nouveau Problem after Kernel Upgrade to 6.2 (https://www.linuxquestions.org/questions/linux-desktop-74/wierd-nouveau-problem-after-kernel-upgrade-to-6-2-a-4175727811/)

AsgAnquietas 08-07-2023 12:24 PM

Wierd Nouveau Problem after Kernel Upgrade to 6.2
 
Hello,

After googling and attempting everything I can think of, I am giving up and asking for your help... :-(

In one classroom where I teach, I have an older PC which runs KUbuntu 22.04.3 LTS with the newly freshly upgraded Kernel to 6.2.0-26-generic version.

For the moment, I have 2 kernels:

Code:

administrator@instructor-pc [~] $ ls -l /boot/ |grep vmli
lrwxrwxrwx 1 root root        24 aug  3 17:58 vmlinuz -> vmlinuz-6.2.0-26-generic
-rw------- 1 root root  12224488 iul 10 20:41 vmlinuz-5.19.0-50-generic
-rw------- 1 root root  13770312 iul 13 18:31 vmlinuz-6.2.0-26-generic
lrwxrwxrwx 1 root root        25 aug  3 17:58 vmlinuz.old -> vmlinuz-5.19.0-50-generic

Booting the new Kernel breaks things.
Booting the old Kernel everything works greatly.

I have an old Nvidia Card:
Code:

administrator@instructor-pc [~] $ lspci |grep VGA
01:00.0 VGA compatible controller: NVIDIA Corporation G86 [GeForce 8500 GT] (rev a1)

Naturally, I use nouveau.
NVidia Drivers are no longer available for this type of card (in the normal repos).
Doing (ubuntu-drivers devices) does not reveal any available drivers.

And even if they were, I don't want to rebuild Nvidia module everytime a kernel upgrade is performed... from what I heard, NVidia Drivers are sometimes worse than Nouveau.

Nouveau works fine for me... worked, being the operative word... now, it cannot find/load 2 firwmare files...

Yes, I have already installed nouveau-firmware:
Code:

nouveau-firmware is already the newest version (20091212-0ubuntu1).
A little more info:

Code:

instructor-pc [~] # lshw -c video
  *-display               
      description: VGA compatible controller
      product: G86 [GeForce 8500 GT]
      vendor: NVIDIA Corporation
      physical id: 0
      bus info: pci@0000:01:00.0
      logical name: /dev/fb0
      version: a1
      width: 64 bits
      clock: 33MHz
      capabilities: pm msi pciexpress vga_controller bus_master cap_list rom fb
      configuration: depth=32 driver=nouveau latency=0 resolution=1440,900
      resources: irq:30 memory:fc000000-fcffffff memory:d0000000-dfffffff memory:fa000000-fbffffff ioport:bc00(size=128) memory:c0000-dffff
instructor-pc [~]

And the main culprit (if it is the culprit!):
Code:

instructor-pc [~] # dmesg |grep nouveau
[    4.284583] nouveau 0000:01:00.0: vgaarb: deactivate vga console
[    4.284640] nouveau 0000:01:00.0: NVIDIA G86 (086100a2)
[    4.387017] nouveau 0000:01:00.0: bios: version 60.86.34.00.13
[    4.408220] nouveau 0000:01:00.0: bios: M0203T not found
[    4.408229] nouveau 0000:01:00.0: bios: M0203E not matched!
[    4.408232] nouveau 0000:01:00.0: fb: 512 MiB DDR2
[    4.481632] nouveau 0000:01:00.0: DRM: VRAM: 512 MiB
[    4.481636] nouveau 0000:01:00.0: DRM: GART: 1048576 MiB
[    4.481640] nouveau 0000:01:00.0: DRM: TMDS table version 2.0
[    4.481643] nouveau 0000:01:00.0: DRM: DCB version 4.0
[    4.481646] nouveau 0000:01:00.0: DRM: DCB outp 00: 02000300 00000028
[    4.481650] nouveau 0000:01:00.0: DRM: DCB outp 01: 01000302 00020030
[    4.481653] nouveau 0000:01:00.0: DRM: DCB outp 02: 04011310 00000028
[    4.481655] nouveau 0000:01:00.0: DRM: DCB outp 03: 010223f1 00c0c080
[    4.481658] nouveau 0000:01:00.0: DRM: DCB conn 00: 1030
[    4.481661] nouveau 0000:01:00.0: DRM: DCB conn 01: 0100
[    4.481664] nouveau 0000:01:00.0: DRM: DCB conn 02: 0210
[    4.481666] nouveau 0000:01:00.0: DRM: DCB conn 03: 0211
[    4.481668] nouveau 0000:01:00.0: DRM: DCB conn 04: 0213
[    4.483330] nouveau 0000:01:00.0: DRM: MM: using CRYPT for buffer copies
[    4.492507] nouveau 0000:01:00.0: DRM: failed to create encoder 0/1/0: -19
[    4.492512] nouveau 0000:01:00.0: DRM: TV-1 has no encoders, removing
[    4.743588] [drm] Initialized nouveau 1.3.1 20120801 for 0000:01:00.0 on minor 0
[    4.783642] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[    4.785634] fbcon: nouveaudrmfb (fb0) is primary device
[    4.832010] nouveau 0000:01:00.0: [drm] fb0: nouveaudrmfb frame buffer device
[  14.793574] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[  74.911227] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[  79.750942] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[  79.793910] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[  80.479481] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[  80.520935] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[  80.580412] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[  80.622017] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[  82.130039] nouveau 0000:01:00.0: Direct firmware load for nouveau/nv84_xuc00f failed with error -2
[  82.130055] nouveau 0000:01:00.0: vp: unable to load firmware nouveau/nv84_xuc00f
[  82.130060] nouveau 0000:01:00.0: vp: init failed, -2
[  82.130914] nouveau 0000:01:00.0: Direct firmware load for nouveau/nv84_xuc103 failed with error -2
[  82.130921] nouveau 0000:01:00.0: bsp: unable to load firmware nouveau/nv84_xuc103
[  82.130925] nouveau 0000:01:00.0: bsp: init failed, -2
[  96.036322] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
[  97.532651] nouveau 0000:01:00.0: DRM: [DRM/00000002:dac-0000-0304] [LOAD_DETECT data:00000154] load:07 (ret:-5)
instructor-pc [~] #

So, I don't know what the hell to do anymore...

I love linux, I really do, that's why I purged Windows and installed Linux in my labs, but sometimes, these Kernel upgrades break things that WORK, things that actually WORK and really cuts of people's enthusiasm for using Linux & Opensource software in different environments...

If anyone has an ideea, I'm all ears (or more accurately, eyes) :-)

Thank you.

mrmazda 08-08-2023 01:53 AM

1-why the installation of 6.2 kernel?

2-from where did you get it? installed how?

3-are you aware that nouveau has multiple meanings? See here.

4-using 6.2, what's the first apparent evidence something is broken, specific symptoms?

I would check the source of your 6.2 kernel for a newer libdrm-nouveau2.

I have G84 and G98 running various current and devel distros with no apparent issues. This is from an even older one:
Code:

# inxi -GSaz --vs --zl --hostname
inxi 3.3.28-00 (2023-07-10)
System:
  Host: mcp61 Kernel: 6.4.0-1-amd64 arch: x86_64 bits: 64 compiler: gcc
    v: 13.1.0 clocksource: tsc available: acpi_pm parameters: root=LABEL=<filter>
    ipv6.disable=1 net.ifnames=0 noresume consoleblank=0 preempt=full
    mitigations=off
  Desktop: Trinity v: R14.1.1~[DEVELOPMENT] tk: Qt v: 3.5.0 info: kicker
    wm: Twin v: 3.0 vt: 7 dm: 1: TDM 2: XDM Distro: Debian GNU/Linux trixie/sid
Graphics:
  Device-1: NVIDIA C61 [GeForce 6150SE nForce 430] vendor: Micro-Star MSI
    driver: nouveau v: kernel non-free: series: 304.xx status: legacy (EOL)
    last: release: 304.137 kernel: 4.13 xorg: 1.19 arch: Curie
    process: 90-130nm built: 2003-13 ports: active: VGA-1 empty: none
    bus-ID: 00:0d.0 chip-ID: 10de:03d0 class-ID: 0300
  Display: x11 server: X.Org v: 1.21.1.7 driver: X: loaded: modesetting
    unloaded: fbdev,vesa alternate: nouveau,nv dri: nouveau gpu: nouveau
    display-ID: :0 screens: 1
  Screen-1: 0 s-res: 1680x1050 s-dpi: 108 s-size: 395x246mm (15.55x9.69")
    s-diag: 465mm (18.32")
  Monitor-1: VGA-1 model: Dell P2213 serial: <filter> built: 2012
    res: 1680x1050 hz: 60 dpi: 90 gamma: 1.2 size: 473x296mm (18.62x11.65")
    diag: 558mm (22") ratio: 16:10 modes: max: 1680x1050 min: 720x400
  API: OpenGL v: 2.1 Mesa 22.3.6 renderer: NV4C direct-render: Yes
# lsmod | egrep 'veau|vid' | sort
button                24576  1 nouveau
drm                  720896  7 drm_kms_helper,drm_display_helper,drm_ttm_helper,ttm,nouveau
drm_display_helper    212992  1 nouveau
drm_kms_helper        253952  4 drm_display_helper,nouveau
drm_ttm_helper        12288  1 nouveau
i2c_algo_bit          12288  1 nouveau
mxm_wmi                12288  1 nouveau
nouveau              2887680  1
ttm                  102400  2 drm_ttm_helper,nouveau
video                  69632  1 nouveau
wmi                    36864  3 video,mxm_wmi,nouveau
# dpkg-query -W | egrep 'mwar|veau|xserver-xorg'
firmware-sof-signed    2.2.6-1
libdrm-nouveau2:amd64  2.4.115-1
xserver-xorg    1:7.7+23
xserver-xorg-core      2:21.1.7-3
xserver-xorg-input-libinput    1.2.1-1+b1
xserver-xorg-legacy    2:21.1.7-3
xserver-xorg-video-fbdev        1:0.5.0-2
xserver-xorg-video-vesa 1:2.5.0-1+b1
#


AsgAnquietas 08-08-2023 02:48 AM

Hello,

1. This kernel came naturally in my "apt update ; apt full-upgrade" update script.
I assume it is the newest kernel for the 22.04.3 LTS

2. From the repos. Normal update script.

3. By "nouveau" I mean the GPU drivers. Opensource replacement drivers for Nvidia drivers.

4. Booting the new kernel does not recognize my second monitor. (and I see those console messages saying that it failed to load that firmware)
Booting the old 5.* kernel instantly recognizez my second monitor . (I did not check those console messages with the old kernel).

It is clearly something with the new kernel that affects the Nouveau of Firwmare or whatever that my second monitor does not get input from my GPU card...

I will check libdrm-nouveau2 when I get to my lab... however, I already issued an "apt full-upgrade" on that system... It should have been updated, right ?...

mrmazda 08-08-2023 04:51 AM

I did a dist-upgrade 3 weeks ago and didn't get 22.04.3. I just finished doing a repeat. That got me 22.04.3, but yet another 5.15 kernel:
Code:

# grep upgrade .bash_history | tail -3
time apt-get upgrade
time apt-get full-upgrade
time apt-get dist-upgrade
# grep RETT /etc/os-release
PRETTY_NAME="Ubuntu 22.04.3 LTS"
# xdriinfo
Screen 0: nouveau
# inxi -GSazr --vs --zl --hostname
inxi 3.3.28-00 (2023-07-10)
System:
  Host: big41 Kernel: 5.15.0-78-generic arch: x86_64 bits: 64 compiler: gcc
    v: 11.3.0 parameters: ro root=LABEL=<filter> net.ifnames=0 ipv6.disable=1
    noresume plymouth.enable=0 consoleblank=0 preempt=full mitigations=off
  Desktop: Trinity v: R14.1.0 tk: Qt v: 3.5.0 info: kicker wm: Twin v: 3.0
    vt: 7 dm: 1: TDM 2: XDM Distro: Ubuntu 22.04.3 LTS (Jammy Jellyfish)
Graphics:
  Device-1: NVIDIA G84 [GeForce 8600 GT] vendor: XFX Pine driver: nouveau
    v: kernel alternate: nvidiafb non-free: series: 340.xx status: legacy (EOL)
    last: release: 340.108 kernel: 5.4 xorg: 1.20 arch: Tesla process: 40-80nm
    built: 2006-13 pcie: gen: 1 speed: 2.5 GT/s lanes: 16 ports:
    active: DVI-I-1,DVI-I-2 empty: none bus-ID: 01:00.0 chip-ID: 10de:0402
    class-ID: 0300 temp: 78.0 C
  Display: x11 server: X.Org v: 1.21.1.4 driver: X: loaded: modesetting
    alternate: fbdev,vesa dri: nouveau gpu: nouveau display-ID: :0 screens: 1
  Screen-1: 0 s-res: 3600x1200 s-dpi: 120 s-size: 762x254mm (30.00x10.00")
    s-diag: 803mm (31.62")
  Monitor-1: DVI-I-1 pos: primary,left model: NEC EA243WM serial: <filter>
    built: 2011 res: 1920x1200 hz: 60 dpi: 94 gamma: 1.2
    size: 519x324mm (20.43x12.76") diag: 612mm (24.1") ratio: 16:10 modes:
    max: 1920x1200 min: 640x480
  Monitor-2: DVI-I-2 pos: right model: Dell P2213 serial: <filter>
    built: 2012 res: 1680x1050 hz: 60 dpi: 90 gamma: 1.2
    size: 473x296mm (18.62x11.65") diag: 558mm (22") ratio: 16:10 modes:
    max: 1680x1050 min: 720x400
  API: OpenGL v: 3.3 Mesa 23.0.4-0ubuntu1~22.04.1 renderer: NV84
    direct-render: Yes

Repos:
  Active apt repos in: /etc/apt/sources.list
    1: deb http://us.archive.ubuntu.com/ubuntu/ jammy main restricted universe multiverse
    2: deb http://us.archive.ubuntu.com/ubuntu/ jammy-updates main restricted universe multiverse
    3: deb http://us.archive.ubuntu.com/ubuntu/ jammy-backports main restricted universe multiverse
    4: deb http://archive.canonical.com/ubuntu jammy partner
    5: deb http://security.ubuntu.com/ubuntu jammy-security main restricted universe multiverse
    6: deb http://mirror.ppa.trinitydesktop.org/trinity/deb/trinity-r14.1.x jammy main deps
  Active apt repos in: /etc/apt/sources.list.d/debugpkgs.list
    1: deb http://ddebs.ubuntu.com/ jammy main restricted universe multiverse
# lsmod | egrep 'vid|veau' | sort
drm                  622592  6 drm_kms_helper,drm_ttm_helper,ttm,nouveau
drm_kms_helper        311296  1 nouveau
drm_ttm_helper        16384  1 nouveau
i2c_algo_bit          16384  1 nouveau
mxm_wmi                16384  1 nouveau
nouveau              2306048  2
ttm                    86016  2 drm_ttm_helper,nouveau
video                  65536  1 nouveau
wmi                    32768  2 mxm_wmi,nouveau
# dpkg-query -W | egrep 'veau|mwar|drm' | sort
firmware-sof-signed    2.0-1ubuntu4.1
libdrm2:amd64  2.4.113-2~ubuntu0.22.04.1
libdrm-amdgpu1:amd64    2.4.113-2~ubuntu0.22.04.1
libdrm-common  2.4.113-2~ubuntu0.22.04.1
libdrm-intel1:amd64    2.4.113-2~ubuntu0.22.04.1
libdrm-nouveau2:amd64  2.4.113-2~ubuntu0.22.04.1
libdrm-radeon1:amd64    2.4.113-2~ubuntu0.22.04.1
linux-firmware  20220329.git681281e4-0ubuntu3.17
nouveau-firmware        20091212-0ubuntu1
# locate nv84_xuc103
# dmesg | grep veau
[    1.001884] nouveau 0000:01:00.0: vgaarb: deactivate vga console
[    1.002914] nouveau 0000:01:00.0: NVIDIA G84 (084200a2)
[    1.106246] nouveau 0000:01:00.0: bios: version 60.84.58.00.11
[    1.127017] nouveau 0000:01:00.0: fb: 512 MiB GDDR3
[    1.189718] nouveau 0000:01:00.0: DRM: VRAM: 512 MiB
[    1.189729] nouveau 0000:01:00.0: DRM: GART: 1048576 MiB
[    1.189734] nouveau 0000:01:00.0: DRM: TMDS table version 2.0
[    1.189738] nouveau 0000:01:00.0: DRM: DCB version 4.0
[    1.189741] nouveau 0000:01:00.0: DRM: DCB outp 00: 02000300 00000028
[    1.189745] nouveau 0000:01:00.0: DRM: DCB outp 01: 01000302 00010030
[    1.189749] nouveau 0000:01:00.0: DRM: DCB outp 02: 04011310 00000028
[    1.189752] nouveau 0000:01:00.0: DRM: DCB outp 03: 02011312 00010030
[    1.189756] nouveau 0000:01:00.0: DRM: DCB outp 04: 010223f1 00c0c080
[    1.189760] nouveau 0000:01:00.0: DRM: DCB conn 00: 2030
[    1.189763] nouveau 0000:01:00.0: DRM: DCB conn 01: 1130
[    1.189766] nouveau 0000:01:00.0: DRM: DCB conn 02: 0210
[    1.189769] nouveau 0000:01:00.0: DRM: DCB conn 03: 0211
[    1.189772] nouveau 0000:01:00.0: DRM: DCB conn 04: 0213
[    1.191292] nouveau 0000:01:00.0: DRM: MM: using CRYPT for buffer copies
[    1.193341] nouveau 0000:01:00.0: DRM: failed to create encoder 0/1/0: -19
[    1.193347] nouveau 0000:01:00.0: DRM: TV-1 has no encoders, removing
[    1.196452] nouveau 0000:01:00.0: therm: temperature (89 C) hit the 'fanboost' threshold
[    1.288932] nouveau 0000:01:00.0: DRM: allocated 1920x1200 fb: 0x70000, bo (____ptrval____)
[    1.304549] fbcon: nouveaudrmfb (fb0) is primary device
[    1.497275] nouveau 0000:01:00.0: [drm] fb0: nouveaudrmfb frame buffer device
[    1.511950] [drm] Initialized nouveau 1.3.1 20120801 for 0000:01:00.0 on minor 0
[    1.546826] nouveau 0000:01:00.0: therm: temperature (86 C) went below the 'fanboost' threshold
#

Aptitude search shows 6.2 kernels available. Besides using apt instead of apt-get, what did you do that got you one installed?

AsgAnquietas 08-08-2023 05:17 AM

I just discovered this now.

On my systems which I freshly installed using Kubuntu 22.04.2 , I already had the "linux-image-generic-hwe-22.04" package.
This automatically pulled in the new 6.2 kernel as soon as it was available.

On my other systems on which I originally had 20.04 LTS and upgraded to 22.04 LTS, I was missing the above package.
I've installed it:

Code:

apt install linux-image-generic-hwe-22.04
and this got me the new 6.2 kernel on these systems as well.

The key, I supose, is to have that package installed to pull in the new kernel earlier than it would normally be (backport ?)

All nice and great, real happy for 6.2 kernel, but my nouveau problem still bugs me... :-(

mrmazda 08-08-2023 06:25 AM

All seems good here on G84 with 6.2:
Code:

# uname -r
6.2.0-26-generic
# lspci | grep VGA
01:00.0 VGA compatible controller: NVIDIA Corporation G84 [GeForce 8600 GT] (rev a1)
# lsmod | egrep 'vid|veau' | sort
drm                  696320  7 drm_kms_helper,drm_display_helper,drm_ttm_helper,ttm,nouveau
drm_display_helper    212992  1 nouveau
drm_kms_helper        249856  4 drm_display_helper,nouveau
drm_ttm_helper        16384  1 nouveau
i2c_algo_bit          16384  1 nouveau
mxm_wmi                16384  1 nouveau
nouveau              2826240  1
ttm                  110592  2 drm_ttm_helper,nouveau
video                  73728  1 nouveau
wmi                    40960  3 video,mxm_wmi,nouveau
#

Installing HWE only added:
Code:

# dpkg-query -W | grep 6.2.0
linux-image-6.2.0-26-generic    6.2.0-26.26~22.04.1
linux-image-generic-hwe-22.04  6.2.0.26.26~22.04.7
linux-modules-6.2.0-26-generic  6.2.0-26.26~22.04.1
linux-modules-extra-6.2.0-26-generic    6.2.0-26.26~22.04.1
#

The installation process did cause a lot of "W: Possible missing firmware" messages (all including string nvidia, none nouveau), but I see a lot of that here, with no apparent ill effect.

AsgAnquietas 08-09-2023 02:40 PM

Solved using nvidia-340 for Kubuntu 22.04.3 LTS.

Installed nvidia-340 from kelebek333 PPA, as this package is no longer in the main repos...

Code:

sudo add-apt-repository ppa:kelebek333/nvidia-legacy
sudo apt update
sudo apt install nvidia-340 nvidia-340-updates nvidia-340-updates-dev xorg-modulepath-fix

Rebooted and driver seems to be working OK...

Original reference: https://www.reddit.com/r/linux_gamin...204/?rdt=55778

h2-1 08-22-2023 12:53 PM

Be warned, 340 driver is end of life, that solution is only going to work for a little bit longer, unless you don't change kernels, then it will be fine as long as you run that kernel.

Don't upgrade to newer kernel, I don't have the direct info where 340 latest kernel support ends, but many drivers are failing on 6.5 except for very latest release, and there is no latest for 340 as far as I know.


All times are GMT -5. The time now is 08:08 PM.