Announcement

Collapse
No announcement yet.

Everything that uses EGL segfaults with AMDGPU-PRO on Ubuntu 16.04?

Collapse
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Everything that uses EGL segfaults with AMDGPU-PRO on Ubuntu 16.04?

    Plenty of details here:
    http://paste.ubuntu.com/16598214/

    Maybe bridgman has an idea?

    This includes GDM and gnome-shell so Ubuntu Gnome is pretty useless. The new Feral launcher also uses EGL so they crash too.

  • #2
    Looks like HW is a Fury but not much other kernel info. Are you running with the DKMS package from the amdgpu-pro stack or stock kernel or something else ?

    Comment


    • #3
      Both with bleeding edge kernel from git and it's included module and Ubuntu 4.4.0-23-generic
      +dkms.
      Here you can see that it's really using the DKMS module:

      modinfo amdgpu
      filename: /lib/modules/4.4.0-23-generic/updates/dkms/amdgpu.ko
      version: 16.20.3
      license: GPL and additional rights
      description: AMD GPU
      author: AMD linux driver team
      firmware: radeon/hawaii_smc.bin


      dmesg | grep -i -e amd -e drm -e dri
      Code:
      [ 0.000000] Command line: BOOT_IMAGE=/vmlinuz-4.4.0-23-generic root=UUID=8bd2b8c2-0aa1-4884-9806-9eaeab5dda58 ro amdgpu.powerplay=1 quiet splash vt.handoff=7
      [ 0.000000] AMD AuthenticAMD
      [ 0.000000] RAMDISK: [mem 0x33790000-0x35bbffff]
      [ 0.000000] Kernel command line: BOOT_IMAGE=/vmlinuz-4.4.0-23-generic root=UUID=8bd2b8c2-0aa1-4884-9806-9eaeab5dda58 ro amdgpu.powerplay=1 quiet splash vt.handoff=7
      [ 0.144760] Performance Events: PEBS fmt1+, 16-deep LBR, SandyBridge events, full-width counters, Intel PMU driver.
      [ 0.176040] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5
      [ 0.204924] usbcore: registered new interface driver usbfs
      [ 0.204930] usbcore: registered new interface driver hub
      [ 0.204940] usbcore: registered new device driver usb
      [ 0.907862] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 249)
      [ 0.908273] pcie_pme 0000:00:01.0:pcie01: service driver pcie_pme loaded
      [ 0.908295] pcie_pme 0000:00:1c.0:pcie01: service driver pcie_pme loaded
      [ 0.908319] pcie_pme 0000:00:1c.2:pcie01: service driver pcie_pme loaded
      [ 0.908327] pciehp: PCI Express Hot Plug Controller Driver version: 0.4
      [ 0.909227] Serial: 8250/16550 driver, 32 ports, IRQ sharing enabled
      [ 0.934295] tun: Universal TUN/TAP device driver, 1.6
      [ 0.934325] PPP generic driver version 2.4.2
      [ 0.934367] ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
      [ 0.934371] ehci-pci: EHCI PCI platform driver
      [ 0.965063] ehci-platform: EHCI generic platform driver
      [ 0.965072] ohci_hcd: USB 1.1 'Open' Host Controller (OHCI) Driver
      [ 0.965077] ohci-pci: OHCI PCI platform driver
      [ 0.965085] ohci-platform: OHCI generic platform driver
      [ 0.965091] uhci_hcd: USB Universal Host Controller Interface driver
      [ 0.968295] i2c /dev entries driver
      [ 0.968385] Intel P-state driver initializing.
      [ 0.970187] microcode: Microcode Update Driver: v2.01 <[email protected]>, Peter Oruba
      [ 0.999476] FUJITSU Extended Socket Network Device Driver - version 1.0 - Copyright (c) 2015 FUJITSU LIMITED
      [ 1.006392] [drm] Initialized drm 1.1.0 20060810
      [ 1.007406] e1000e: Intel(R) PRO/1000 Network Driver - 3.2.6-k
      [ 1.021935] [drm] amdgpu kernel modesetting enabled.
      [ 1.023268] AMD IOMMUv2 driver by Joerg Roedel <[email protected]>
      [ 1.023269] AMD IOMMUv2 functionality not available on this system
      [ 1.025309] fb: switching to amdgpudrmfb from VESA VGA
      [ 1.025531] [drm] initializing kernel modesetting (FIJI 0x1002:0x7300 0x174B:0xE329 0xCB).
      [ 1.025537] [drm] register mmio base: 0xFE600000
      [ 1.025538] [drm] register mmio size: 262144
      [ 1.025540] [drm] doorbell mmio base: 0xD0000000
      [ 1.025540] [drm] doorbell mmio size: 2097152
      [ 1.025544] [drm] probing gen 2 caps for device 8086:101 = 2212102/0
      [ 1.025545] [drm] probing mlw for device 8086:101 = 2212102
      [ 1.025557] amdgpu 0000:01:00.0: Invalid ROM contents
      [ 1.025574] [drm] Changing default dispclk from 500Mhz to 600Mhz
      [ 1.025830] amdgpu 0000:01:00.0: VRAM: 4096M 0x0000000000000000 - 0x00000000FFFFFFFF (4096M used)
      [ 1.025831] amdgpu 0000:01:00.0: GTT: 4096M 0x0000000100000000 - 0x00000001FFFFFFFF
      [ 1.025832] [drm] Detected VRAM RAM=4096M, BAR=256M
      [ 1.025832] [drm] RAM width 512bits DDR
      [ 1.025879] [drm] amdgpu: 4096M of VRAM memory ready
      [ 1.025879] [drm] amdgpu: 4096M of GTT memory ready.
      [ 1.025885] [drm] GART: num cpu pages 1048576, num gpu pages 1048576
      [ 1.029039] [drm] PCIE GART of 4096M enabled (table at 0x0000000000040000).
      [ 1.029049] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
      [ 1.029050] [drm] Driver supports precise vblank timestamp query.
      [ 1.029070] amdgpu 0000:01:00.0: amdgpu: using MSI.
      [ 1.029086] [drm] amdgpu: irq initialized.
      [ 1.035901] amdgpu: powerplay initialized
      [ 1.036360] [drm] AMDGPU Display Connectors
      [ 1.036361] [drm] Connector 0:
      [ 1.036361] [drm] DP-1
      [ 1.036362] [drm] HPD5
      [ 1.036363] [drm] DDC: 0x4868 0x4868 0x4869 0x4869 0x486a 0x486a 0x486b 0x486b
      [ 1.036363] [drm] Encoders:
      [ 1.036364] [drm] DFP1: INTERNAL_UNIPHY1
      [ 1.036364] [drm] Connector 1:
      [ 1.036365] [drm] DP-2
      [ 1.036365] [drm] HPD4
      [ 1.036366] [drm] DDC: 0x486c 0x486c 0x486d 0x486d 0x486e 0x486e 0x486f 0x486f
      [ 1.036367] [drm] Encoders:
      [ 1.036367] [drm] DFP2: INTERNAL_UNIPHY1
      [ 1.036368] [drm] Connector 2:
      [ 1.036368] [drm] DP-3
      [ 1.036369] [drm] HPD3
      [ 1.036369] [drm] DDC: 0x4870 0x4870 0x4871 0x4871 0x4872 0x4872 0x4873 0x4873
      [ 1.036370] [drm] Encoders:
      [ 1.036370] [drm] DFP3: INTERNAL_UNIPHY
      [ 1.036371] [drm] Connector 3:
      [ 1.036371] [drm] HDMI-A-1
      [ 1.036372] [drm] HPD2
      [ 1.036373] [drm] DDC: 0x487c 0x487c 0x487d 0x487d 0x487e 0x487e 0x487f 0x487f
      [ 1.036373] [drm] Encoders:
      [ 1.036373] [drm] DFP4: INTERNAL_UNIPHY2
      [ 1.036573] amdgpu 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000100000008, cpu addr 0xffff8800352bf008
      [ 1.036709] amdgpu 0000:01:00.0: fence driver on ring 1 use gpu addr 0x0000000100000018, cpu addr 0xffff8800352bf018
      [ 1.036842] amdgpu 0000:01:00.0: fence driver on ring 2 use gpu addr 0x0000000100000028, cpu addr 0xffff8800352bf028
      [ 1.036967] amdgpu 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000100000038, cpu addr 0xffff8800352bf038
      [ 1.037121] amdgpu 0000:01:00.0: fence driver on ring 4 use gpu addr 0x0000000100000048, cpu addr 0xffff8800352bf048
      [ 1.037240] amdgpu 0000:01:00.0: fence driver on ring 5 use gpu addr 0x0000000100000058, cpu addr 0xffff8800352bf058
      [ 1.037387] amdgpu 0000:01:00.0: fence driver on ring 6 use gpu addr 0x0000000100000068, cpu addr 0xffff8800352bf068
      [ 1.037538] amdgpu 0000:01:00.0: fence driver on ring 7 use gpu addr 0x0000000100000078, cpu addr 0xffff8800352bf078
      [ 1.037705] amdgpu 0000:01:00.0: fence driver on ring 8 use gpu addr 0x0000000100000088, cpu addr 0xffff8800352bf088
      [ 1.037878] amdgpu 0000:01:00.0: fence driver on ring 9 use gpu addr 0x0000000100000098, cpu addr 0xffff8800352bf098
      [ 1.037944] amdgpu 0000:01:00.0: fence driver on ring 10 use gpu addr 0x00000001000000a8, cpu addr 0xffff8800352bf0a8
      [ 1.038037] [drm] Found UVD firmware Version: 1.52 Family ID: 10
      [ 1.038493] amdgpu 0000:01:00.0: fence driver on ring 11 use gpu addr 0x000000000088f7b0, cpu addr 0xffffc9000444e7b0
      [ 1.038536] [drm] Found VCE firmware Version: 48.0 Binary ID: 3
      [ 1.038595] amdgpu 0000:01:00.0: fence driver on ring 12 use gpu addr 0x00000001000000c8, cpu addr 0xffff8800352bf0c8
      [ 1.038613] amdgpu 0000:01:00.0: fence driver on ring 13 use gpu addr 0x00000001000000d8, cpu addr 0xffff8800352bf0d8
      [ 1.085468] [drm] ring test on 0 succeeded in 12 usecs
      [ 1.085649] [drm] ring test on 1 succeeded in 21 usecs
      [ 1.085689] [drm] ring test on 2 succeeded in 22 usecs
      [ 1.085699] [drm] ring test on 3 succeeded in 5 usecs
      [ 1.085706] [drm] ring test on 4 succeeded in 3 usecs
      [ 1.085713] [drm] ring test on 5 succeeded in 3 usecs
      [ 1.085720] [drm] ring test on 6 succeeded in 3 usecs
      [ 1.085726] [drm] ring test on 7 succeeded in 3 usecs
      [ 1.085733] [drm] ring test on 8 succeeded in 3 usecs
      [ 1.085758] [drm] ring test on 9 succeeded in 6 usecs
      [ 1.085769] [drm] ring test on 10 succeeded in 6 usecs
      [ 1.111584] [drm] ring test on 11 succeeded in 2 usecs
      [ 1.111585] [drm] UVD initialized successfully.
      [ 1.220772] [drm] ring test on 12 succeeded in 16 usecs
      [ 1.220785] [drm] ring test on 13 succeeded in 3 usecs
      [ 1.220786] [drm] VCE initialized successfully.
      [ 1.280215] [drm] fb mappable at 0xC0BAA000
      [ 1.280216] [drm] vram apper at 0xC0000000
      [ 1.280217] [drm] size 33177600
      [ 1.280217] [drm] fb depth is 24
      [ 1.280218] [drm] pitch is 15360
      [ 1.280310] fbcon: amdgpudrmfb (fb0) is primary device
      [ 1.280437] amdgpu 0000:01:00.0: fb0: amdgpudrmfb frame buffer device
      [ 1.288995] [drm] ib test on ring 0 succeeded in 0 usecs
      [ 1.289338] [drm] ib test on ring 1 succeeded in 0 usecs
      [ 1.289468] [drm] ib test on ring 2 succeeded in 0 usecs
      [ 1.289600] [drm] ib test on ring 3 succeeded in 0 usecs
      [ 1.289752] [drm] ib test on ring 4 succeeded in 0 usecs
      [ 1.289903] [drm] ib test on ring 5 succeeded in 0 usecs
      [ 1.290012] [drm] ib test on ring 6 succeeded in 0 usecs
      [ 1.290256] [drm] ib test on ring 7 succeeded in 0 usecs
      [ 1.290367] [drm] ib test on ring 8 succeeded in 0 usecs
      [ 1.290427] [drm] ib test on ring 9 succeeded in 0 usecs
      [ 1.290481] [drm] ib test on ring 10 succeeded in 0 usecs
      [ 1.291565] [drm] ib test on ring 11 succeeded
      [ 1.291685] [drm] ib test on ring 12 succeeded
      [ 1.291843] [drm] Initialized amdgpu 3.1.0 20150101 for 0000:01:00.0 on minor 0
      [ 2.150430] hidraw: raw HID events driver (C) Jiri Kosina
      [ 2.158142] usbcore: registered new interface driver usbhid
      [ 2.158143] usbhid: USB HID core driver
      [ 3.381252] lp: driver loaded but no devices found
      [ 3.383147] ppdev: user-space parallel port driver
      [ 3.570601] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
      [ 3.570604] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
      [ 3.570606] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
      [ 3.580346] shpchp: Standard Hot Plug PCI Controller Driver version: 0.4
      [ 3.703172] usbcore: registered new interface driver btusb
      [ 3.879838] asus_wmi: ASUS WMI generic driver loaded
      [ 4.241587] usbcore: registered new interface driver snd-usb-audio
      [ 4.258038] usbcore: registered new interface driver uvcvideo
      [ 4.258039] USB Video Class driver (1.1.1)
      [ 18.782547] gnome-shell[1791]: segfault at 10 ip 00007f19b2360b02 sp 00007fff51c4f4a0 error 4 in amdgpu_dri.so[7f19b165c000+248c000]
      [ 20.010768] gnome-shell[2040]: segfault at 10 ip 00007f132e360b02 sp 00007ffef862aa00 error 4 in amdgpu_dri.so[7f132d65c000+248c000]

      Comment


      • #4
        Hmm.. this is not good:

        sudo update-alternatives --list i386-linux-gnu_egl_conf
        /usr/lib/i386-linux-gnu/mesa-egl/ld.so.conf
        sudo update-alternatives --list i386-linux-gnu_gl_conf
        /usr/lib/i386-linux-gnu/mesa/ld.so.conf
        sudo update-alternatives --list x86_64-linux-gnu_egl_conf
        /usr/lib/x86_64-linux-gnu/mesa-egl/ld.so.conf
        sudo update-alternatives --list x86_64-linux-gnu_gl_conf
        /usr/lib/x86_64-linux-gnu/mesa/ld.so.conf

        Comment


        • #5
          Looks like it never finds a libGL here... ?

          strace es2_info 2>&1 | grep libGL.so
          open("/lib/x86_64-linux-gnu/tls/x86_64/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/lib/x86_64-linux-gnu/tls/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/lib/x86_64-linux-gnu/x86_64/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/lib/x86_64-linux-gnu/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/usr/lib/x86_64-linux-gnu/tls/x86_64/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/usr/lib/x86_64-linux-gnu/tls/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/usr/lib/x86_64-linux-gnu/x86_64/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/usr/lib/x86_64-linux-gnu/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/lib/tls/x86_64/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/lib/tls/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/lib/x86_64/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/lib/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/usr/lib/tls/x86_64/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/usr/lib/tls/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/usr/lib/x86_64/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
          open("/usr/lib/libGL.so.1.2", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)

          Comment


          • #6
            Here's the xorg log. I guess this part is bad also:
            [ 48.153] (EE) AMDGPU(0): Failed to open amdgpu hybrid version
            http://paste.ubuntu.com/16641357/

            Comment


            • #7
              I did a clean install of Ubuntu Gnome 16.04, installed updates, installed amdgpu-pro. Same behavior.
              I have 2 4K displayport monitors, one Dell P2415Q and one Samsung U24E850. Also tested booting with each of them connected alone.

              Comment


              • #8
                I ran es1_info under "rr" so I could reverse step. Here's what that looked like:

                http://paste.ubuntu.com/16653158/

                I have a Sandy Bridge processor, that's pretty old. Perhaps something is using an instruction set my processor doesn't support?

                Comment


                • #9
                  I'm not sufficiently familiar with Ubuntu variant details to tell from the logs - was your prior install Ubuntu Gnome as well or the standard (Unity ?) version ?

                  Comment


                  • #10
                    Originally posted by bridgman View Post
                    I'm not sufficiently familiar with Ubuntu variant details to tell from the logs - was your prior install Ubuntu Gnome as well or the standard (Unity ?) version ?
                    Well my prior complaint was that gnome-shell and GDM and a lot of other things seems to be broken for me.

                    Regarding Ubuntu there is just one Ubuntu and then they have different "flavors" which are different sets of default packages.

                    Openbox works great of course. LightDM has some rendering artifacts actually. Let me try Unity properly also...

                    Comment

                    Working...
                    X