AMD Stages Latest Radeon/AMDGPU Changes For Linux 4.21 Kernel


  • #41
    Originally posted by IreMinMon

    What monitoring software is this?
    It's the GALLIUM_HUD env variable; here are some example usages:

    Native games:

    Code:
    $ export GALLIUM_HUD_PERIOD=0.07
    $ export GALLIUM_HUD=".h80.w105cpufreq-cur-cpu0+cpufreq-cur-cpu1+cpufreq-cur-cpu2+cpufreq-cur-cpu3;.h80.x185.w230.c100cpu0+cpu1+cpu2+cpu3;.x445.h80.w75.dGPU-load+cpu+fps;.x565.h80.w875.dfps;.x1470.h80.w190.c100sensors_temp_cu-amdgpu-pci-0100.temp1+GPU-load:100;.x1690.h80.w170requested-VRAM+VRAM-usage"
    $ ./game

    For Steam games:

    Code:
    GALLIUM_HUD_PERIOD=0.07 GALLIUM_HUD=".h80.w105cpufreq-cur-cpu0+cpufreq-cur-cpu1+cpufreq-cur-cpu2+cpufreq-cur-cpu3;.h80.x185.w230.c100cpu0+cpu1+cpu2+cpu3;.x445.h80.w75.dGPU-load+cpu+fps;.x565.h80.w875.dfps;.x1470.h80.w190.c100sensors_temp_cu-amdgpu-pci-0100.temp1+GPU-load:100;.x1690.h80.w170requested-VRAM+VRAM-usage" %command%

    If you have more than 4 cores, just extend both the cpufreq-cur-cpuN and cpuN lists above with cpu4 through cpu7, and so on.
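
    A rough sketch of how the per-core lists could be generated rather than typed by hand. The helper name join_cores is made up for illustration, it assumes bash and nproc are available, and only the first three panes of the HUD string above are reproduced; append the rest as needed.

```shell
#!/usr/bin/env bash
# join_cores PREFIX COUNT -> "PREFIX0+PREFIX1+...": one HUD graph entry per core.
# (join_cores is a hypothetical helper name, not something Mesa provides.)
join_cores() {
    local prefix=$1 count=$2 out="" i
    for ((i = 0; i < count; i++)); do
        if [ -n "$out" ]; then out+="+"; fi
        out+="${prefix}${i}"
    done
    printf '%s\n' "$out"
}

ncores=$(nproc)   # number of CPUs available to this process
export GALLIUM_HUD_PERIOD=0.07
export GALLIUM_HUD=".h80.w105$(join_cores cpufreq-cur-cpu "$ncores");.h80.x185.w230.c100$(join_cores cpu "$ncores");.x445.h80.w75.dGPU-load+cpu+fps"
echo "$GALLIUM_HUD"
```

    With the variables exported, launch the game from the same shell as in the native example.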

    More info here: https://manerosss.wordpress.com/2017...o-gallium-hud/



    • #42
      Originally posted by debianxfce
      I have a feeling that your PSU is too weak.
      I'd normally agree, but it's not the PSU. I have a Thermaltake 700W, it can definitely handle my i5 Haswell and RX 480 at max stock clocks.

      I'm almost certain the voltage is too low. In my previous post I showed it was pulling 150W @ 1.09V. I think if I bump that up to 1.175V (1175mV), it'll fix the issue.

      Running prime95 and Intel's power_gadget right after, I see:

      Code:
      $ sudo ./power_gadget -e 1000 -d 1
      System Time,RDTSC,Elapsed Time (sec),IA Frequency_0 (MHz),Processor Power_0 (Watt),Cumulative Processor Energy_0 (Joules),Cumulative Processor Energy_0 (mWh),IA Power_0 (Watt),Cumulative IA Energy_0 (Joules),Cumulative IA Energy_0(mWh),GT Power_0 (Watt),Cumulative GT Energy_0 (Joules),Cumulative GT Energy_0(mWh)
      06:46:45:849,90852126468092,1.0007,477172664,89.1909,89.1983,24.7773,81.4456,81.4524,22.6257,0.0000,0.0000,0.0000,
      
      Total Elapsed Time(sec)=1.0007
      
      Total Processor Energy_0(Joules)=89.1983
      Total Processor Energy_0(mWh)=24.7773
      Average Processor Power_0(Watt)=89.1393
      Let's round up to 100W, so that's 250W for my CPU and GPU combined, leaving 450W of headroom on a 700W unit. It's not the PSU.
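
      For anyone who wants to redo the arithmetic, here is a minimal sketch. The function names are made up for illustration; the inputs are the energy and elapsed time from the log above, the 150W GPU figure from my previous post, and the 700W PSU rating.

```shell
#!/usr/bin/env bash
# avg_power JOULES SECONDS -> average power in watts (energy divided by time).
avg_power() {
    awk -v j="$1" -v s="$2" 'BEGIN { printf "%.2f\n", j / s }'
}

# headroom PSU_WATTS GPU_WATTS CPU_WATTS -> watts left under the PSU rating.
headroom() {
    echo $(( $1 - $2 - $3 ))
}

avg_power 89.1983 1.0007   # total energy and elapsed time from the log
headroom 700 150 100       # 700W PSU, 150W GPU, CPU rounded up to 100W
```

      This only compares total draw against the label rating; it says nothing about how the load is split across rails.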

      I'll have to report back sometime with 1303 MHz at higher voltages but I'll wait until the drivers mature a little more when it comes to overclocking. I've had problems with the amdgpu.ppfeaturemask in the past.



      • #43
        Originally posted by perpetually high

        I'd normally agree, but it's not the PSU. I have a Thermaltake 700W, it can definitely handle my i5 Haswell and RX 480 at max stock clocks.

        I'm almost certain the voltage is too low. In my previous post I showed it was pulling 150W @ 1.09V. I think if I bump that up to 1.175V (1175mV), it'll fix the issue.

        Running prime95 and Intel's power_gadget right after, I see:

        Code:
        $ sudo ./power_gadget -e 1000 -d 1
        System Time,RDTSC,Elapsed Time (sec),IA Frequency_0 (MHz),Processor Power_0 (Watt),Cumulative Processor Energy_0 (Joules),Cumulative Processor Energy_0 (mWh),IA Power_0 (Watt),Cumulative IA Energy_0 (Joules),Cumulative IA Energy_0(mWh),GT Power_0 (Watt),Cumulative GT Energy_0 (Joules),Cumulative GT Energy_0(mWh)
        06:46:45:849,90852126468092,1.0007,477172664,89.1909,89.1983,24.7773,81.4456,81.4524,22.6257,0.0000,0.0000,0.0000,
        
        Total Elapsed Time(sec)=1.0007
        
        Total Processor Energy_0(Joules)=89.1983
        Total Processor Energy_0(mWh)=24.7773
        Average Processor Power_0(Watt)=89.1393
        Let's round up to 100W, so that's 250W for my CPU and GPU combined, leaving 450W of headroom on a 700W unit. It's not the PSU.

        I'll have to report back sometime with 1303 MHz at higher voltages but I'll wait until the drivers mature a little more when it comes to overclocking. I've had problems with the amdgpu.ppfeaturemask in the past.
        The frustrating thing about power supplies, though, is that many of them have two or even three 12V rails. If the CPU and GPU are both pulling power off the same rail while the others go unused, that could in fact be a power supply problem, and TT is specifically known to use too many rails with too little power per rail.

        EDIT: Honestly, if you have a single CPU and a single GPU, you really should buy a power supply with a single 12V rail. If you already own a dual-rail supply, you need to make sure the GPU is pulling power off the second rail.
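
        To put a number on the per-rail concern, here is a rough sketch that converts a load in watts to amps on a 12V rail and compares it with a rail rating. The function names are made up, the 18A limit in the usage line is a hypothetical figure (read the real one off the PSU label), and it assumes the whole load sits on 12V, which is only an approximation.

```shell
#!/usr/bin/env bash
# rail_amps WATTS [VOLTS] -> current in amps at the given voltage (default 12V).
rail_amps() {
    awk -v w="$1" -v v="${2:-12}" 'BEGIN { printf "%.1f\n", w / v }'
}

# fits_rail WATTS LIMIT_AMPS -> succeeds if the 12V current stays within the rating.
fits_rail() {
    awk -v w="$1" -v lim="$2" 'BEGIN { exit !(w / 12 <= lim) }'
}

rail_amps 250   # CPU and GPU on one rail, using the 250W estimate quoted above
if fits_rail 250 18; then   # 18A is a hypothetical per-rail rating
    echo "within rail rating"
else
    echo "over rail rating"
fi
```

        If the combined load exceeds one rail's rating but each component alone fits, moving the GPU to the second rail is the fix.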
        Last edited by duby229; 16 November 2018, 02:03 PM.



        • #44
          Originally posted by duby229

          The frustrating thing about power supplies, though, is that many of them have two or even three 12V rails. If the CPU and GPU are both pulling power off the same rail while the others go unused, that could in fact be a power supply problem, and TT is specifically known to use too many rails with too little power per rail.

          EDIT: Honestly, if you have a single CPU and a single GPU, you really should buy a power supply with a single 12V rail. If you already own a dual-rail supply, you need to make sure the GPU is pulling power off the second rail.
          Appreciate the info.

          A couple of months ago I bought a GTX 1080 (I'm not proud of this) and didn't experience any GPU hangs or issues with the PSU, and that card is more power hungry than the RX 480. I believe I was hitting 180W on the GPU at full power, but I can't remember exactly. So for my particular issue, I can say with high confidence it's not the PSU. I did, however, return that GTX 1080 and got my $500 back (this, I'm very proud of).



          • #45
            Originally posted by perpetually high

            Appreciate the info.

            A couple of months ago I bought a GTX 1080 (I'm not proud of this) and didn't experience any GPU hangs or issues with the PSU, and that card is more power hungry than the RX 480. I believe I was hitting 180W on the GPU at full power, but I can't remember exactly. So for my particular issue, I can say with high confidence it's not the PSU. I did, however, return that GTX 1080 and got my $500 back (this, I'm very proud of).
            Sorry brother, I don't mean to second-guess you. But just because a prior GPU worked in the same configuration doesn't mean that this GPU isn't pulling too much power. I'm 100% certain that the 1080 is more power efficient than that RX 480, and I'm also 100% certain Nvidia's power gating and thermal throttling are more advanced than AMD's. It's entirely likely that you were in fact hitting the same power limit, but Nvidia's driver gated and throttled the GPU enough to make it unnoticeable.



            • #46
              Originally posted by duby229

              Well, if you are getting a permission denied error, then it's likely a filesystem problem. Something like an executable bit or a read bit or a write bit or something. Probably not a driver issue.
              Actually it turned out the Xorg log was from when I originally tried 4.19.0, and unfortunately not even that is produced in subsequent kernels. In any case I did look at /dev/dri/card0 and it has the same permissions that work with all other kernels: crw-rw----+. And by the way kernels 4.14.81, 4.17.19, and 4.18.19 work perfectly with the system just as it is. Though I also backed everything up and did a clean reinstall just to be sure it wasn't my setup. I also removed my second GPU and a few PCI cards when doing the clean install test just to eliminate as many variables as possible.

              In any case just out of curiosity I also tried setting the card0 "Other" permissions to rw as well, but as I expected the permissions are set automatically and don't survive reboot.
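
              For reference, a small sketch of the check I mean. The helper name other_has_rw is made up for illustration, and it assumes GNU stat; the script just reports the current state of the node and changes nothing.

```shell
#!/usr/bin/env bash
# other_has_rw MODE -> succeeds if the "other" class has both read and write.
# MODE is an ls/stat-style permission string such as "crw-rw----+".
other_has_rw() {
    local mode=$1
    # Characters 7 and 8 (0-based) are the read and write bits for "other".
    [ "${mode:7:2}" = "rw" ]
}

dev=/dev/dri/card0
if [ -e "$dev" ]; then
    mode=$(stat -c '%A' "$dev")      # GNU stat: symbolic permission string
    stat -c '%A %U:%G %n' "$dev"     # permissions, owner:group, path
    if other_has_rw "$mode"; then
        echo "other: rw"
    else
        echo "other: no rw"
    fi
fi
```

              The trailing + in the ls output means the node also carries ACL entries; where the acl tools are installed, getfacl /dev/dri/card0 lists them.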
