PC shutting down during games
I see the following in the logs:
Code:
kernel: amdgpu 0000:04:00.0: amdgpu: ERROR: GPU over temperature range(SW CTF) detected!
I have been watching the temperature and never see more than 72℃. The critical limit is 110℃. If critical is 110℃, why is the kernel shutting down the PC when the video card only reaches 72℃ at most? |
What distro and kernel are you using? Also, what hardware?
Kernel 5.8 had patches to deal with this issue. |
"Only" 72℃? :confused:
I would definitely check fans and thermal paste asap! Critical 110℃? That would probably kill any processor... |
Your kernel is detecting that your video card is overheating, and you think the problem is with the kernel?
|
This is a config issue. Or maybe a kernel issue.
My ATI has a maximum SAFE range of up to 110C (not a thing wrong with it in 4 years). I wouldn't want my machine deciding to shut down when it's running as advertised either. https://www.pcgamer.com/fretting-ove...-spec-on-navi/ But it's all down to the hardware. Clearly the OP is expecting to run at higher temperatures, like mine? |
In addition to what Jan K. suggested, also check to ensure that cooling vents are free of dust and other obstructions.
|
openSUSE Leap 15.5
Kernel: 5.14.21-150500.46-default
CPU: AMD 3700X
Video card: RX 6700XT Nitro+
The coolers are clean and the radiator is clean. My previous card, an RX 570x, ran at the same temperatures but never shut down. |
https://www.techpowerup.com/review/a...700-xt/33.html
This indicates that the hotspot for that card is 95C and it games at around 80C. So it seems to me that the 70C shutdown is unnecessary.

Seems I might as well point out the obvious... *BEFORE DOING ANY OF THIS* you had better make sure you really, truly know what you are getting into, because you can raise this safety check too high and smoke your card or system.

To change this, first you will have to find the appropriate hwmon directory for your GPU in /sys/class/hwmon. Then cat the temp1_crit file to see what the setting is. Once you find the correct directory and that file, you can change it. It's likely going to be 70000 (which is 70C), and you can change it to something more befitting the operating temperatures of that card. |
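For anyone unsure which hwmon directory belongs to the GPU, here is a minimal read-only sketch of the lookup described above. It assumes the GPU's hwmon directory has a `name` file containing `amdgpu`, and that the limits are stored in millidegrees Celsius, as is usual for hwmon. The function name `amdgpu_crit_limits` is made up for illustration. It only reads the files and changes nothing.

```shell
#!/bin/sh
# amdgpu_crit_limits is a hypothetical helper name, not an existing tool.
# $1 is the hwmon root; it defaults to /sys/class/hwmon.
amdgpu_crit_limits() {
    root="${1:-/sys/class/hwmon}"
    for dir in "$root"/hwmon*; do
        # Each hwmon directory has a "name" file holding the driver name.
        [ -r "$dir/name" ] || continue
        [ "$(cat "$dir/name")" = "amdgpu" ] || continue
        # Print every critical limit found for this device.
        for f in "$dir"/temp*_crit; do
            [ -r "$f" ] || continue
            milli=$(cat "$f")
            # Convert millidegrees to whole degrees Celsius.
            printf '%s/%s: %s C\n' "${dir##*/}" "${f##*/}" "$(( milli / 1000 ))"
        done
    done
}
```

Run `amdgpu_crit_limits` with no argument to scan /sys/class/hwmon. The hwmon numbering can change between boots, so matching on the `name` file is safer than hard-coding hwmon1.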
Quote:
Code:
~> cat /sys/class/hwmon/hwmon1/temp1_crit
But what is this "junction" temperature? I don't see any value like 70000. I think that if the 6700 gets too hot it should throttle thermally instead of shutting down the whole system. I have this card: https://www.techpowerup.com/review/s...-nitro/33.html At idle my card runs 5℃-6℃ hotter than the review on the site above reports: I see 49℃-50℃. |
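On the "junction" question: as far as I know, the amdgpu driver labels its temperature channels through `temp*_label` files, typically `edge`, `junction` (the hotspot, the hottest point on the die) and `mem`, and each channel can have its own critical limit. The sketch below prints every labelled channel with its current and critical value so the right limit is compared against the right reading. The helper names `read_celsius` and `list_gpu_temps` are made up for illustration, and the directory you pass in is whatever hwmon directory your GPU uses.

```shell
#!/bin/sh
# read_celsius and list_gpu_temps are hypothetical helper names.

# Print the millidegree value stored in file $1 as whole degrees, or "n/a".
read_celsius() {
    if [ -r "$1" ] && raw=$(cat "$1" 2>/dev/null) && [ -n "$raw" ]; then
        printf '%s' "$(( raw / 1000 ))"
    else
        printf 'n/a'
    fi
}

# $1 is the GPU's hwmon directory, for example /sys/class/hwmon/hwmon1.
list_gpu_temps() {
    dir="$1"
    for label_file in "$dir"/temp*_label; do
        [ -r "$label_file" ] || continue
        # Strip the suffix to get the channel prefix, e.g. .../temp2
        prefix="${label_file%_label}"
        printf '%s: now %s C, critical %s C\n' \
            "$(cat "$label_file")" \
            "$(read_celsius "${prefix}_input")" \
            "$(read_celsius "${prefix}_crit")"
    done
}
```

Call it as `list_gpu_temps /sys/class/hwmon/hwmon1`. If the monitoring tool you watch shows only the edge sensor, the junction reading may be much higher than the number you are looking at.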
I changed the thermal compound and found that part of the thermal pad on the power components was missing.
Now I no longer have problems with shutdowns, and the GPU temperature is lower. |