Manjaro 20.0.3 kernel 5.6.15-1 instability. Crashes with different severity, need help troubleshooting
My installation is about 4 days old, the only non standard driver used is one for my USB wifi adapter (https://github.com/aircrack-ng/rtl8812au). Maybe like every 24+ hours or so the system freezes. No input from keyboard/mouse can be given. The two crashes prior to the last one have been fixed by a hard reset, but the most recent one was even more severe. After hard reset button some of the fans started blasting and it seemed like it was stuck on some pre-post loop. Not even five+ second power button to power it off worked. I had to use the power switch on the back of the power supply.
Now; usually you would just look in journalctl and find errors and troubleshoot those. However there are no errors that leads up to this. That is; at least I can't seem to recognize any. As you probably understand I'm not a linux savvy - so I come here for help. This is journalctl leading up to the crash: Code:
Jun 08 13:01:01 manjaro CROND[91975]: (root) CMD (run-parts /etc/cron.hourly) journalctl for latest crash: Code:
Jun 08 20:34:11 manjaro msm_notifier[1472]: "qt5-translations" The crashes seem to happen whenever, it's happened two times while I'm afk, and two times while I'm on the pc, but not as a result of running anything as far as I can tell. I have had Chrome, Thunderbird, Discord and a couple of terminals open for probably all the crashes. RTX2070 on nVidia driver 440.82 Latest BIOS |
First question I'd ask would be, why are you using that driver? Does the 'stock' driver work and connect you, and if so, does it create the same issue?
|
Quote:
|
Quote:
https://forum.manjaro.org/t/realtek-...on-aur/85429/3 ...specifically for the 8188. Not sure if the aur links will work differently than the github manual build, but worth a try at least. |
Quote:
Quote:
|
Quote:
Have you checked the /etc/cron.hourly directory? ("ls /etc/cron.hourly"). That's mentioned there too, and the fact you have a --Reboot-- in the logs seems to say your system is rebooting itself. You can try running: Code:
sudo service anacron stop |
My cron.hourly only contains 0anacron which again only contains this:
Code:
#!/bin/sh |
System just rebooted while I was working in Chrome. I guess I can throw out the idea about it being one of the pages I had shut down.
Code:
Jun 10 01:02:52 manjaro dbus-daemon[767]: [system] Activating via systemd: service name='org.bluez' unit='dbus-org.bluez.service' requested by ':1.6705' (uid=1000 pid=70097 comm="/opt> This is from .xsession-errors.old but tbh I don't see anything here either: Code:
avaScript error: resource:///modules/activity/autosync.jsm, line 251: uncaught exception: 2147746065 I installed rsyslog in desperation, but I guess I need to configure this to be running on the system all the time to log anything: also if I do - is it likely it will catch more than journalctl? |
Quote:
Anacron is different than cron. https://www.tecmint.com/cron-vs-anac...cron-on-linux/ And to ask an obvious question that I should have asked earlier; are you 100% sure the hardware is ok? I take it this is a desktop machine...for a simple experiment, you can try to boot and run a 'live' version of Linux, and I'd suggest something other than Manjaro for the test, like Fedora, Mint, or openSUSE. You can see if things still reboot/crash or if they just run normally. Hard to diagnose issues that don't leave traces in the logs, so sometimes it's easier to shotgun things, and see what sticks. |
Quote:
Code:
# /etc/anacrontab: configuration file for anacron Quote:
|
Not getting anywhere with this. The system still crashes randomly every day or two with no apparent error messages. Also I believe the reason the system reports --Reboot-- is because I have to hard reset the system so it counts that as a reboot. Do some kind of utility that monitors *everything* exist so I might be able to figure this one out?
|
First of all, install the wifi driver recommended by Manjaro.
BTW, the fact that you did this outside of package management makes me think what else you might have "tweaked", possibly breaking something. Is there a pattern to the reboots? I.e. when the system is under stress? Or always at a certain time? Or always X hours after booting? Etc., you get the picture. You have to think what you did between then (when the system was working OK) and now. You installed a new OS. What else? As an example, I once had a 64bit system that supported only 4GB of RAM (which I didn't know at first) - when I put in more RAM, it did not complain, everything worked fine until it started utilising more RAM, at which point it froze/crashed (crappy MSI laptop). Quote:
|
I installed the driver recommended and blacklisted the old module like suggested in the thread. Rebooted and everything seems fine. Now it's just wait and see.
Prior to this install I was on Debian for like a week. I had similar issues there. Prior to Debian I have been running Windows 10 with no issues for over a year with the same hardware. The crashes are random, I tried the process of elimination by shutting down more and more programs I had running, but in the end I only had Chrome up with like 10 tabs and was writing an email from gmail when it crashed. I also had it crash under heavier load while playing a game. All that being said if this fixes it I will be a bit upset at myself for not installing this driver when first recommended it. Thank you for your help so far. |
System crashed again, this time when I was afk. So now I did not hard restart but journalctl still shows:
Code:
Jul 13 09:01:01 manjaro CROND[3781]: (root) CMD (run-parts /etc/cron.hourly) Code:
Jul 13 10:07:24 manjaro kernel: mce: [Hardware Error]: CPU 7: Machine Check: 0 Bank 5: bea0000000000108 edit: Okay I've been googling the error and it seems like my CPU might be borked. Quite annoying that it works fine on Windows. But eh, guess it was time for a new one anyway. I ordered a new one and will keep this unsolved until I've tested it. |
Quote:
He will figure it out it is a two way radio and the drivers and the firmware. Ask our Self what is rtl8812au_aircrack-ng and what it ask that device to do. You know and I know it isn't finding the firmware from the stock driver. And through the years the firmware keeps getting moved. git clone https://github.com/gnab/rtl8812au.git |
All times are GMT -5. The time now is 10:55 AM. |