LinuxQuestions.org
Linux - Kernel This forum is for all discussion relating to the Linux kernel.

Old 05-12-2022, 10:04 AM   #16
hazel
LQ Guru
 
Registered: Mar 2016
Location: Harrow, UK
Distribution: LFS, AntiX, Slackware
Posts: 7,680

Original Poster
Blog Entries: 19

Rep: Reputation: 4492

So why didn't I get the usual panic report with a traceback?
 
Old 05-12-2022, 10:17 AM   #17
elcore
Senior Member
 
Registered: Sep 2014
Distribution: Slackware
Posts: 1,754

Rep: Reputation: Disabled
Quote:
Originally Posted by hazel View Post
So why didn't I get the usual panic report with a traceback?
Maybe the buffer was filled with so much output the report did not fit anymore. But that's just a guess.
Usually the kernel guys recommend trying to ssh into the machine to check whether it is really frozen solid or whether just the display froze.
But to report anything upstream, you must reproduce it on the most recent version of any given branch.
 
1 members found this post helpful.
Old 05-12-2022, 11:29 AM   #18
hazel
LQ Guru
 
Registered: Mar 2016
Location: Harrow, UK
Distribution: LFS, AntiX, Slackware
Posts: 7,680

Original Poster
Blog Entries: 19

Rep: Reputation: 4492
Quote:
Originally Posted by elcore View Post
Usually the kernel guys recommend trying to ssh into the machine to check whether it is really frozen solid or whether just the display froze.
Excellent advice. I'll do that next time I have any kind of freeze.
 
Old 05-12-2022, 11:49 AM   #19
petelq
Member
 
Registered: Aug 2008
Location: Yorkshire
Distribution: openSUSE(Leap and Tumbleweed) and a (not so) regularly changing third and fourth
Posts: 629

Rep: Reputation: Disabled
Quote:
Originally Posted by petelq View Post
Take your laptop to M&S and use their wifi. It's free and no snags to start, no strings.
You can have a very nice coffee while you work.
I thought I'd seen a post about you having a laptop at some point, so I just assumed, and you know what's said about assuming.
I stand corrected.
I think we're about the same age, and I have a smartphone and build ROMs for it. I would have thought you were much more tech savvy than me (and you probably are).
 
Old 05-13-2022, 04:27 AM   #20
hazel
LQ Guru
 
Registered: Mar 2016
Location: Harrow, UK
Distribution: LFS, AntiX, Slackware
Posts: 7,680

Original Poster
Blog Entries: 19

Rep: Reputation: 4492
OK, lots to report!

1) The fault is reproducible, although the output differs this morning. I guess it depends on the precise moment of the crash. This time I got some lines of kernel output, repeated over and over. I transcribed them by hand (hope this is accurate):
Code:
BUG: Unable to handle page fault for address ffffffffd83ea180
#PF: supervisor read access in kernel mode
#PF: error_code (0x0000) - not-present page
PGD 15612067 P4D 15612067 PUD 15614067 PMD 0
general protection fault, probably for non-canonical address 0x720072007200720:0000 [#12] PREEMPT SMP PTI
CPU: 0 PID 119539488 Comm: /*d*r*i*v*e*r*s*/*x* Not tainted 5.15.19scroll #1
Hardware name: LENOVO 90BX0018UK/Aptio CRB, BIOS 007KT39AUS 06/18/2014
2) When I tried to ssh from the laptop I got:
Code:
ssh: connect to host bigboy port 22: No route to host
Bigboy is in the /etc/hosts file btw with the address 192.168.2.100. When I pinged that address, I got destination unreachable. So it really was a crash, not just a console freeze.
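The check described above (try to reach the machine over the network to tell a frozen console from a dead kernel) can be scripted. A minimal sketch in Python, assuming the ssh port 22 seen in the error message; the function name `port_open` and the 3-second timeout are made up for illustration:

```python
import socket

def port_open(host: str, port: int = 22, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers "No route to host", connection refused, timeouts
        # and name-resolution failures.
        return False
```

Calling `port_open("bigboy")` from the laptop returns False whenever the connection fails the way the ssh attempt above did. A False result is only consistent with a crash; it cannot rule out a network problem on its own, which is why the ping test is a useful second check.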

Now I need to repeat all that with a standard unpatched kernel.

Last edited by hazel; 05-13-2022 at 05:07 AM.
 
Old 05-14-2022, 05:25 AM   #21
hazel
LQ Guru
 
Registered: Mar 2016
Location: Harrow, UK
Distribution: LFS, AntiX, Slackware
Posts: 7,680

Original Poster
Blog Entries: 19

Rep: Reputation: 4492
I am now running the 5.15.27 official Slackware kernel and there is no bug. Of course, that's not a 100% reliable test because it's a later version and there could be a real kernel bug that got fixed. To be certain, I shall need to rebuild 5.15.19 without the scrolling tty patch and try booting from that. But I'm fairly certain now that the bug is in the patch and not in the main kernel. After all, this is a strictly unofficial patch.

I think I have a hazy idea of what is going on here. It has to do with memory management. PF stands for page fault, and PGD, P4D, PUD and PMD are the successive levels of the kernel's page tables. d*r*i*v*e*r*s probably means that a driver (the patched tty driver) is triggering the problem. Output to a scrolling console has to be stored in memory so that you can get it back when you scroll upwards. I vaguely remember from past reading that it goes into video memory, but I have a system-on-a-chip with built-in video so there probably isn't that much difference physically between video and main memory on my machine. Anyway, I think what is happening is that the patched driver and the part of the kernel that does memory management aren't quite in sync. When you cause a lot of output to be dumped at once, as I do in this test op, maybe it can't get written to memory fast enough and something gets out of step.
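One possible reading of the transcribed numbers, offered as a guess rather than a diagnosis: a text-mode console stores each screen cell as a 16-bit value, character code in the low byte and attribute in the high byte, and 0x0720 is a blank with the default grey-on-black attribute. The sketch below (Python; the helper names are made up, and it assumes 48-bit virtual addressing and that the hand-copied addresses are accurate) checks whether each address is canonical and splits the faulting one into such cells:

```python
def is_canonical(addr: int, va_bits: int = 48) -> bool:
    """True if bits 63..(va_bits-1) are all equal, as x86-64 requires."""
    top = addr >> (va_bits - 1)
    all_ones = (1 << (64 - va_bits + 1)) - 1
    return top == 0 or top == all_ones

def text_cells(value: int, count: int = 4):
    """Split a 64-bit value into 16-bit (character, attribute) cells, lowest first."""
    cells = []
    for i in range(count):
        cell = (value >> (16 * i)) & 0xFFFF
        cells.append((chr(cell & 0xFF), cell >> 8))
    return cells

gpf_addr = 0x0720072007200720   # from the "general protection fault" line
pf_addr = 0xFFFFFFFFD83EA180    # from the "page fault" line

print(is_canonical(pf_addr))    # True
print(is_canonical(gpf_addr))   # False
print(text_cells(gpf_addr))     # [(' ', 7), (' ', 7), (' ', 7), (' ', 7)]
```

If that reading is right, blank screen cells were being used as a pointer, which would fit console data being written outside its buffer. The Comm field, with its characters alternating with `*`, looks like the same kind of pattern. This is only a pattern match on a hand transcription, not proof.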

If anyone else is using this patch, perhaps they could try to replicate the bug. Just give a command that dumps several screenfuls of output almost instantaneously and see what happens.
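For anyone trying to replicate this, here is a small sketch of such a command in Python. The function name `flood` and the defaults of 20000 lines and 78 columns are arbitrary choices, not anything taken from the patch or the kernel:

```python
import sys

def flood(lines: int = 20000, width: int = 78, out=sys.stdout) -> int:
    """Write `lines` numbered lines as fast as possible; return the count written."""
    for n in range(lines):
        prefix = f"{n:06d} "
        out.write(prefix + "x" * (width - len(prefix)) + "\n")
    out.flush()
    return lines
```

Call `flood()` on a bare virtual console, not under X or over ssh, and save any open work first: on a machine that has the bug, the expected result is a hang.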

Last edited by hazel; 05-14-2022 at 05:27 AM.
 
Old 05-16-2022, 04:44 AM   #22
hazel
LQ Guru
 
Registered: Mar 2016
Location: Harrow, UK
Distribution: LFS, AntiX, Slackware
Posts: 7,680

Original Poster
Blog Entries: 19

Rep: Reputation: 4492
I just booted from an unpatched 5.15.19 kernel and there is no bug! So the problem is either in the patch itself or possibly in a mismatch between the requirements of a scrolling console and the expectations of a memory manager that no longer supports it. In either case, there's nothing that needs to be reported upstream.
 
Old 05-16-2022, 12:18 PM   #23
verndog
Member
 
Registered: Oct 2007
Posts: 279

Rep: Reputation: 68
hazel, glad you got it fixed. I was interested in your comments regarding "age" related stuff. Born in '44, I'm just now using some of the smartness of my phone. It's usually off most of the time. No need to thumb my life. When I exercise outside, I laugh at all the "thumb" people out and about. I want to enjoy the sights, smells, etc. of the outdoors. I witness people stop running when the phone rings?!? Can't leave home without it, it seems. I'm too conservative for today's society.
 
Old 05-16-2022, 12:32 PM   #24
hazel
LQ Guru
 
Registered: Mar 2016
Location: Harrow, UK
Distribution: LFS, AntiX, Slackware
Posts: 7,680

Original Poster
Blog Entries: 19

Rep: Reputation: 4492
Quote:
Originally Posted by verndog View Post
hazel, glad you got it fixed.
It isn't fixed! It's unfixable (by me anyway). But the question is solved because we now know what's causing the problem. Since it only manifests when you use this unofficial patch, reporting it upstream isn't going to do anything useful.

How serious it is depends on what you want to use a scrolling console for. If you just want to be able to examine kernel messages after a failed boot, that's fine because they don't come out fast enough to trigger the problem. But if you wanted to use it to build extra software on a skeletal LFS system which has no graphical interface as yet, you probably would crash it, because messages from a build come out very fast indeed. You would have to either direct the output into a file and examine it afterwards or use something like screen or tmux. And that takes away the purpose of using a scrolling console in the first place. You might as well use screen/tmux with a standard kernel.
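The file-redirection workaround mentioned above could look like this in Python. `run_logged` and the log file name in the usage note are illustrative, and any build command can be substituted:

```python
import subprocess

def run_logged(cmd, log_path):
    """Run cmd with stdout and stderr sent to log_path; return the exit status."""
    with open(log_path, "w") as log:
        result = subprocess.run(cmd, stdout=log, stderr=subprocess.STDOUT)
    return result.returncode
```

For example, `run_logged(["make"], "build.log")` keeps the build chatter off the console entirely, and the log can be paged through afterwards with less.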

Last edited by hazel; 05-16-2022 at 12:33 PM.
 
  


All times are GMT -5. The time now is 10:15 PM.
