DaemonForums  

Go Back   DaemonForums > OpenBSD > OpenBSD General

OpenBSD General Other questions regarding OpenBSD which do not fit in any of the categories below.

Reply
 
Thread Tools Display Modes
  #1   (View Single Post)  
Old 7th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default hardware or software problem?

Hello folks,
the last few days my openbsd 4.3 keeps on hanging suddenly, and I can only reset with the power button. I did not install new software or the like, so I think it could be a problem with the hardware which is very old (HP Omnibook 900b).
How do I make sure it is not a software problem? And is there a way to check if it is a hardware problem without being a technician?
Thank you
Reply With Quote
  #2   (View Single Post)  
Old 7th August 2008
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 3,432
Thanked 214 Times in 189 Posts
Default

Typically, "hangs" occur most often when using X from the console. That's because kernel messages (such as a kernel panic), and dropping into the ddb(4) debugger, are hidden from you.

If you have a "hang" while using X, you can determine if the computer is still running only by access from the network. This requires a network and a second computer. You can see if the box responds to a ping, and if you can connect to a service such as sshd (if the service was configured to run).

The only way to "see" what's on the console (such as a panic followed by ddb) when using X is to connect a serial console. I have one, it's an old Palm PDA, with it's old 9-pin serial connector and some freeware/shareware PDA software that emulates a VT100 terminal.

If whatever is happening allows the OS to keep running, you can look at /var/log/messages and /var/log/daemon for possible messages.

If you're not using X, you usually won't see "hangs" unless there is some sort of hardware problem. Let's find out what your actual symptoms are, first.

Last edited by jggimi; 7th August 2008 at 03:02 PM.
Reply With Quote
  #3   (View Single Post)  
Old 7th August 2008
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 3,432
Thanked 214 Times in 189 Posts
Default

While we're waiting to find out what's going on, here are a couple of things to think about:

1) Do you have sufficient swap space defined? If you run out, your OS can hang or panic, depending on what process requests memory.

2) If you're running X, and missing the panic->drop to ddb, you can disable the drop to ddb by setting sysctl ddb.panic=0. Instead of dropping to ddb, the OS will dump RAM to swap and reboot -- of course, you need swap space for this to happen. See crash(8) and ddb(4) for more info.
Reply With Quote
  #4   (View Single Post)  
Old 7th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default

thank you, I'll check the man pages you say
I'm using X, in fact. Swap should be enough, 200m for 196m of ram
Reply With Quote
  #5   (View Single Post)  
Old 7th August 2008
vermaden's Avatar
vermaden vermaden is offline
Administrator
 
Join Date: Apr 2008
Location: pl_PL.lodz
Posts: 1,051
Thanked 118 Times in 93 Posts
Default

Check RAM with memtest.

Does colling of your CPU works well?
__________________
religions, worst damnation of mankind
"If 386BSD had been available when I started on Linux, Linux would probably never had happened." Linus Torvalds

Linux is not UNIX! Face it! It is not an insult. It is fact: GNU is a recursive acronym for “GNU's Not UNIX”.
vermaden's: links resources deviantart spreadbsd
Reply With Quote
  #6   (View Single Post)  
Old 7th August 2008
BSDfan666 BSDfan666 is offline
Real Name: N/A, this is the interweb.
Helpful companion
 
Join Date: Apr 2008
Location: Ontario, Canada
Posts: 2,223
Thanked 193 Times in 184 Posts
Default

Your computer is newer then most of mine.

Can you post your dmesg here in [code][/code] blocks? it may or may not help us.... help you.

On systems that crash, (I don't own many portable devices.. so some of this may not apply to you.)
  • Run memtest+ (An easy find via Google)
  • Remove any recently installed hardware, software doesn't typically crash a system.. but, it's possible.
  • Determine via dmesg if there are any obvious signs of interrupt routing problems, if so.. disable apm/pcibios and use acpi, or vice versa.
  • Try booting GENERIC.MP(bsd.mp), this kernel *will* work on non-smp systems.. with the added bonus of enabling ioapic(4).

Last edited by BSDfan666; 7th August 2008 at 03:51 PM.
Reply With Quote
  #7   (View Single Post)  
Old 7th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default

I gently stopped my window manager (ion3), and I found ddb running in console with this message:
cbb0: bad Vcc request sock_ctrl 0x0, sock_status 0x30000b19
panic: wdcstart: channel waiting for irq
Stopped at Debugger +0x4 leave

this is the content of the /var/log/messages:
Code:
Aug  7 22:00:01 openhp newsyslog[5022]: logfile turned over
Aug  7 22:00:01 openhp syslogd: restart
Aug  7 23:37:16 openhp reboot: rebooted by sw
Aug  7 23:37:17 openhp syslogd: exiting on signal 15
Aug  7 23:49:13 openhp syslogd: start
Aug  7 23:49:13 openhp /bsd: OpenBSD 4.3 (GENERIC) #698: Wed Mar 12 11:07:05 MDT 2008
Aug  7 23:49:13 openhp /bsd:     deraadt@i386.openbsd.org:/usr/src/sys/arch/i386/compile/GENERIC
Aug  7 23:49:13 openhp /bsd: cpu0: Intel Pentium III ("GenuineIntel" 686-class) 449 MHz
Aug  7 23:49:14 openhp /bsd: cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,SER,MMX,FXSR,SSE
Aug  7 23:49:14 openhp /bsd: real mem  = 200785920 (191MB)
Aug  7 23:49:14 openhp /bsd: avail mem = 185978880 (177MB)
Aug  7 23:49:14 openhp /bsd: mainbus0 at root
Aug  7 23:49:14 openhp /bsd: bios0 at mainbus0: AT/286+ BIOS, date 11/11/99, BIOS32 rev. 0 @ 0xfd790, SMBIOS rev. 2.2 @ 0xf0850 (38 entries)
Aug  7 23:49:14 openhp /bsd: bios0: vendor Phoenix Technologies LTD version "CL.M7.13" date 11/11/99
Aug  7 23:49:14 openhp /bsd: bios0: Hewlett-Packard HP OmniBook PC
Aug  7 23:49:14 openhp /bsd: apm0 at bios0: Power Management spec V1.2
Aug  7 23:49:14 openhp /bsd: apm0: battery life expectancy 100%
Aug  7 23:49:14 openhp /bsd: apm0: AC on, battery charge high, estimated 3:20 hours
Aug  7 23:49:14 openhp /bsd: acpi at bios0 function 0x0 not configured
Aug  7 23:49:14 openhp /bsd: pcibios0 at bios0: rev 2.1 @ 0xfd790/0x870
Aug  7 23:49:14 openhp /bsd: pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfdf50/144 (7 entries)
Aug  7 23:49:14 openhp /bsd: pcibios0: PCI Interrupt Router at 000:07:0 ("Intel 82371FB ISA" rev 0x00)
Aug  7 23:49:14 openhp /bsd: pcibios0: PCI bus #3 is the last bus
Aug  7 23:49:14 openhp /bsd: bios0: ROM list: 0xc0000/0xc800
Aug  7 23:49:14 openhp /bsd: cpu0 at mainbus0
Aug  7 23:49:14 openhp /bsd: pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
Aug  7 23:49:14 openhp /bsd: pchb0 at pci0 dev 0 function 0 "Intel 82443BX AGP" rev 0x03
Aug  7 23:49:14 openhp /bsd: agp0 at pchb0: aperture at 0xe0000000, size 0x4000000
Aug  7 23:49:14 openhp /bsd: ppb0 at pci0 dev 1 function 0 "Intel 82443BX AGP" rev 0x03
Aug  7 23:49:14 openhp /bsd: pci1 at ppb0 bus 1
Aug  7 23:49:15 openhp /bsd: vga1 at pci1 dev 0 function 0 "ATI Mobility 1" rev 0x64
Aug  7 23:49:15 openhp /bsd: wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
Aug  7 23:49:15 openhp /bsd: wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
Aug  7 23:49:15 openhp /bsd: cbb0 at pci0 dev 4 function 0 "TI PCI1225 CardBus" rev 0x01: irq 10
Aug  7 23:49:15 openhp /bsd: cbb1 at pci0 dev 4 function 1 "TI PCI1225 CardBus" rev 0x01: irq 10
Aug  7 23:49:15 openhp /bsd: piixpcib0 at pci0 dev 7 function 0 "Intel 82371AB PIIX4 ISA" rev 0x02
Aug  7 23:49:15 openhp /bsd: pciide0 at pci0 dev 7 function 1 "Intel 82371AB IDE" rev 0x01: DMA, channel 0 wired to compatibility, channel 1 wired to compatibility
Aug  7 23:49:15 openhp /bsd: wd0 at pciide0 channel 0 drive 0: <IBM-DBCA-206480>
Aug  7 23:49:15 openhp /bsd: wd0: 16-sector PIO, LBA, 4887MB, 10009440 sectors
Aug  7 23:49:15 openhp /bsd: wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2
Aug  7 23:49:15 openhp /bsd: pciide0: channel 1 disabled (no drives)
Aug  7 23:49:15 openhp /bsd: uhci0 at pci0 dev 7 function 2 "Intel 82371AB USB" rev 0x01: irq 10
Aug  7 23:49:15 openhp /bsd: piixpm0 at pci0 dev 7 function 3 "Intel 82371AB Power" rev 0x03: SMI
Aug  7 23:49:15 openhp /bsd: iic0 at piixpm0
Aug  7 23:49:15 openhp /bsd: spdmem0 at iic0 addr 0x50: 128MB SDRAM non-parity PC133CL2
Aug  7 23:49:15 openhp /bsd: maestro0 at pci0 dev 8 function 0 "ESS Maestro 2E" rev 0x10: irq 10
Aug  7 23:49:15 openhp /bsd: ac97: codec id 0x83847609 (SigmaTel STAC9721/23)
Aug  7 23:49:15 openhp /bsd: ac97: codec features 18 bit DAC, 18 bit ADC, SigmaTel 3D
Aug  7 23:49:15 openhp /bsd: audio0 at maestro0
Aug  7 23:49:15 openhp /bsd: cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b10
Aug  7 23:49:15 openhp /bsd: cardslot0 at cbb0 slot 0 flags 0
Aug  7 23:49:15 openhp /bsd: cardbus0 at cardslot0: bus 2 device 0 cacheline 0x8, lattimer 0x20
Aug  7 23:49:15 openhp /bsd: pcmcia0 at cardslot0
Aug  7 23:49:15 openhp /bsd: cardslot1 at cbb1 slot 1 flags 0
Aug  7 23:49:16 openhp /bsd: cardbus1 at cardslot1: bus 3 device 0 cacheline 0x8, lattimer 0x20
Aug  7 23:49:16 openhp /bsd: pcmcia1 at cardslot1
Aug  7 23:49:16 openhp /bsd: isa0 at piixpcib0
Aug  7 23:49:16 openhp /bsd: isadma0 at isa0
Aug  7 23:49:16 openhp /bsd: pckbc0 at isa0 port 0x60/5
Aug  7 23:49:16 openhp /bsd: pckbd0 at pckbc0 (kbd slot)
Aug  7 23:49:16 openhp /bsd: pckbc0: using irq 1 for kbd slot
Aug  7 23:49:16 openhp /bsd: wskbd0 at pckbd0: console keyboard, using wsdisplay0
Aug  7 23:49:16 openhp /bsd: pms0 at pckbc0 (aux slot)
Aug  7 23:49:16 openhp /bsd: pckbc0: using irq 12 for aux slot
Aug  7 23:49:16 openhp /bsd: wsmouse0 at pms0 mux 0
Aug  7 23:49:16 openhp /bsd: pcppi0 at isa0 port 0x61
Aug  7 23:49:16 openhp /bsd: midi0 at pcppi0: <PC speaker>
Aug  7 23:49:16 openhp /bsd: spkr0 at pcppi0
Aug  7 23:49:16 openhp /bsd: lpt0 at isa0 port 0x378/4 irq 7
Aug  7 23:49:16 openhp /bsd: npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
Aug  7 23:49:16 openhp /bsd: pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
Aug  7 23:49:16 openhp /bsd: pccom1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
Aug  7 23:49:16 openhp /bsd: usb0 at uhci0: USB revision 1.0
Aug  7 23:49:16 openhp /bsd: uhub0 at usb0 "Intel UHCI root hub" rev 1.00/1.00 addr 1
Aug  7 23:49:16 openhp /bsd: biomask ef65 netmask ef65 ttymask ffe7
Aug  7 23:49:16 openhp /bsd: mtrr: Pentium Pro MTRR support
Aug  7 23:49:16 openhp /bsd: cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b10
Aug  7 23:49:16 openhp /bsd: cbb0: bad Vcc request. sock_ctrl 0x33, sock_status 0x30000b10
Aug  7 23:49:16 openhp /bsd: cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b19
Aug  7 23:49:16 openhp /bsd: cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b10
Aug  7 23:49:16 openhp /bsd: cbb0: bad Vcc request. sock_ctrl 0x33, sock_status 0x30000b10
Aug  7 23:49:17 openhp /bsd: cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b19
Aug  7 23:49:17 openhp /bsd: wdc2 at pcmcia0 function 0 "TS128MFLASHCP, , "cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b10
Aug  7 23:49:17 openhp /bsd: cbb0: bad Vcc request. sock_ctrl 0x33, sock_status 0x30000b10
Aug  7 23:49:17 openhp /bsd:  port 0xa000/16
Aug  7 23:49:17 openhp /bsd: wd1 at wdc2 channel 0 drive 0: <Hitachi CVM1.1.1>
Aug  7 23:49:17 openhp /bsd: wd1: 1-sector PIO, LBA, 122MB, 250368 sectors
Aug  7 23:49:17 openhp /bsd: wd1(wdc2:0:0): using BIOS timings
Aug  7 23:49:17 openhp /bsd: softraid0 at root
Aug  7 23:49:17 openhp /bsd: malo0 at cardbus1 dev 0 function 0 "Marvell Libertas 88W8335" rev 0x03: irq 10, address 00:14:6c:c4:10:04
Aug  7 23:49:17 openhp /bsd: root on wd0a swap on wd0b dump on wd0b
Aug  7 23:49:17 openhp /bsd: WARNING: / was not properly unmounted
Aug  7 23:49:17 openhp savecore: no core dump
Aug  7 23:54:16 openhp ntpd[27328]: 3 out of 5 peers valid
Aug  7 23:54:16 openhp ntpd[27328]: bad peer from pool pool.ntp.org (67.201.12.252)
Aug  7 23:54:16 openhp ntpd[27328]: bad peer from pool pool.ntp.org (66.96.98.9)

I have a flash disk in the pcmcia slot which I use as /tmp. I did that because the hard disk is very small and I never use this flash disk otherwise. Is it that? Or is it the wireless card?

tks
Reply With Quote
  #8   (View Single Post)  
Old 7th August 2008
BSDfan666 BSDfan666 is offline
Real Name: N/A, this is the interweb.
Helpful companion
 
Join Date: Apr 2008
Location: Ontario, Canada
Posts: 2,223
Thanked 193 Times in 184 Posts
Default

Well, look at it from this perspective.

Code:
cbb0 at pci0 dev 4 function 0 "TI PCI1225 CardBus" rev 0x01: irq 10
cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b10
cardslot0 at cbb0 slot 0 flags 0
cardbus0 at cardslot0: bus 2 device 0 cacheline 0x8, lattimer 0x20
pcmcia0 at cardslot0
cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b10
cbb0: bad Vcc request. sock_ctrl 0x33, sock_status 0x30000b10
cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b19
cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b10
cbb0: bad Vcc request. sock_ctrl 0x33, sock_status 0x30000b10
cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b19
wdc2 at pcmcia0 function 0 "TS128MFLASHCP, , "cbb0: bad Vcc request. sock_ctrl 0x0, sock_status 0x30000b10
cbb0: bad Vcc request. sock_ctrl 0x33, sock_status 0x30000b10
port 0xa000/16
wd1 at wdc2 channel 0 drive 0: <Hitachi CVM1.1.1>
wd1: 1-sector PIO, LBA, 122MB, 250368 sectors
wd1(wdc2:0:0): using BIOS timings
It seems clear the first Cardbus device is at fault... can you relocate /tmp and boot without the flash card attached?

Also, if you have some spare time... type boot -c at the boot> prompt, this will take you into the UKC, type "disable apm" and then "quit". (Post the dmesg here again, if that's alright.)
Reply With Quote
  #9   (View Single Post)  
Old 7th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default

I did disable apm, but on reboot it is enabled again. I removed the flashcard and put /tmp in a file.
In fact it has always given this error in dmesg, but worked fine until a few days ago.
So, is it actually the flashcard the problem?
Here is my dmesg



Code:
OpenBSD 4.3 (GENERIC) #698: Wed Mar 12 11:07:05 MDT 2008
    deraadt@i386.openbsd.org:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel Pentium III ("GenuineIntel" 686-class) 449 MHz
cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,SER,MMX,FXSR,SSE
real mem  = 200785920 (191MB)
avail mem = 185978880 (177MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 11/11/99, BIOS32 rev. 0 @ 0xfd790, SMBIOS rev. 2.2 @ 0xf0850 (38 entries)
bios0: vendor Phoenix Technologies LTD version "CL.M7.13" date 11/11/99
bios0: Hewlett-Packard HP OmniBook PC
apm0 at bios0: Power Management spec V1.2
apm0: battery life expectancy 100%
apm0: AC on, battery charge high, estimated 3:20 hours
acpi at bios0 function 0x0 not configured
pcibios0 at bios0: rev 2.1 @ 0xfd790/0x870
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfdf50/144 (7 entries)
pcibios0: PCI Interrupt Router at 000:07:0 ("Intel 82371FB ISA" rev 0x00)
pcibios0: PCI bus #3 is the last bus
bios0: ROM list: 0xc0000/0xc800
cpu0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "Intel 82443BX AGP" rev 0x03
agp0 at pchb0: aperture at 0xe0000000, size 0x4000000
ppb0 at pci0 dev 1 function 0 "Intel 82443BX AGP" rev 0x03
pci1 at ppb0 bus 1
vga1 at pci1 dev 0 function 0 "ATI Mobility 1" rev 0x64
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
cbb0 at pci0 dev 4 function 0 "TI PCI1225 CardBus" rev 0x01: irq 10
cbb1 at pci0 dev 4 function 1 "TI PCI1225 CardBus" rev 0x01: irq 10
piixpcib0 at pci0 dev 7 function 0 "Intel 82371AB PIIX4 ISA" rev 0x02
pciide0 at pci0 dev 7 function 1 "Intel 82371AB IDE" rev 0x01: DMA, channel 0 wired to compatibility, channel 1 wired to compatibility
wd0 at pciide0 channel 0 drive 0: <IBM-DBCA-206480>
wd0: 16-sector PIO, LBA, 4887MB, 10009440 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2
pciide0: channel 1 disabled (no drives)
uhci0 at pci0 dev 7 function 2 "Intel 82371AB USB" rev 0x01: irq 10
piixpm0 at pci0 dev 7 function 3 "Intel 82371AB Power" rev 0x03: SMI
iic0 at piixpm0
spdmem0 at iic0 addr 0x50: 128MB SDRAM non-parity PC133CL2
maestro0 at pci0 dev 8 function 0 "ESS Maestro 2E" rev 0x10: irq 10
ac97: codec id 0x83847609 (SigmaTel STAC9721/23)
ac97: codec features 18 bit DAC, 18 bit ADC, SigmaTel 3D
audio0 at maestro0
cardslot0 at cbb0 slot 0 flags 0
cardbus0 at cardslot0: bus 2 device 0 cacheline 0x8, lattimer 0x20
pcmcia0 at cardslot0
cardslot1 at cbb1 slot 1 flags 0
cardbus1 at cardslot1: bus 3 device 0 cacheline 0x8, lattimer 0x20
pcmcia1 at cardslot1
isa0 at piixpcib0
isadma0 at isa0
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pms0 at pckbc0 (aux slot)
pckbc0: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
spkr0 at pcppi0
lpt0 at isa0 port 0x378/4 irq 7
npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
pccom1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
usb0 at uhci0: USB revision 1.0
uhub0 at usb0 "Intel UHCI root hub" rev 1.00/1.00 addr 1
biomask ef65 netmask ef65 ttymask ffe7
mtrr: Pentium Pro MTRR support
malo0 at cardbus1 dev 0 function 0 "Marvell Libertas 88W8335" rev 0x03: irq 10, address 00:14:6c:c4:10:04
softraid0 at root
root on wd0a swap on wd0b dump on wd0b
umass0 at uhub0 port 1 configuration 1 interface 0 "USB Flash Disk" rev 2.00/2.00 addr 2
umass0: using SCSI over Bulk-Only
scsibus0 at umass0: 2 targets
sd0 at scsibus0 targ 1 lun 0: <Aigo USB, Storage Device, 2.00> SCSI2 0/direct removable
sd0: 125MB, 15 cyl, 255 head, 63 sec, 512 bytes/sec, 256000 sec total
msdosfs: reducing max cluster to 126975 from 126976 due to FAT size
Reply With Quote
Old 7th August 2008
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 3,432
Thanked 214 Times in 189 Posts
Default

Quote:
Originally Posted by gosha View Post
... on reboot it is enabled again.
Using boot -c will allow you to temporarily change some kernel configuration attributes, as described by boot_config(8). Those changes can be made permanent with config(8)'s "-u" option, if warranted, or config(8) may be used to recreate the changes or produce a changed kernel file to select at the boot> prompt.

----

cbb(4) is a cardbus controller. Looking through the source code (src/sys/dev/pci/{pccbb.c,pcbbreg.h} the "Bad Vcc Request" comes from a status code produced by the cardbus socket register.

----

Searching through the misc@ mailing list archives, I saw one posting from 2004 that described the message as spurious. Considering you see kernel messages like these while your system is running and not hanging, I do not believe that cardbus/pcmcia issues are related to your hang.
Reply With Quote
Old 7th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default

I forgot to mention that this dmesg is without the flashcard, if I plug it in it always gives that error
After the last crash, after reboot the system would not work, it was very slow,like 30 seconds to show me the "cd" I just typed. When I finally removed the card everything seems fine after reboot.
I'll try and use it a few days and see what happens. Cooling should be fine, and I don't really do any cpu intensive work, only browsing with w3m and chatting with pidgin, oh also emails with alpine

I'll also try memtest

Thanks a lot for you time, if it keeps hanging I'll come back

Reply With Quote
Old 8th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default

it hanged again the same way, so it definitely is not the flashcard
I did a quick test yesterday with memtester from the packages, now I'm doing a more complete one, but ram seems to be ok

what should I do?
Reply With Quote
Old 8th August 2008
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 3,432
Thanked 214 Times in 189 Posts
Default

  1. As I mentioned above: if your OS panics, by default the console will enter the ddb debugger. Since this only happens when you're running X, you will never see this happen; your OS will "hang" and you will be unable to reach the console. As I recommended: disable the ddb debugger. Set sysctl ddb.panic to 0. Edit /etc/sysctl.conf appropriately. On reboot, ddb will be disabled. If the OS panics, it will dump memory to swap space, and reboot, and the dump will be saved in /var/crash by savecore(8), as described in the crash(8) man page I recommended.
  2. Increase your swap space. Since we do not know your environment, it is possible -- perhaps likely -- that whatever you are doing is consuming all of your swap space, causing the "hang" you are seeing. In addition, if you want to be able to have the OS save the results of the panic and reboot, you will need to have more swap than you have now. Best practice minimum is 2X RAM size, primarily to manage dumps of RAM. You have very little RAM for modern X applications, and I would recommend 512MB or 1G swap space without batting an eye. FAQ 14.4 describes how to add swap space using a file -- note that doing this adds swap space which is not usable with savecore(8). I recommend adding a 2nd swap partition or repartitioning your drive, as necessary. Also, I do not know if your existing 200MB swap space in a partition will be enough to save 192MB -- depending on existing swap usage or control information it may be insufficient, so if you do not do this, and only set the sysctl, you may still see "hangs" .
Once you've set your OS up to dump memory in the event of a panic, while in X the system will appear to hang during the process while RAM is copied to swap space. The time needed is dependent on the speed of the disk drive to save 192MB of information.

If you make these changes, and the system still hangs, install the "stress" package and run various heavy load tests while in console mode, to see if particular types of hardware stresses cause system failures/hangs.

Last edited by jggimi; 8th August 2008 at 07:27 AM.
Reply With Quote
Old 8th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default

I see, thank you for your detailed answer.
In fact yesterday I already added the sysctl.panic=0 as you said in /etc, so as you are guessing it must be some software needing a lot of ram and the swap is not enoug either, since it hanged again.
This laptop only supports this much ram, cannot add anymore. I will try and give more swap for the crash dump to see what happens. But also, I remember reading in the faqs or man pages that dumps are written in to /var, am I right? My /var partition is 300. Anyway I'll try and give more swap and see what happens.
The only thing I added lately is cupsd starting on boot, and I also have often postgresql server running. Could it be those?
I'm of course planning to by a new laptop, but first I want to become proficient with openbsd, so I won't need to mess around with a brand new machine.
I'm running memtest now and everything seems fine with ram.
Reply With Quote
Old 8th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default

I was in console (no X running) and it happened again
I then removed the ddb.panic=0, and it happened again with this message:

Stopped at Xrecurse_legacy4+0x6d incl uvmexp+0x8c

typing "trace" gives:
apm_cpu_idle (lots of numbers) at apm_cpu_idle+0x2c
cpu_idle_cycle (numbers) at cpu_idle_cycle+0vc
Bad frame pointer: (number)


I should add that the display is somewhat faulty, if I move the lid it usually change colors, and now, these days it started to change color by itself, insted of having black I get a green display
Maybe it has to do with this? Maybe it is really time for this old machine to rest for good?

Last edited by gosha; 8th August 2008 at 01:59 PM.
Reply With Quote
Old 8th August 2008
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 3,432
Thanked 214 Times in 189 Posts
Default

You have now confirmed that your system is panicking. Panics happen when the kernel cannot recover from an error.
Unfortunately your post describing the panic doesn't show the actual panic message (which should have been above the "Stopped.." line on the console), so we don't know why it occurred --- just where in memory it happened. I have scanned the source code for both the kernel and for X, I could not find the string "Xrecurse_legacy4".

I cannot provide any diagnosis with the information provided so far.
Saving of RAM and system state after a panic will be attempted, automatically, if ddb.panic=0, or can be done by command from within ddb if ddb.panic=1. The dump of memory happens in two stages:
  1. System state and all of memory are saved into available swap space by the failing kernel, then the system reboots.
  2. During reboot, /etc/rc runs savecore(8), which looks for a dump in swap space, and saves it to a pair of files in /var/crash for later analysis.
In order to successfully capture a dump, you need BOTH sufficient swap space and sufficient filesystem space in /var.
Reply With Quote
Old 8th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default

if I swap to file, will the system use both my swap partition and the file or only the file?
Reply With Quote
Old 8th August 2008
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 3,432
Thanked 214 Times in 189 Posts
Default

OK! I found the reference to Xrecurse_legacy4 -- I'd missed it earlier because I was hunting in C (.c/.h) source, not in Assembler Language source (.s/.S) code.

I found it in /usr/src/sys/arch/i386/i386/vector.s -- interrupt handling for i386.

Your traceback refers to APM -- which is the likely culprit for the panic -- though the specific reason for the problem is unclear without the panic message. But knowing what we know, you can disable APM, as bsdfan666 first suggested we discussed earlier, and see if your problem goes away.

To disable it permanently, use:
# config -ef /bsd
and enter "disable apm" and "quit" as bsdfan666 had shown above. Then reboot.
Reply With Quote
Old 8th August 2008
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 3,432
Thanked 214 Times in 189 Posts
Default

Quote:
Originally Posted by gosha View Post
if I swap to file, will the system use both my swap partition and the file or only the file?
As I mentioned above, during dump to swap, the system will dump to available swap space. On boot, swap space in a file will not be available to savecore(8), so if it dumps any data to swap space assigned to a file, the dump will be lost.
Reply With Quote
Old 8th August 2008
gosha gosha is offline
Spam Deminer
 
Join Date: Jun 2008
Location: China
Posts: 256
Thanked 0 Times in 0 Posts
Default

I see, thanks, I'll try with an external drive

I did not permanently disable apm yet, because for some reason if I disable it my wireless card also does not come up, but if apm is enabled it works fine.
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Your OpenBSD Hardware mfaridi OpenBSD General 67 13th December 2013 01:14 AM
Your NetBSD hardware PhotoJim NetBSD General 12 5th January 2012 07:37 AM
Hardware recommendation: what hardware to buy for my new FreeBSD desktop? Broodjegehaktmetmayo General Hardware 92 11th February 2009 10:43 PM
What Sun hardware do you have revzalot General Hardware 7 22nd August 2008 01:44 PM
hardware not working Terminal-Chaos FreeBSD General 2 29th May 2008 05:32 AM


All times are GMT. The time now is 09:03 AM.


Powered by vBulletin® Version 3.8.4
Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.
Content copyright © 2007-2010, the authors
Daemon image copyright ©1988, Marshall Kirk McKusick