DaemonForums  

Go Back   DaemonForums > OpenBSD > OpenBSD Installation and Upgrading

OpenBSD Installation and Upgrading Installing and upgrading OpenBSD.

Reply
 
Thread Tools Display Modes
  #1   (View Single Post)  
Old 6th April 2012
daemonfowl daemonfowl is offline
bsdstudent
 
Join Date: Jan 2012
Location: DaemonLand
Posts: 834
Default wd0a: device timeout writing fsbn ... of .....-.......

Hi !

I was upgrading my imac via ftp , suddently an error message read repeatedly :

Quote:
wd0a : device timeout writing fsbn ......number..... of .....number......
and :
d0 : soft error (corrected)
then it ended with :
Quote:
could not retrieve Tx statistics cancelling automatic rate control
couldn not transmit buffer : TIMEOUT
what does it mean in understandible human language ?
thank you very much

Note : Mates who are upgrading maccpc .. avoid packages , and build ports instead .. the packages list is older than the install sets .. :-)
Reply With Quote
  #2   (View Single Post)  
Old 6th April 2012
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 7,977
Default

Your first quote refers to wd0a -- this is partition 'a' on drive wd0. The device driver was unable to write to the drive within a time limit set by the wd(4) driver. "fsbn" is "file system block number". The message should also describes the physical block number within wd0 and may also provide "cn" which is "cylinder number" and "tn" which is "track" number. These latter two are probably not very useful with a modern drive, since the cylinder/track/sector addressing is virtual, and mapped by drive electronics.

The error was eventually corrected -- meaning that after retrying many, many times, perhaps hundreds or thousands of times, the drive electronics eventually confirmed the write had occurred.

The second block of messages, regarding "Tx" or "Transmit" is a kernel message produced by the ral(4) or rum(4) Ethernet drivers and have nothing directly to do, I think, with your disk I/O trouble. However if the system was "hung" while the I/O retries were occurring, this could explain a network hiccup.

---
Your entirely unrelated "warning" regarding packages should 1) reflect that you mean -current, rather than -release, and 2) does not belong in this thread, and 3) is a transient condition...as you have been told many times, snapshot packages are never exactly in sync with snapshots. They are provided as a courtesy, only.


Last edited by jggimi; 6th April 2012 at 11:46 PM. Reason: typos, clarity
Reply With Quote
  #3   (View Single Post)  
Old 6th April 2012
ocicat ocicat is offline
Administrator
 
Join Date: Apr 2008
Posts: 3,318
Default

Quote:
Originally Posted by daemonfowl View Post
Note : Mates who are upgrading maccpc .. avoid packages , and build ports instead .. the packages list is older than the install sets .. :-)
daemonfowl, while I understand why you make this statement (being a -current user...), there will be readers who are not attuned to the specifics of your situation to understand why this is not good advice. As stated in Section 15.3 of the FAQ:
Quote:
IMPORTANT NOTE: The ports tree is meant for advanced users. Everyone is encouraged to use the pre-compiled binary packages.
In general, this site attempts to provide reasoned advice which stays in step with the wishes & dictates of the project. Anything said in contradiction needs to provide supporting evidence.
Reply With Quote
  #4   (View Single Post)  
Old 6th April 2012
daemonfowl daemonfowl is offline
bsdstudent
 
Join Date: Jan 2012
Location: DaemonLand
Posts: 834
Default

Teacher @jgimmi , and teacher @ocicat : molte gratzie !!!!
your help is much appreciated !!
sorry for the inconvenience , I abide by the golden rule .. yet I wanted to save others times by stating that I failed to install packages after an upgrade ( 5.1 GENERIC#172 macppc) .. but again on i386 .. eveything is runing fine : I mean to this very moment I am installing packages without problems (5.1 GENERIC.MP#230 i386)
Reply With Quote
  #5   (View Single Post)  
Old 7th April 2012
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 7,977
Default

Back to the thread's topic -- your "temporary" I/O error: what might have been the problem, and why did it suddenly resolve after a long time passed by?

Let us assume, for the sake of this discussion, that the timeout was 5 seconds, and your drive spins at a typical 7200rpm.

During that 5 seconds, 600 times the sector passed underneath the head and the write failed. Why would it fail ~600 times and then ... suddenly succeed? Keep in mind:
  • Writes are not tested after being written to the physical media. You won't find out if you've written to a bad sector until you later attempt to read it. However, if the sector is n
  • Writes and reads typically share the use of a cache buffer on the drive electronics. This chunk of RAM improves performance, though on write operations, the drive does not report the write is complete until the block(s) are actually written to the disk.
  • If the drive was busy having trouble on a prior read, and was busy retrying, and placed the write into a queue until the read was either successful or failed.... you would have had other indications of this, such as a read timeout.
There are two possibilities: 1) Perhaps that the sector could not be located by the drive electronics, or the sector header was damaged and could not be read. The sudden success would be due to the electronics selecting a replacement sector from its built-in spares and remapping the sector. Or .... 2) Perhaps the drive was unable to seek to the track, and a realignment exercise was conducted by the drive to correct the error, which took five seconds.

If you are interested in finding out which it was, your drive electronics may be able to tell you. There is data stored on the electronics, through a technology called SMART. The atactl(8) program can read it, but I find sysutil/smartmontools much easier to use.

Last edited by jggimi; 7th April 2012 at 01:14 AM. Reason: typo, addition of SMART paragraph and link. Added seek error as a plausible root cause
Reply With Quote
  #6   (View Single Post)  
Old 7th April 2012
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 7,977
Default

I've revised my thoughts on possible reasons for the error -- not just a sector access error; seek errors may have caused this also. Posting revised.
Reply With Quote
  #7   (View Single Post)  
Old 7th April 2012
daemonfowl daemonfowl is offline
bsdstudent
 
Join Date: Jan 2012
Location: DaemonLand
Posts: 834
Default

thank you for this detailed explanation!
Quote:
not just a sector access error; seek errors may have caused this also
do we describe a sector access error as a hardware-specific problem , while seek errors as software ?
I've read the atactl man page .. it's interesting what kind of information one can get from it .. why isn't there a satactl ?
Reply With Quote
  #8   (View Single Post)  
Old 7th April 2012
jggimi's Avatar
jggimi jggimi is offline
More noise than signal
 
Join Date: May 2008
Location: USA
Posts: 7,977
Default

Seek errors are hardware errors. SATA drives are ATA and may be used with atactl.
Reply With Quote
  #9   (View Single Post)  
Old 7th April 2012
daemonfowl daemonfowl is offline
bsdstudent
 
Join Date: Jan 2012
Location: DaemonLand
Posts: 834
Default

I left that pseudo-Note unchanged so that Ocicat & jgimmi 's helpful replies stay relevant .. one more extra reason against using ports on legacy hardware : mozilla-firefox failed to build on old ibook because of memory exhaustion ! .. this was the error message I received a while ago after one night spent on port-building .. :-)
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
NetBSD FSBN Problem SL6-A2000 NetBSD General 2 9th August 2011 09:13 AM
System stops booting after "GEOM_MIRROR: Force device gm0 start due to timeout." indienick FreeBSD General 1 25th March 2010 09:46 PM
c++: writing to the *this pointer? robbak Programming 2 23rd October 2009 06:12 PM
/bsd: ral0: device timeout revzalot OpenBSD General 3 22nd July 2008 10:15 PM
identifying device associated with USB device? spiderpig OpenBSD General 2 7th July 2008 05:18 AM


All times are GMT. The time now is 09:13 PM.


Powered by vBulletin® Version 3.8.4
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Content copyright © 2007-2010, the authors
Daemon image copyright ©1988, Marshall Kirk McKusick