I don't know if it's the case with FreeBSD, but on OpenBSD, using the raw character device node instead of the block device is better, as it bypasses the kernel buffer cache, and this can even improve speeds (..using a larger block size as stated by Carpetsmoker).
|