Subject:Re: Fresh installed Freebsd 9 don't boot from hd
From:Andriy Gapon (av@FreeBSD.org)
Date:Oct 23, 2011 10:57:36 am
List:org.freebsd.freebsd-current

on 23/10/2011 18:27 Dennis Koegel said the following:

On Fri, Oct 21, 2011 at 04:33:38PM -0400, John Baldwin wrote:

Working offline with Dennis, we found that changing the CFLAGS in sys/boot/i386/gptboot/Makefile from "-O1" to "-Os -mrtd" (partially reverting an earlier commit) fixed gptboot. The next test for someone to do would be to try just adding "-mrtd" and leaving "-O1" as-is to see if that fixes it.

More test results:

gcc -Os -fno-guess-branch-probability -fomit-frame-pointer -fno-unit-at-a-time \
    -mno-align-long-strings -mrtd [from before r225530]: Boots OK
gcc -Os -mrtd: Boots OK
gcc -O1 -mrtd: Fails
gcc -O1: Fails
gcc -O0: Fails
gcc -Os: Boots OK

clang -O1: Fails
clang -Os: Fails
clang -Oz: Fails

I've put some printf()s into gpt{,boot}.c to trace where the reboot is triggered. It appears to be in drvsize() (called from gptread()). OTOH the debug output may have changed where the problem occurs, I don't know about that.

With 9.0R drawing near, CFLAGS should be s/-O1/-Os/, until we can figure out what happens. But as for why gcc's magic -Os is required and clang's output doesn't work at all, I'm clueless.

Thank you for your very valuable analysis!

I looked at the difference in the assembly code of the drvsize function produced by gcc -Os and by gcc -O1. One thing that was immediately obvious is that gcc places the params array and the sectors variable in a different order for different options. One idea is that if the BIOS actually writes beyond the end of the array, then in one case it could be harmless (it overwrites the sectors variable), but in the other case it could be more harmful.

I found a document that suggests a possibility of the BIOS writing more bytes to the array than its current size of 0x42:
http://www.t13.org/documents/UploadedDocuments/docs2008/e08134r1-BIOS_Enhanced_Disk_Drive_Services_4.0.pdf

Of course, the size of the array is passed to BIOS at the start of the array and so a _non-buggy_ BIOS should not write beyond the array, but we live in a non-perfect world.
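To make the hypothesis concrete, here is a minimal user-space sketch, not the boot code itself. It assumes a hypothetical buggy BIOS (fake_bios_fill) that writes 0x4A bytes no matter what size the caller declared, and it counts how many sentinel bytes placed right after the buffer get clobbered. The names, the 0x4A write size and the sentinel scheme are all assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define DECLARED_SIZE 0x42   /* current size of params[] in drvsize() */
#define ASSUMED_WRITE 0x4A   /* assumption: what a buggy BIOS might write */
#define GUARD_SIZE    16     /* sentinel bytes placed after the buffer */
#define GUARD_BYTE    0xA5
#define FRAME_SIZE    (ASSUMED_WRITE + GUARD_SIZE)

/*
 * Hypothetical buggy BIOS: fills `written` bytes and never looks at the
 * size the caller stored in the first 16-bit word of the buffer.
 */
static void
fake_bios_fill(unsigned char *buf, size_t written)
{
	memset(buf, 0x11, written);
}

/* Count sentinel bytes clobbered past a buffer of `declared` bytes. */
static size_t
overrun_bytes(size_t declared, size_t written)
{
	unsigned char frame[FRAME_SIZE];
	uint16_t size_field = (uint16_t)declared;
	size_t i, clobbered = 0;

	assert(declared >= sizeof(size_field));
	assert(declared + GUARD_SIZE <= sizeof(frame));
	assert(written <= sizeof(frame));

	/* Fill everything with sentinels, then store the declared size. */
	memset(frame, GUARD_BYTE, sizeof(frame));
	memcpy(frame, &size_field, sizeof(size_field));

	fake_bios_fill(frame, written);

	for (i = declared; i < declared + GUARD_SIZE; i++)
		if (frame[i] != GUARD_BYTE)
			clobbered++;
	return (clobbered);
}
```

Under these assumptions, overrun_bytes(DECLARED_SIZE, ASSUMED_WRITE) reports 8 clobbered bytes, while overrun_bytes(ASSUMED_WRITE, ASSUMED_WRITE) reports none, which is the effect a larger params array would be expected to have if the hypothesis holds.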

Could you please test this hypothesis by trying the following patch?

diff --git a/sys/boot/i386/common/drv.c b/sys/boot/i386/common/drv.c
index 11f6628..5996a80 100644
--- a/sys/boot/i386/common/drv.c
+++ b/sys/boot/i386/common/drv.c
@@ -37,10 +37,10 @@ __FBSDID("$FreeBSD$");
 uint64_t
 drvsize(struct dsk *dskp)
 {
-	unsigned char params[0x42];
+	unsigned char params[0x4A];
 	uint64_t sectors;
 
-	*(uint32_t *)params = sizeof(params);
+	*(uint16_t *)params = sizeof(params);
 
 	v86.ctl = V86_FLAGS;
 	v86.addr = 0x13;

P.S. the assembly diff to which I referred above:

--- drvsize.Os.S 2011-10-23 20:17:56.871996966 +0300
+++ drvsize.O1.S 2011-10-23 20:18:27.430995560 +0300
@@ -4,8 +4,8 @@
 	pushl %ebp
 	movl %esp, %ebp
 	subl $76, %esp
-	leal -74(%ebp), %ecx
-	movl $66, -74(%ebp)
+	leal -66(%ebp), %ecx
+	movl $66, -66(%ebp)
 	movl $262144, __v86
 	movl $19, __v86+4
 	movl $18432, __v86+24
@@ -28,20 +28,20 @@
 	pushl %eax
 	pushl $.LC4
 	call printf
-	xorl %eax, %eax
-	xorl %edx, %edx
-	popl %ecx
-	popl %ecx
+	movl $0, %eax
+	movl $0, %edx
+	addl $8, %esp
 	jmp .L16
+	.p2align 2,,3
 .L14:
 	pushl $8
-	leal -58(%ebp), %eax
+	leal -50(%ebp), %eax
 	pushl %eax
-	leal -8(%ebp), %eax
+	leal -76(%ebp), %eax
 	pushl %eax
 	call memcpy
-	movl -8(%ebp), %eax
-	movl -4(%ebp), %edx
+	movl -76(%ebp), %eax
+	movl -72(%ebp), %edx
 	addl $12, %esp
 .L16:
 	leave
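
The frame offsets in the two listings can be turned into a small worked calculation. This is only a sketch: it takes the params offsets (-74 and -66) and the sectors offsets (-8 and -76) from the listings, takes the saved %ebp at 0(%ebp) and the return address at 4(%ebp) from the standard prologue shown there, and assumes (this part is not confirmed) that the BIOS writes 74 (0x4A) bytes into the 66-byte array. The helper names are hypothetical.

```c
#include <assert.h>

#define PARAMS_SIZE   66  /* 0x42, from "movl $66, ..." in both listings */
#define ASSUMED_WRITE 74  /* 0x4A: assumption about a buggy BIOS */

/* Number of bytes shared by [a_start, a_end) and [b_start, b_end). */
static int
overlap(int a_start, int a_end, int b_start, int b_end)
{
	int lo, hi;

	assert(a_start <= a_end && b_start <= b_end);
	lo = a_start > b_start ? a_start : b_start;
	hi = a_end < b_end ? a_end : b_end;
	return (hi > lo ? hi - lo : 0);
}

/*
 * How many bytes of an object at obj_off(%ebp), obj_size bytes long,
 * are hit by a write that runs past the end of params at
 * params_off(%ebp).  All offsets are relative to %ebp.
 */
static int
overrun_hits(int params_off, int obj_off, int obj_size)
{
	int start = params_off + PARAMS_SIZE;   /* first byte past params */
	int end = params_off + ASSUMED_WRITE;   /* one past the last write */

	return (overlap(start, end, obj_off, obj_off + obj_size));
}
```

With the -Os layout, overrun_hits(-74, -8, 8) gives 8: the excess lands exactly on sectors, which is overwritten by memcpy afterwards anyway. With the -O1 layout, overrun_hits(-66, 0, 8) gives 8 while overrun_hits(-66, -76, 8) gives 0: the excess would land on the saved %ebp and the return address instead, which would be consistent with the observed reboot if the assumed write size is right.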