Linux host FreeBSD guest + two possible fixes

Adam Goska

unread,

Dec 3, 2021, 12:50:46 PM12/3/21

to syzk...@googlegroups.com

Hello,

I've been working on getting syzkaller setup on a freebsd guest from a linux host using qemu/kvm. I am a student looking to build up some experience and get the career ball rolling, and someone suggested to me that contributing to an open source project is a good place to start. On the FreeBSD project ideas page I found references to syzkaller which caught my interest. I hoped I could start by adding syscall definitions if any are still unsupported and help with the FreeBSD documentation section. In the process of getting everything up and running I made a few changes to correct some errors that I got along the way.

I noticed that the docs are a bit dated, particularly in the section about compiling the executor on the guest. I found a thread in this group which was helpful in getting the executor compilation going without errors. For some reason, I wasn't able to use 'go get' to acquire the syzkaller sources (probably user error, I haven't done much with golang before) and ended up cloning the repo with git. I'm not sure if that will cause issues down the road but please correct me if I am wrong.

Basically, the steps I took were :

1.) Setup a freebsd14 VM from the .qcow2 snapshot and expand the image to 45GB with 'qemu-img resize image.qcow2 +45GB' on the host as well as '/etc/rc.d/growfs onestart' on the guest (45GB is probably overkill, but I kept running out of space during buildworld and added extra to be safe, less would probably work, my guest image ended up using about 22GB out of 47GB in the end). then make the SYZKALLER custom kernel config file according to the current freebsd doc. At first I had issues building the kernel. For me, it worked after running:

make -j4 buildworld && make -j4 buildkernel KERNCONF=SYZKALLER

make -j4 installkernel KERNCONF=SYZKALLER

2.) Install syzkaller dependencies on the host (replace pacman with the relevant package manager for the host OS):

pacman -S bash gcc git gmake go golangci-lint llvm

(golangci-lint was not available in the pacman repos, and was installed from the AUR, so may need to be acquired separate from the other dependencies)

3.) Acquire syzkaller sources via 'git clone' or 'go get' (git clone in my case, correct me if this was the wrong way to do things):

git clone https://github.com/google/syzkaller/

(OR)

go get -d -u https://github.com/google/syzkaller/

4.) Compile the binaries (in host syzkaller directory):

make all HOSTOS=linux HOSTARCH=amd64 TARGETOS=freebsd TARGETARCH=amd64

5.) Start the guest image:

qemu-system-x86_64 -m 2048 -smp 4 -chardev socket,id=SOCKSYZ,server=on,wait=off,host=localhost,port=51727 -mon chardev=SOCKSYZ,mode=control -display none -serial stdio -no-reboot -name VM-0 -device virtio-rng-pci -enable-kvm -device e1000,netdev=net0 -netdev user,id=net0,restrict=on,hostfwd=tcp:127.0.0.1:1569-:22 -hda/path/to/snapshot.qcow2

6.) Copy the syzkaller/executor/* files from the host to the guest (replace $PORT and $DIR/TO/SSHKEY as appropriate):

scp -P $PORT -i $DIR/TO/SSHKEY syzkaller/executor/ root@localhost:/root/

7.) Compile syz-executor (on the guest ):

c++ -o syz-executor executor/executor.cc -pthread -Wall -Wframe-larger-than=8192 -Wparentheses -Werror -O2 -m64 -static -DGOOS_freebsd=1 -DGOARCH_amd64=1 -DGIT_REVISION=\"61f862782082c777ba335aa4b4b08d4f74d7d86e\"

8.) Copy the syz-executor binary from the guest to the host in the syzkaller/bin/freebsd_amd64/ directory (replace $PORT and $SSHKEY as appropriate):

scp -P $PORT -i $SSHKEY root@localhost:/root/syz-executor syzkaller/bin/freebsd_amd64/

9.) Shut down the guest and run syz-manager on the host with:

sys-manager -config freebsd.cfg

At first, I had some issues with ssh authentication, but realized I ran the qemu command with the -snapshot flag which made config changes to the guest not persistent across reboots, removing that flag allowed me to properly configure the ssh keys. Even so, I was getting tons of, "SYZFAIL: tun_id out of range" errors (almost 200 in 30-40 minutes of uptime, near constant), which I didn't see any explicit references to in the docs, on the mailing list, or on syzbot. I saw in the report logs that these errors would appear when syz-executor would try creating a tun/tap interface with an ID higher than 4. Being new to kernel fuzzing, at first I wasn't sure if this was normal behavior or a syzkaller bug. I found this thread which had a brief reference to similar crashes with NetBSD guests and mentions increasing the number of tun/tap devices to resolve the issue. I grep'd around and looked at the commit history finding that not long after that discussion executor/common_bsd.h was changed to give NetBSD guests a MAX_TUN value of 64, higher than the default of 4. I subsequently edited that line (130) from:

#if GOOS_netbsd

to:

#if GOOS_netbsd || GOOS_freebsd

After recompiling, this gave freebsd guests more tun/tap interfaces to work with and eliminated the SYZFAIL crashes. The syz-manager was now able to run properly as far as I am aware. I let it run for 2-3 hours and didn't have a single SYZFAIL so I think it worked.

I also noticed a warning very early into the execution of sys-manager that when calling qemu to start the guest snapshots the 'nowait' option is deprecated and is now 'wait=off' and I was able to edit vm/qemu/qemu.go (line 408) from:

"-chardev", fmt.Sprintf("socket,id=SOCKSYZ,server=on,nowait,host=localhost,port=%v", inst.monport),

to:

"-chardev", fmt.Sprintf("socket,id=SOCKSYZ,server=on,wait=off,host=localhost,port=%v", inst.monport),

After recompiling again this change stopped that particular warning. I'm not sure if either of these changes are desirable to be merged to the main branch , but I wanted to document the process and my rationale for the patches in a way that would be visible to others trying to use syzkaller with freebsd. I'd be happy to put in a PR if it would be helpful to the project as a whole. I'm also willing to update the documentation pages for freebsd to include the steps I took to get things up and running for future users.

I have a few questions after going through the process of setting everything up. I've seen that when a crash is found the automatic reproducing fails each time with a cant_build_freebsd_on_linux error and wonder if this is expected behavior, or another issue that I should be looking into. I have not tried to manually reproduce any crashes thus far, but I expect it has something to do with freebsd using a different make than linux and may just turn reproduction off for now. I also see that when clicking the coverage hyperlink in the web interface, that a coverage profile failed to generate due to a kernel obj directory not being specified. Can the obj directory be copied to the host and specified, or is this functionality not yet supported for freebsd? Lastly, I see that most of the crashes (46 and 75 respectively from 3-4 hours of fuzzing) are either, 'lost connection to test machine' or, 'no output from test machine' which from the docs I understand can be indicative of a kernel lockup or other issue, but is the high rate of these two crash types to be expected, or should I be looking into whether something else is going wrong?

Please let me know if I have misunderstood something along the way or am barking up the wrong tree. I am new to participating in large projects and kernel fuzzing and am open to feedback on how to improve.

Thanks,

Adam Goska

Dmitry Vyukov

unread,

Dec 8, 2021, 11:32:34 AM12/8/21

to Adam Goska, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

On Fri, 3 Dec 2021 at 18:50, 'Adam Goska' via syzkaller
<syzk...@googlegroups.com> wrote:
>
> Hello,
>
> I've been working on getting syzkaller setup on a freebsd guest from a linux host using qemu/kvm. I am a student looking to build up some experience and get the career ball rolling, and someone suggested to me that contributing to an open source project is a good place to start. On the FreeBSD project ideas page I found references to syzkaller which caught my interest. I hoped I could start by adding syscall definitions if any are still unsupported and help with the FreeBSD documentation section. In the process of getting everything up and running I made a few changes to correct some errors that I got along the way.

Hi Adam,

This somehow got into a spam folder for me, so answering only now.
Contributions are welcome. Also +Mark and Michael who actively
contributed FreeBSD support.

If some docs get out of date, updating/improving them may be a good
first Pull Request.

> I noticed that the docs are a bit dated, particularly in the section about compiling the executor on the guest. I found a thread in this group which was helpful in getting the executor compilation going without errors. For some reason, I wasn't able to use 'go get' to acquire the syzkaller sources (probably user error, I haven't done much with golang before) and ended up cloning the repo with git. I'm not sure if that will cause issues down the road but please correct me if I am wrong.

The way go get works by default has changed recently with the
introduction of Go modules. We also had that part copy-pasted across
several docs, and potentially some of these docs were updated, while
others were not.

> Basically, the steps I took were :
>
> 1.) Setup a freebsd14 VM from the .qcow2 snapshot and expand the image to 45GB with 'qemu-img resize image.qcow2 +45GB' on the host as well as '/etc/rc.d/growfs onestart' on the guest (45GB is probably overkill, but I kept running out of space during buildworld and added extra to be safe, less would probably work, my guest image ended up using about 22GB out of 47GB in the end). then make the SYZKALLER custom kernel config file according to the current freebsd doc. At first I had issues building the kernel. For me, it worked after running:
>
> make -j4 buildworld && make -j4 buildkernel KERNCONF=SYZKALLER
>
> make -j4 installkernel KERNCONF=SYZKALLER
>
>
> 2.) Install syzkaller dependencies on the host (replace pacman with the relevant package manager for the host OS):
>
> pacman -S bash gcc git gmake go golangci-lint llvm
>
> (golangci-lint was not available in the pacman repos, and was installed from the AUR, so may need to be acquired separate from the other dependencies)

Installing golangci-lint may even be not necessary not because we
"vendor" it in the /vendor dir and go command should use it from
there(if everything is setup properly).

> 3.) Acquire syzkaller sources via 'git clone' or 'go get' (git clone in my case, correct me if this was the wrong way to do things):
>
> git clone https://github.com/google/syzkaller/
>
> (OR)
>
> go get -d -u https://github.com/google/syzkaller/
>
>
> 4.) Compile the binaries (in host syzkaller directory):
>
> make all HOSTOS=linux HOSTARCH=amd64 TARGETOS=freebsd TARGETARCH=amd64

I think HOSTOS/HOSTARCH should be auto-detected, so no need to pass explicitly.

> 5.) Start the guest image:
>
> qemu-system-x86_64 -m 2048 -smp 4 -chardev socket,id=SOCKSYZ,server=on,wait=off,host=localhost,port=51727 -mon chardev=SOCKSYZ,mode=control -display none -serial stdio -no-reboot -name VM-0 -device virtio-rng-pci -enable-kvm -device e1000,netdev=net0 -netdev user,id=net0,restrict=on,hostfwd=tcp:127.0.0.1:1569-:22 -hda/path/to/snapshot.qcow2
>
>
> 6.) Copy the syzkaller/executor/* files from the host to the guest (replace $PORT and $DIR/TO/SSHKEY as appropriate):
>
> scp -P $PORT -i $DIR/TO/SSHKEY syzkaller/executor/ root@localhost:/root/
>
>
> 7.) Compile syz-executor (on the guest ):
>
> c++ -o syz-executor executor/executor.cc -pthread -Wall -Wframe-larger-than=8192 -Wparentheses -Werror -O2 -m64 -static -DGOOS_freebsd=1 -DGOARCH_amd64=1 -DGIT_REVISION=\"61f862782082c777ba335aa4b4b08d4f74d7d86e\"
>
>
> 8.) Copy the syz-executor binary from the guest to the host in the syzkaller/bin/freebsd_amd64/ directory (replace $PORT and $SSHKEY as appropriate):
>
> scp -P $PORT -i $SSHKEY root@localhost:/root/syz-executor syzkaller/bin/freebsd_amd64/
>
>
> 9.) Shut down the guest and run syz-manager on the host with:
>
> sys-manager -config freebsd.cfg
>
>
> At first, I had some issues with ssh authentication, but realized I ran the qemu command with the -snapshot flag which made config changes to the guest not persistent across reboots, removing that flag allowed me to properly configure the ssh keys. Even so, I was getting tons of, "SYZFAIL: tun_id out of range" errors (almost 200 in 30-40 minutes of uptime, near constant), which I didn't see any explicit references to in the docs, on the mailing list, or on syzbot. I saw in the report logs that these errors would appear when syz-executor would try creating a tun/tap interface with an ID higher than 4. Being new to kernel fuzzing, at first I wasn't sure if this was normal behavior or a syzkaller bug. I found this thread which had a brief reference to similar crashes with NetBSD guests and mentions increasing the number of tun/tap devices to resolve the issue. I grep'd around and looked at the commit history finding that not long after that discussion executor/common_bsd.h was changed to give NetBSD guests a MAX_TUN value of 64, higher than the default of 4. I subsequently edited that line (130) from:
>
> #if GOOS_netbsd
>
> to:
>
> #if GOOS_netbsd || GOOS_freebsd
>
>
> After recompiling, this gave freebsd guests more tun/tap interfaces to work with and eliminated the SYZFAIL crashes. The syz-manager was now able to run properly as far as I am aware. I let it run for 2-3 hours and didn't have a single SYZFAIL so I think it worked.

Frequent SYZFAIL's always indicate a bug in syzkaller sources.

A fix to ifdef's may be a good PR as well.
However, somehow we don't see this SYZFAIL on syzbot:
https://syzkaller.appspot.com/freebsd
so I am not sure what happens/what's the difference.

> I also noticed a warning very early into the execution of sys-manager that when calling qemu to start the guest snapshots the 'nowait' option is deprecated and is now 'wait=off' and I was able to edit vm/qemu/qemu.go (line 408) from:
>
> "-chardev", fmt.Sprintf("socket,id=SOCKSYZ,server=on,nowait,host=localhost,port=%v", inst.monport),
>
> to:
>
> "-chardev", fmt.Sprintf("socket,id=SOCKSYZ,server=on,wait=off,host=localhost,port=%v", inst.monport),

Such fixes are also welcome (provided the new option works with
reasonably old qemu's).

> After recompiling again this change stopped that particular warning. I'm not sure if either of these changes are desirable to be merged to the main branch , but I wanted to document the process and my rationale for the patches in a way that would be visible to others trying to use syzkaller with freebsd. I'd be happy to put in a PR if it would be helpful to the project as a whole. I'm also willing to update the documentation pages for freebsd to include the steps I took to get things up and running for future users.
>
>
> I have a few questions after going through the process of setting everything up. I've seen that when a crash is found the automatic reproducing fails each time with a cant_build_freebsd_on_linux error and wonder if this is expected behavior, or another issue that I should be looking into. I have not tried to manually reproduce any crashes thus far, but I expect it has something to do with freebsd using a different make than linux and may just turn reproduction off for now.

The cant_build_freebsd_on_linux error is expected in the sense that
this is not implemented.
To generate and test a C reproducer syz-manager needs to be able to
compile the C reproducer, but we don't have freebsd cross-compiler on
linux, so this can't be done.
On syzbot you can see that bugs have reproducers:
https://syzkaller.appspot.com/freebsd
But that's because syzbot runs syz-manager on a FreeBSD host.

Potentially syz-manger could come up with at least "syz" reproducer in
such case (if a C compiler is not available).

> I also see that when clicking the coverage hyperlink in the web interface, that a coverage profile failed to generate due to a kernel obj directory not being specified. Can the obj directory be copied to the host and specified, or is this functionality not yet supported for freebsd?

Yes, kernel object and source directories should be on the host and
specified in the manager config.

However, I see we now have some bug on syzbot as well:
https://storage.googleapis.com/syzkaller/cover/ci-freebsd-main.html
Not sure what happens there...

If you are going to look deeper into coverage report generation, you
may find tools/syz-cover utility useful as it makes test iterations
much shorter. You can run syz-manager for some time, then save
/rawcover web page from it, and then invoke tools/syz-cover multiple
time passing in this rawcover file and -kernel_obj/src flags.

> Lastly, I see that most of the crashes (46 and 75 respectively from 3-4 hours of fuzzing) are either, 'lost connection to test machine' or, 'no output from test machine' which from the docs I understand can be indicative of a kernel lockup or other issue, but is the high rate of these two crash types to be expected, or should I be looking into whether something else is going wrong?

Hard to say, it can be both, or actually a mix of both.
The problem with these cases is that there are no good diagnostics
(like for other kernel bugs, like "kernel crashed in this function").
They can only be analyzed by looking at the logs and/or trying to
reproduce them.
But the high rate of these crashes means a problem one way or another
-- either there is a kernel bug that makes the kernel hang/silently
die, or there is some infrastructure issue that harms fuzzing process.
You can also see there is a large number of these on the syzbot:
https://syzkaller.appspot.com/freebsd
But they can be dealt with only by digging deeper into individual
cases. But if you do it, highly likely you will find something to
fix/improve somewhere (in the kernel, in the syzkaller, in the kernel
configuration or debugging features).

> Please let me know if I have misunderstood something along the way or am barking up the wrong tree. I am new to participating in large projects and kernel fuzzing and am open to feedback on how to improve.

You actually understood a lot of things already correctly and on your
own, which is great!

Mark Johnston

unread,

Dec 8, 2021, 12:04:37 PM12/8/21

to Dmitry Vyukov, Adam Goska, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

On Wed, Dec 08, 2021 at 05:32:20PM +0100, Dmitry Vyukov wrote:
> On Fri, 3 Dec 2021 at 18:50, 'Adam Goska' via syzkaller
> <syzk...@googlegroups.com> wrote:
> >
> > Hello,
> >
> > I've been working on getting syzkaller setup on a freebsd guest from a linux host using qemu/kvm. I am a student looking to build up some experience and get the career ball rolling, and someone suggested to me that contributing to an open source project is a good place to start. On the FreeBSD project ideas page I found references to syzkaller which caught my interest. I hoped I could start by adding syscall definitions if any are still unsupported and help with the FreeBSD documentation section. In the process of getting everything up and running I made a few changes to correct some errors that I got along the way.
>
> Hi Adam,
>
> This somehow got into a spam folder for me, so answering only now.
> Contributions are welcome. Also +Mark and Michael who actively
> contributed FreeBSD support.

For some reason much (maybe 25%) of the syzkaller mail I get goes to
spam. Including some mail from you, ironically. :)

I've only ever used FreeBSD/bhyve as a syzkaller host so I can't provide
much specific advice on setting up on a Linux host with QEMU. I'm happy
to help review pull requests though, and I left a few specific comments
below.

I will note that the FreeBSD main branch currently has a regression
involving RLIMIT_STACK which affects syz-executor. This is the cause of
the "failing" status on the syzbot/FreeBSD page. I'm actively working
on it today, so with a bit of luck it will be resolved shortly.

> > I also see that when clicking the coverage hyperlink in the web interface, that a coverage profile failed to generate due to a kernel obj directory not being specified. Can the obj directory be copied to the host and specified, or is this functionality not yet supported for freebsd?
>
> Yes, kernel object and source directories should be on the host and
> specified in the manager config.
>
> However, I see we now have some bug on syzbot as well:
> https://storage.googleapis.com/syzkaller/cover/ci-freebsd-main.html
> Not sure what happens there...
>
> If you are going to look deeper into coverage report generation, you
> may find tools/syz-cover utility useful as it makes test iterations
> much shorter. You can run syz-manager for some time, then save
> /rawcover web page from it, and then invoke tools/syz-cover multiple
> time passing in this rawcover file and -kernel_obj/src flags.

Hum. I think checkBinSupport() in symbolizer.go won't work on FreeBSD:
we have a different addr2line implementation (from the ELFToolChain
project rather than GNU binutils) which won't pass the tests there. In
particular:

$ addr2line --help
Usage: addr2line [options] hexaddress...
Map program addresses to source file names and line numbers.

Options:
...
$ echo $?
1

This is a bug IMO and I'll get it fixed, but that won't help syzbot
unless the manager host is updated. The check for supported targets
won't work either.

> > Lastly, I see that most of the crashes (46 and 75 respectively from 3-4 hours of fuzzing) are either, 'lost connection to test machine' or, 'no output from test machine' which from the docs I understand can be indicative of a kernel lockup or other issue, but is the high rate of these two crash types to be expected, or should I be looking into whether something else is going wrong?
>
> Hard to say, it can be both, or actually a mix of both.
> The problem with these cases is that there are no good diagnostics
> (like for other kernel bugs, like "kernel crashed in this function").
> They can only be analyzed by looking at the logs and/or trying to
> reproduce them.
> But the high rate of these crashes means a problem one way or another
> -- either there is a kernel bug that makes the kernel hang/silently
> die, or there is some infrastructure issue that harms fuzzing process.
> You can also see there is a large number of these on the syzbot:
> https://syzkaller.appspot.com/freebsd
> But they can be dealt with only by digging deeper into individual
> cases. But if you do it, highly likely you will find something to
> fix/improve somewhere (in the kernel, in the syzkaller, in the kernel
> configuration or debugging features).

Yes, I am sure there are some kernel bugs lurking here. I have not
invested much time into debugging them recently. In the past I would
sometimes rm -rf the crash database and start syzkaller again in the
hopes that it can find a reproducer for a particular hang. This has
borne some fruit, for instance it found a nasty deadlock in UFS which
has since been fixed. I would like to investigate these hangs further
though.

Dmitry Vyukov

unread,

Dec 8, 2021, 12:10:15 PM12/8/21

to Mark Johnston, Adam Goska, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

FWIW linux has a number of debugging facilities that help to diagnose
various dead- and live-locks (hangs, stalls, unkillable tasks, etc).
These are useful and help to move at least of bugs from "lost
connection"/"no output" bucket into more concrete bug buckets.

tue...@fh-muenster.de

unread,

Dec 8, 2021, 1:57:02 PM12/8/21

to Mark Johnston, Dmitry Vyukov, Adam Goska, syzk...@googlegroups.com, Mark Johnston

> On 8. Dec 2021, at 18:04, Mark Johnston <ma...@freebsd.org> wrote:
>
> On Wed, Dec 08, 2021 at 05:32:20PM +0100, Dmitry Vyukov wrote:
>> On Fri, 3 Dec 2021 at 18:50, 'Adam Goska' via syzkaller
>> <syzk...@googlegroups.com> wrote:
>>>
>>> Hello,
>>>
>>> I've been working on getting syzkaller setup on a freebsd guest from a linux host using qemu/kvm. I am a student looking to build up some experience and get the career ball rolling, and someone suggested to me that contributing to an open source project is a good place to start. On the FreeBSD project ideas page I found references to syzkaller which caught my interest. I hoped I could start by adding syscall definitions if any are still unsupported and help with the FreeBSD documentation section. In the process of getting everything up and running I made a few changes to correct some errors that I got along the way.
>>
>> Hi Adam,
>>
>> This somehow got into a spam folder for me, so answering only now.
>> Contributions are welcome. Also +Mark and Michael who actively
>> contributed FreeBSD support.
>
> For some reason much (maybe 25%) of the syzkaller mail I get goes to
> spam. Including some mail from you, ironically. :)
>
> I've only ever used FreeBSD/bhyve as a syzkaller host so I can't provide
> much specific advice on setting up on a Linux host with QEMU. I'm happy
> to help review pull requests though, and I left a few specific comments
> below.

Same for me. No (substantial) experience with Linux...

Best regards
Michael

Adam Goska

unread,

Dec 10, 2021, 12:34:04 AM12/10/21

to Dmitry Vyukov, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

Thanks for the responses! I might be at fault for the mail being flagged as spam. The email account I'm using runs on an openbsd VPS that I rented and configured, and while I managed to get SPF/DKIM/DMARC/rDNS working, I've had mixed results on whether the major providers file mails from my server as spam which I believe could be related to the .xyz TLD. Curiously, on Outlook accounts it always goes right to the spam folder, but on my personal Gmail account it has gone to the inbox without issue, funny that it doesn't for google group mails. If it becomes a persistent issue I may switch over to a Gmail account to make things easier, I use this one for mailing lists and the like because it doesn't get too much spam or unwanted messages from various services I've signed up for over the years.

Back on topic, I appreciate all of your input and it's given me a lot to think about. I've put in an initial pull request to update the readme with a working command to compile syz-executor on a freebsd guest. It's mostly as a test to see if I am doing things correctly, formatting the commit message right and the like. Being new to using git/go I'm a bit worried I have messed up somewhere. I had issues with running the presubmit tests, not sure yet if that's related to my go setup not being correct or because I run arch linux on the laptop I'm working this with and 'make install_prerequisites' runs commands meant for debian based distros which aren't compatible with pacman. After a cursory look at the Makefile and repos, some packages look like they might have slightly different naming conventions between the two distros. I'll have to look into this further, but it might be better to set up syz-env instead. I figured because I had only edited a single line of a markdown file without changing any formatting it wasn't critical for me to run the local tests and pulled the trigger anyway, though I hope to get the tests working soon and definitely before I make any changes to files which will affect the compiled binaries.

I'm excited to start exploring some of these crashes! Since my last mail I've attempted to manually reproduce a crash with sys-execprog that seemed to work, though I shot myself in the foot a bit by not running the VM in snapshot mode. The disk seemed to be somewhat corrupted, and the VM wouldn't boot normally again until I entered single user mode and ran chkdsk. I haven't yet minimized the programs that caused the crash to determine the exact cause, but I have successfully reproduced it using syz-execprog with the log file.

As far as the first change I had made to stop the syzfail messages, is it possible that syzbot running a freebsd host configuration compared to my linux+qemu config could be causing the issue? I did find it weird that there were no occurrences of the 'tun device out of range' error on syzbot or reports from other users. The only other explanation I can think of at the moment is that my using git clone instead of go get somehow left out a dependency which would prevent that error, or that my go environment being out of whack in general somehow caused it. I will hold off on submitting a pull request for that change until I understand the problem a little more.

For the second change to modify the qemu flags called to start the fuzzing VMs, I saw that the nowait option has been deprecated since version 6.0, which was released in April 2021. This probably means that it is too early to commit this change just yet, to preserve compatibility with users on older versions of qemu. I didn't notice a significant difference in performance before and after the patch, just that the warning message didn't appear during the startup routine.

Dmitry Vyukov

unread,

Dec 10, 2021, 1:23:40 AM12/10/21

to Adam Goska, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

On Fri, 10 Dec 2021 at 06:34, Adam Goska <a...@goska.xyz> wrote:
> Thanks for the responses! I might be at fault for the mail being flagged as spam. The email account I'm using runs on an openbsd VPS that I rented and configured, and while I managed to get SPF/DKIM/DMARC/rDNS working, I've had mixed results on whether the major providers file mails from my server as spam which I believe could be related to the .xyz TLD. Curiously, on Outlook accounts it always goes right to the spam folder, but on my personal Gmail account it has gone to the inbox without issue, funny that it doesn't for google group mails. If it becomes a persistent issue I may switch over to a Gmail account to make things easier, I use this one for mailing lists and the like because it doesn't get too much spam or unwanted messages from various services I've signed up for over the years.
>
> Back on topic, I appreciate all of your input and it's given me a lot to think about. I've put in an initial pull request to update the readme with a working command to compile syz-executor on a freebsd guest. It's mostly as a test to see if I am doing things correctly, formatting the commit message right and the like.

Nice when people think of this before sending PRs!

> Being new to using git/go I'm a bit worried I have messed up somewhere. I had issues with running the presubmit tests, not sure yet if that's related to my go setup not being correct or because I run arch linux on the laptop I'm working this with and 'make install_prerequisites' runs commands meant for debian based distros which aren't compatible with pacman. After a cursory look at the Makefile and repos, some packages look like they might have slightly different naming conventions between the two distros. I'll have to look into this further, but it might be better to set up syz-env instead. I figured because I had only edited a single line of a markdown file without changing any formatting it wasn't critical for me to run the local tests and pulled the trigger anyway, though I hope to get the tests working soon and definitely before I make any changes to files which will affect the compiled binaries.

Yes, syz-env may be the simplest option.
That's Linux unfortunately, no way to get something like this to work
portably. I think the current instructions are broken on my Debian as
well (but syz-env still works).

Also github always runs all checks. So if you skip some steps locally
(arch is probably the most problematic), Ci will cover you.

> I'm excited to start exploring some of these crashes! Since my last mail I've attempted to manually reproduce a crash with sys-execprog that seemed to work, though I shot myself in the foot a bit by not running the VM in snapshot mode. The disk seemed to be somewhat corrupted, and the VM wouldn't boot normally again until I entered single user mode and ran chkdsk. I haven't yet minimized the programs that caused the crash to determine the exact cause, but I have successfully reproduced it using syz-execprog with the log file.

That's good. Reproducing is the first step.

> As far as the first change I had made to stop the syzfail messages, is it possible that syzbot running a freebsd host configuration compared to my linux+qemu config could be causing the issue? I did find it weird that there were no occurrences of the 'tun device out of range' error on syzbot or reports from other users. The only other explanation I can think of at the moment is that my using git clone instead of go get somehow left out a dependency which would prevent that error, or that my go environment being out of whack in general somehow caused it. I will hold off on submitting a pull request for that change until I understand the problem a little more.

I will leave this to Mark and Michael (I doubt it's go get).

> For the second change to modify the qemu flags called to start the fuzzing VMs, I saw that the nowait option has been deprecated since version 6.0, which was released in April 2021. This probably means that it is too early to commit this change just yet, to preserve compatibility with users on older versions of qemu. I didn't notice a significant difference in performance before and after the patch, just that the warning message didn't appear during the startup routine.

It's more about when "wait=off" was introduced rather than when
"nowait" was deprecated. If "wait=off" was introduced... I don't
know... N years ago, then it's fine to switch to it.

Adam Goska

unread,

Dec 15, 2021, 2:16:12 PM12/15/21

to Mark Johnston, Dmitry Vyukov, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

Thanks for the response my apologies for not getting back sooner. I copied the /usr/src directory to the host and specified it in the config and don't believe there is any issue, but I think I am a little lost when it comes to the kernel_obj config. At first I tried to copy the /usr/obj directory to the host and got a kernel.full no such file error. I thought maybe the /usr/obj directory was not what I needed and tried the /boot/kernel/kernel file, renaming it to kernel.full. Now I'm getting:
'failed to generate coverage profile: failed to parse DWARF: decoding dwarf section info at offset 0x0: too short (set CONFIG_DEBUG_INFO=y on linux)'
I have a feeling I'm just not using the right file. What directory/file should I be looking to copy from the freebsd host?

I also tried to rebuild on the latest revision using syz-env to see if the syzfail: tun_id out of range error would still show up without my change to executor/common_bsd.h and it still was. I'm wondering if it not showing up in syzbot has to do with me using qemu instead of bhyve. Would this be worth a PR, or do you think it would negatively affect other setups?

On the topic of syz-env. When running presubmit I see in the output that I get a lot of passes and no explicit failures which seems good to me, but there are also a lot that have question marks in the results, and a few that were skipped, is this normal?

Dmitry Vyukov

unread,

Dec 16, 2021, 1:53:13 AM12/16/21

to Adam Goska, Mark Johnston, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

I am not sure what are question marks, how do they look like?

We skip some tests when the required toolchain is not available. E.g.
on Linux we don't have means to test FreeBSD builds. syz-env only
contains NewBSD, Fuchsia and Akaros cross-toolchains. If testing is
not supported, testing is skipped.

Adam Goska

unread,

Dec 16, 2021, 12:15:44 PM12/16/21

to Dmitry Vyukov, Mark Johnston, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

At the beginning of those lines of output where I get either pass or okay for other tests is the "?" and toward the end of those lines it says 'no test files' so maybe these are just being skipped as you said. There does seem to be quite a few of them though. I am using the syz-env from the current pull of the repo, is it possible that there is a more recent version in the gcr.io registry?

I'm hoping to get some feedback from Mark on whether increasing MAX_TUN for freebsd is a good move or not. I'm curious why I would be getting those syzfails when others don't seem to be.

I also did some looking into the qemu warning change, and found the commit where the preferred wait=off syntax was introduced, it was back in 2009 so I don't believe that anyone will be using an old enough qemu version to be affected. The qemu deprecated features page says that after a feature is declared deprecated it will stay functional for that release and then one more being liable to be removed after that point. The short form boolean options like nowait were declared deprecated on the 6.0 release and it looks like they're on 6.2 now. Am I clear to put in a PR for this change?

Dmitry Vyukov

unread,

Dec 16, 2021, 1:43:21 PM12/16/21

to Adam Goska, Mark Johnston, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

On Thu, 16 Dec 2021 at 18:15, Adam Goska <a...@goska.xyz> wrote:
>
>
> On 12/16/2021 1:52 AM, Dmitry Vyukov wrote:
>
> On Wed, 15 Dec 2021 at 20:16, Adam Goska <a...@goska.xyz> wrote:
>
>
> Thanks for the response my apologies for not getting back sooner. I copied the /usr/src directory to the host and specified it in the config and don't believe there is any issue, but I think I am a little lost when it comes to the kernel_obj config. At first I tried to copy the /usr/obj directory to the host and got a kernel.full no such file error. I thought maybe the /usr/obj directory was not what I needed and tried the /boot/kernel/kernel file, renaming it to kernel.full. Now I'm getting:
> 'failed to generate coverage profile: failed to parse DWARF: decoding dwarf section info at offset 0x0: too short (set CONFIG_DEBUG_INFO=y on linux)'
> I have a feeling I'm just not using the right file. What directory/file should I be looking to copy from the freebsd host?
>
> I also tried to rebuild on the latest revision using syz-env to see if the syzfail: tun_id out of range error would still show up without my change to executor/common_bsd.h and it still was. I'm wondering if it not showing up in syzbot has to do with me using qemu instead of bhyve. Would this be worth a PR, or do you think it would negatively affect other setups?
>
> On the topic of syz-env. When running presubmit I see in the output that I get a lot of passes and no explicit failures which seems good to me, but there are also a lot that have question marks in the results, and a few that were skipped, is this normal?
>
> I am not sure what are question marks, how do they look like?
>
> We skip some tests when the required toolchain is not available. E.g.
> on Linux we don't have means to test FreeBSD builds. syz-env only
> contains NewBSD, Fuchsia and Akaros cross-toolchains. If testing is
> not supported, testing is skipped.
>
> At the beginning of those lines of output where I get either pass or okay for other tests is the "?" and toward the end of those lines it says 'no test files' so maybe these are just being skipped as you said. There does seem to be quite a few of them though. I am using the syz-env from the current pull of the repo, is it possible that there is a more recent version in the gcr.io registry?

"no test files" means that this package doesn't have any tests at all.
This is nothing to worry about, consider them as "passing" :)

No, there is no recent container on gcr.io. What you fetch by default
is the "latest" tag.

> I'm hoping to get some feedback from Mark on whether increasing MAX_TUN for freebsd is a good move or not. I'm curious why I would be getting those syzfails when others don't seem to be.
>
> I also did some looking into the qemu warning change, and found the commit where the preferred wait=off syntax was introduced, it was back in 2009 so I don't believe that anyone will be using an old enough qemu version to be affected. The qemu deprecated features page says that after a feature is declared deprecated it will stay functional for that release and then one more being liable to be removed after that point. The short form boolean options like nowait were declared deprecated on the 6.0 release and it looks like they're on 6.2 now. Am I clear to put in a PR for this change?

Yes, if the new form was introduced in 2009 it's totally fine to use it now.

Adam Goska

unread,

Dec 16, 2021, 3:39:32 PM12/16/21

to Dmitry Vyukov, Mark Johnston, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

I put in the PR after running presubmit tests locally and not seeing any
failures, but the arch test failed when github ran the checks. On my
machine I'm able to fuzz successfully with these changes and syz-manager
is calling qemu without error. Do you have any insight as to what the
issue may be?

Adam Goska

unread,

Dec 19, 2021, 8:44:57 PM12/19/21

to Dmitry Vyukov, Mark Johnston, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

Nevermind, I see that there is an issue where a lot of PRs are
experiencing arch test failures with the 137 exit code so this is
probably just another case of that.

Dmitry Vyukov

unread,

Dec 20, 2021, 5:36:24 AM12/20/21

to Adam Goska, Mark Johnston, syzk...@googlegroups.com, Mark Johnston, tue...@fh-muenster.de

Yes, unfortunately this happens (probably need to restructure the
'arch' step a bit).
I've re-started CI jobs, let's see if it passes second time.

Reply all

Reply to author

Forward