Of course, I'll do some of my own tests just to get a feel of the machine.
I think that I'll need a 9-CURRENT snapshot on it to run all 128 CPUs,
right?
_______________________________________________
freebsd...@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "freebsd-hacke...@freebsd.org"
...
> I think that I'll need a 9-CURRENT snapshot on it to run all 128 CPUs,
> right?
A 9.0-BETA1 snapshot, yes.
-Garrett
Well, I'll leave it another half an hour but the 9.9-beta1 shapshot
froze on boot after showing a "SRAT: No CPU found for memory domain 4".
(all this after the traditional "do-nothing" pause of 10-or so minutes
before displaying the copyright banner).
No luck - it's frozen. Linux and Windows Server work fine.
Not sure if hw.memtest.tests tunable has made it into 9.0-BETA1.
Setting it to zero should result in skipping the checks.
You may also try to capture and share a verbose dmesg, if possible.
--
Andriy Gapon
See one line above your question :-)
>> You may also try to capture and share a verbose dmesg, if possible.
>
> I'll take some photos of the screen.
No serial console? :(
>>> Not sure if hw.memtest.tests tunable has made it into 9.0-BETA1.
>>> Setting it to zero should result in skipping the checks.
>>
>> If it did, to what should I set it?
>
> See one line above your question :-)
Sorry :) I blame the ifluence of fan noise on my head :)
>>> You may also try to capture and share a verbose dmesg, if possible.
>>
>> I'll take some photos of the screen.
>
> No serial console? :(
There is on the server side but not on the client... didn't bring my
usb2serial cable.
If it did, to what should I set it?
> You may also try to capture and share a verbose dmesg, if possible.
I'll take some photos of the screen.
This message implies the memory affinity information coming from ACPI
is either non-sensical, or you have an unexpected physical setup where
there really are CPUs with no memory in the local sockets.
You should be able to boot with something like hint.srat.0="disabled"
at the boot loader prompt.
Thanks,
matthew
Unfortunately, neither the memtest or the srat disabling tunables
worked (I also tried disabling srat.4).
My time with the machine is over, so I can't do more testing.
The hint to set would be 'hint.srat.0.disabled=1'.
However, the SRAT code just ignores the table when it encounters an issue like
this, it doesn't hang. Something else later in the boot must have hung.
--
John Baldwin
> However, the SRAT code just ignores the table when it encounters an issue like
> this, it doesn't hang. Something else later in the boot must have hung.
Anyway... that machine can in its maximal configuration be populated
with eight 10-core CPUs, i.e. 80 physical / 160 logical, so here's a
vote from me to bump the shiny new cpuset infrastructure maximum CPU
count to 256 before 9.0.
http://www.supermicro.com/products/system/5U/5086/SYS-5086B-TRF.cfm
Doesn't that (MAXCPU) seriously impact VM usage, lock contention
etc ... ?
I mean, if we have 2 cpus in a machine, but MAXCPU is set to 256, there
is a bunch of "lost" memory and higher levels of lock contention?
I thought that attilio was taking a stab at enhancing this, but at the
current time anything more than a value of 64 for MAXCPU is kind of a
"caveat emptor" area of FreeBSD.
Sean
P.S. I say 64 as yahoo has been running 64 cpus with local patches for
a while, so I know that this works fairly well.
With newest current you can redefine MAXCPU in your kernel config, so
you don't need to bump the default value.
I think 64 as default value is good enough.
Removing MAXCPU dependency from the KBI is an important project
someone should adopt and bring to conclusion.
Thanks,
Attilio
--
Peace can only be achieved by understanding - A. Einstein
>> I mean, if we have 2 cpus in a machine, but MAXCPU is set to 256, there
>> is a bunch of "lost" memory and higher levels of lock contention?
>>
>> I thought that attilio was taking a stab at enhancing this, but at the
>> current time anything more than a value of 64 for MAXCPU is kind of a
>> "caveat emptor" area of FreeBSD.
>
> With newest current you can redefine MAXCPU in your kernel config, so
> you don't need to bump the default value.
> I think 64 as default value is good enough.
>
> Removing MAXCPU dependency from the KBI is an important project
> someone should adopt and bring to conclusion.
That's certainly one half of it and thanks for the work, but the real
question in this thread is what Sean asked: what are the negative
side-effects of simply bumping MAXCPU to 256 by default? AFAIK, there
are not that many structures which are statically sized by MAXCMPU and
most use the runtime-detected smp_cpus?
Well, there are quite a few statically allocated, but as I said,
making the kernel MAXCPU-agnostic (or sort of agnostic) is a goal and
a good project.
Thanks,
Attilio
--
Peace can only be achieved by understanding - A. Einstein