On 11/4/2013 4:34 AM, SKiAt wrote:
> the same here..
> rev A5A with angstrom as provided by factory.
>
> using a stress program that read/write on the nand and is ok I leave the
> board running it for weeks.. and it is ok.. If I login to ssh and ask for
> something else .. bang.. let's see similar panic messages..
>
> I stopped the porting of my application to the new BBB even if it is
> cheaper and has better performance because of this, can anyone help or
> anyone knows about some fix of this in newer kernels?
Hmm...your stack trace doesn't match where mine goes wrong when I get
these hangs:
http://bb-lcnc.blogspot.com/2013/10/hung-task-bug-in-xenomai-kernel.html
...but I suspect it is the same issue. The mmc hang is a known problem
with the BeagleBone 3.8.13 kernel, although it shows up a *LOT* more
once you apply the Xenomai real-time patches (which are not directly
related to the mmc hang but increase IRQ service times enough to
'tickle' the bug a lot more). The reason you don't see hangs until you
login via ssh is likely because you need to have multiple threads
talking to the SD card to trigger the bug.
The solution identified by Rolf Roesch is to cherry-pick a fix for the
problem from the 3.12 kernel source:
http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=7472bab236bdee1173412585591329e718f4d324
This commit seems to fix the problem on both normal and xenomai patched
kernels, and will hopefully fix your problem as well.
I've already added this commit to my xenomai real-time kernel build, but
I was leaving it to Rolf to let everyone know about the cherry-pick fix
and get credit for finding it since he did all the leg-work to figure it
out. Since you're asking about it, though, I figured I should share the
'magic'. :)
Now does anyone know who to push this to so it gets into the upstream
BeagleBone kernel patch-set for 3.8?
--
Charles Steinkuehler
cha...@steinkuehler.net