Futex problem

1,458 views
Skip to first unread message

Jonathan MERCIER

unread,
Jul 27, 2016, 9:33:52 AM7/27/16
to Drools Usage
Dear,

I run drools reasoner and after a short moment the process hang.
Indeed strace show:

$ strace -p 11893
Process 11893 attached
futex
(0x7f2b708eb9d0, FUTEX_WAIT, 11894, NULL


and the trace with thread of 1193 show:
$ trace -f -p 11893
Process 11893 attached with 19 threads
[pid 11942] futex(0x7f2b1ae506f4, FUTEX_WAIT_PRIVATE, 27, NULL <unfinished ...>
[pid 11941] futex(0x7f2b0d573c34, FUTEX_WAIT_PRIVATE, 25, NULL <unfinished ...>
[pid 11939] futex(0x7f2b6ff184e4, FUTEX_WAIT_PRIVATE, 23, NULL <unfinished ...>
[pid 11936] futex(0x7f2b19b27d84, FUTEX_WAIT_PRIVATE, 23, NULL <unfinished ...>
[pid 11928] futex(0x7f2b6c197f54, FUTEX_WAIT_PRIVATE, 3, NULL <unfinished ...>
[pid 11907] restart_syscall(<... resuming interrupted call ...> <unfinished ...>
[pid 11906] restart_syscall(<... resuming interrupted call ...> <unfinished ...>
[pid 11905] futex(0x7f2b718a2c60, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
[pid 11904] futex(0x7f2b6c14ae54, FUTEX_WAIT_PRIVATE, 41, NULL <unfinished ...>
[pid 11903] futex(0x7f2b6c146354, FUTEX_WAIT_PRIVATE, 25, NULL <unfinished ...>
[pid 11902] restart_syscall(<... resuming interrupted call ...> <unfinished ...>
[pid 11901] futex(0x7f2b6c024254, FUTEX_WAIT_PRIVATE, 165, NULL <unfinished ...>
[pid 11900] futex(0x7f2b6c022554, FUTEX_WAIT_PRIVATE, 163, NULL <unfinished ...>
[pid 11896] futex(0x7f2b6c020854, FUTEX_WAIT_PRIVATE, 179, NULL <unfinished ...>
[pid 11895] futex(0x7f2b6c01ea54, FUTEX_WAIT_PRIVATE, 171, NULL <unfinished ...>
[pid 11893] futex(0x7f2b708eb9d0, FUTEX_WAIT, 11894, NULL <unfinished ...>
[pid 11932] restart_syscall(<... resuming interrupted call ...> <unfinished ...>
[pid 11908] restart_syscall(<... resuming interrupted call ...> <unfinished ...>
[pid 11932] <... restart_syscall resumed> ) = -1 ETIMEDOUT (Connection timed out)
[pid 11932] futex(0x7f2b6c19aa28, FUTEX_WAKE_PRIVATE, 1) = 0
[pid 11932] futex(0x7f2b6c19aa54, FUTEX_WAIT_BITSET_PRIVATE, 1, {70106, 3497834}, ffffffff) = -1 ETIMEDOUT (Connection timed out)
[pid 11932] futex(0x7f2b6c19aa28, FUTEX_WAKE_PRIVATE, 1) = 0
[pid 11932] futex(0x7f2b6c19aa54, FUTEX_WAIT_BITSET_PRIVATE, 1, {70106, 54255295}, ffffffff) = -1 ETIMEDOUT (Connection timed out)
[pid 11932] futex(0x7f2b6c19aa28, FUTEX_WAKE_PRIVATE, 1) = 0
[pid 11932] futex(0x7f2b6c19aa54, FUTEX_WAIT_BITSET_PRIVATE, 1, {70106, 104481210}, ffffffff) = -1 ETIMEDOUT (Connection timed out)
[pid 11932] futex(0x7f2b6c19aa28, FUTEX_WAKE_PRIVATE, 1) = 0
[pid 11932] futex(0x7f2b6c19aa54, FUTEX_WAIT_BITSET_PRIVATE, 1, {70106, 154701714}, ffffffff <unfinished ...>
[pid 11902] <... restart_syscall resumed> ) = -1 ETIMEDOUT (Connection timed out)
[pid 11902] futex(0x7f2b6c13e728, FUTEX_WAKE_PRIVATE, 1) = 0
[pid 11902] futex(0x7f2b6c13e754, FUTEX_WAIT_BITSET_PRIVATE, 1, {70107, 144161476}, ffffffff <unfinished ...>
[pid 11932] <... futex resumed> )       = -1 ETIMEDOUT (Connection timed out)


these kind of messages is printed something like an infinite loop

As this problem do not appear with my unit test, I am not able to make a minimal example.
I can to provides an executable jar which reproduce well the problem (14Mb)  and corresponding source code

I do not know what I can do with a such problem


Thanks for your help

Best regards

Jonathan MERCIER

unread,
Jul 28, 2016, 7:56:55 AM7/28/16
to Drools Usage
So I let my reasoner to process. After one hour he has finally end to process...
So it is strange as usually the reasoner need 40sec to process the same data.
So the problem come from newest rules. I continue to search why...
Reply all
Reply to author
Forward
0 new messages