--
You received this message because you are subscribed to the Google Groups "openthread-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to openthread-use...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/openthread-users/979708d0-efc8-4c2a-89f4-2ccdec498ca8n%40googlegroups.com.
Hi Jonathan,

I agree 5 seconds is long, but the lead-up to these lines:

Apr 23 11:48:34 localhost otbr-agent[124449]: [CRIT]-PLAT----: exit(1): ProcessRadioStateMachine line 999, radio tx timeout, Failure
Apr 23 11:48:34 localhost systemd[1]: otbr-agent.service: Main process exited, code=exited, status=1/FAILURE
Apr 23 11:48:34 localhost systemd[1]: otbr-agent.service: Failed with result 'exit-code'.

shows messages being received and responded to continually right up to 1 second before (at 1-second resolution):

Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-PLAT----: processReceive: OK
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-PLAT----: processTransmit: OK
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-MAC-----: Received IPv6 UDP msg, len:141, chksum:c942, from:0x9000, sec:yes, prio:normal, rss:0.0
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-MAC-----: src:[fdde:ad00:0:0:c008:7f34:f8ed:62df]:5683
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-MAC-----: dst:[fdde:ad00:0:0:cc71:d1a5:df06:9614]:5683
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-PLAT----: processReceive: OK
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-PLAT----: processTransmit: OK
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-MAC-----: Received IPv6 UDP msg, len:206, chksum:71ef, from:0x9800, sec:yes, prio:normal, rss:-11.0
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-MAC-----: src:[fdde:ad00:0:0:6293:3689:4ef3:2b53]:5683
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-MAC-----: dst:[fdde:ad00:0:0:cc71:d1a5:df06:9614]:5683
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-PLAT----: processReceive: OK
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-PLAT----: processTransmit: OK
Apr 23 11:48:33 localhost otbr-agent[124449]: [INFO]-PLAT----: processTransmit: OK

So I'm not sure I understand. Are you saying the timeout at 11:48:34 is from a message sent back at 11:43:xx, and that the system kept sending and receiving successfully for the 5 seconds after this message failed?
The code reference you gave me has the comment:

// Frame has been successfully passed to radio, but no `TransmitDone` event received within TX_WAIT_US.

So is this timeout from the otbr-agent sending a message to the RCP and not getting back a response?
Does this one instance of timeout trigger the process which results in the wpan0 shutting down and restarting?
Does the protocol employ retries to try to recover, or is a single timeout enough to warrant shutting down wpan0?
Is the OTBR sending a message to reset the RCP at:

Apr 23 11:48:39 localhost otbr-agent[126299]: [INFO]-PLAT----: RCP reset: RESET_SOFTWARE

or is it just recording that the RCP restarted? I am trying to understand cause and effect.
Is there any configuration around this process which might help?
I will need to hear back from Silabs on whether the RCP is responsible for the missing response, but it seems to me the otbr-agent should be more robust than to take such drastic action after one message response timeout when the messages after it are still being processed successfully.
Can you comment on how long the network should normally run continuously without resetting itself?
The recovery also seems to take a long time. Going from:

Apr 23 11:48:34 localhost otbr-agent[124449]: [CRIT]-PLAT----: exit(1): ProcessRadioStateMachine line 999, radio tx timeout, Failure

to just get Thread back up:

Apr 23 11:48:39 localhost otbr-agent[126299]: Thread is up

takes over 5 seconds.
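For reference, here is a rough sketch of how I measured that recovery gap from the journal text. It assumes the syslog-style timestamp occupies the first 15 characters of each line and that the two marker strings ("radio tx timeout" and "Thread is up") appear exactly as in my excerpt; the function names are my own and not part of any OpenThread tooling.

```python
from datetime import datetime

# Two lines copied from the journal excerpt in this message.
LOG = """\
Apr 23 11:48:34 localhost otbr-agent[124449]: [CRIT]-PLAT----: exit(1): ProcessRadioStateMachine line 999, radio tx timeout, Failure
Apr 23 11:48:39 localhost otbr-agent[126299]: Thread is up
"""


def stamp(line, year=2000):
    """Parse the leading 'Mon DD HH:MM:SS' syslog timestamp.

    Syslog timestamps carry no year, so a fixed placeholder year is
    supplied; that is fine for differences within the same day.
    """
    return datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")


def recovery_seconds(log_text):
    """Seconds from the first tx-timeout exit to the next 'Thread is up'.

    Returns None if either marker is missing.
    """
    crash = None
    for line in log_text.splitlines():
        if crash is None and "radio tx timeout" in line:
            crash = stamp(line)
        elif crash is not None and "Thread is up" in line:
            return (stamp(line) - crash).total_seconds()
    return None


print(recovery_seconds(LOG))
```

Because the journal timestamps only have 1-second resolution, the result is accurate to within about a second either way.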
So my otbr-agent doesn't seem to know about ?uart-baudrate=460800 and is ignoring it? Does this sound possible?
Should I specifically update to commit a37e299ff as above or just make sure I have the latest?