Azure-SR plugin no-input-timeout is not triggered

10 views
Skip to first unread message

Roberto Lucchetti

unread,
May 13, 2024, 9:07:27 AMMay 13
to UniMRCP
Hi All, I have an issue with the Azure-SR plugin that non sends back the no-input-timeout when no audio is received during a recognize.
The RECOGNIZE arrives with No-Input-Timeout: 10000 and the IVR did not send any audio, but nothing happen after 10s, till the client sends a STOP.

Here the log, any idea? Thanks in advance.

2024-05-13 08:26:18:773466 [INFO]   Receive MRCPv2 Data 172.31.1.68:1564 <-> 172.31.1.68:59082 [511 bytes]
MRCP/2.0 511 RECOGNIZE 1
Channel-Identifier: 52e5f9134cf944b5@speechrecog
Content-Type: text/uri-list
Logging-Tag: sLAAAAAEAAADSPA9xjwEAAAEAAA
Cancel-If-Queue: false
No-Input-Timeout: 10000
Recognition-Timeout: 60000
Speech-Complete-Timeout: 5000
Speech-Incomplete-Timeout: 10000
Confidence-Threshold: 0.2
Speed-Vs-Accuracy: 1.0
Sensitivity-Level: 0.2
Speech-Language: pt-BR
Start-Input-Timers: true
N-Best-List-Length: 1
Recognition-Mode: normal
Content-Length: 25

builtin:speech/transcribe
2024-05-13 08:26:18:773553 [INFO]   Assign Control Channel <52e5f9134cf944b5@speechrecog> to Connection 172.31.1.68:1564 <-> 172.31.1.68:59082 [0] -> [1]
2024-05-13 08:26:18:773583 [INFO]   Process RECOGNIZE Request <52e5f9134cf944b5@speechrecog> [1]
2024-05-13 08:26:18:773637 [DEBUG]  Add Speech Grammar [transcribe] <52e5f9134cf944b5@azuresr>
2024-05-13 08:26:18:773703 [INFO]   Init Speech Detector: frame-size=160, max-frame-count=1050, output-frame-count=20, vad-mode=0, noinput-timeout=10000 ms, input-timeout=60000 ms, start-timeout=50 ms, complete-timeout=5000 ms, incomplete-timeout=10000 ms, leading-silence=300 ms, trailing-silence=300 ms, interim-results=1, start-of-input=external <52e5f9134cf944b5>
2024-05-13 08:26:18:773878 [INFO]   Open Waveform File for Writing /opt/unimrcp/var/sr_waveform/umsazuresr-52e5f9134cf944b5-1-8-kHz.wav, sampling-rate [8000]
2024-05-13 08:26:18:775148 [INFO]   Set Subscription Profile [cognitive.subscription.key] <52e5f9134cf944b5@azuresr>
2024-05-13 08:26:18:775168 [INFO]   Set Recognition Mode [interactive] <52e5f9134cf944b5@azuresr>
2024-05-13 08:26:18:775214 [INFO]   Create Resampler from 8000 Hz to 16000 Hz <52e5f9134cf944b5@azuresr>
2024-05-13 08:26:18:775250 [INFO]   Initiate WS connection <52e5f9134cf944b5> guid [e0e34549f9b249b28ecfa03574868c7e] uri [https://westeurope.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?format=detailed&language=pt-BR]
2024-05-13 08:26:18:775520 [INFO]   Open Waveform File for Writing /opt/unimrcp/var/sr_waveform/umsazuresr-52e5f9134cf944b5-1-16-kHz.wav, sampling-rate [16000]
2024-05-13 08:26:18:776044 [INFO]   Process RECOGNIZE Response <52e5f9134cf944b5@speechrecog> [1]
2024-05-13 08:26:18:776051 [INFO]   State Transition IDLE -> RECOGNIZING <52e5f9134cf944b5@speechrecog>
2024-05-13 08:26:18:776089 [INFO]   Send MRCPv2 Data 172.31.1.68:1564 <-> 172.31.1.68:59082 [83 bytes]
MRCP/2.0 83 1 200 IN-PROGRESS
Channel-Identifier: 52e5f9134cf944b5@speechrecog


2024-05-13 08:26:19:029204 [DEBUG]  WS event cb <52e5f9134cf944b5>
2024-05-13 08:26:19:029230 [INFO]   WS connected <52e5f9134cf944b5>
2024-05-13 08:26:19:029233 [INFO]   WS upgrade guid [e0e34549f9b249b28ecfa03574868c7e] <52e5f9134cf944b5>
2024-05-13 08:26:19:029324 [DEBUG]  WS write cb <52e5f9134cf944b5>
2024-05-13 08:26:19:118623 [DEBUG]  WS read cb <52e5f9134cf944b5>
2024-05-13 08:26:19:118678 [INFO]   WS upgraded <52e5f9134cf944b5>
HTTP/1.1 101 Switching Protocols
Connection: Upgrade
Date: Mon, 13 May 2024 08:26:18 GMT
Server: Kestrel
Upgrade: websocket
Strict-Transport-Security: max-age=31536000; includeSubDomains
Sec-WebSocket-Accept: UN4MFWTqr36gkOu7Uu8pTsbJ/N4=


2024-05-13 08:26:19:118778 [INFO]   Send WS msg [232 bytes] <52e5f9134cf944b5>
Path:speech.config
X-Timestamp:2024-05-13T08:26:19.118697
Content-Type:application/json; charset=utf-8

{"context":{"system":{"version":"2.0.12341"}},"device":{"manufacturer":"UniSpeech","model":"umsazuresr","version":"1.25.1"}}
2024-05-13 08:26:19:118867 [DEBUG]  WS write cb <52e5f9134cf944b5>
2024-05-13 08:26:45:453584 [INFO]   Receive MRCPv2 Data 172.31.1.68:1564 <-> 172.31.1.68:59082 [72 bytes]
MRCP/2.0 72 STOP 2
Channel-Identifier: 52e5f9134cf944b5@speechrecog

Reply all
Reply to author
Forward
0 new messages