Hello!
I'm using UniMRCP + FreeSWITCH (mod_unimrcp) with the Google Speech Recognition (GSR) plugin and the Google Dialogflow (GDF) plugin.
UniMRCP v1.7.0
GSR Plugin v1.22.1
GDF Plugin v1.20.1
I'm running into missing inputs with both GDF and GSR.
I receive a detect-speech event with an empty body, even though the MRCP server logs contain the correct transcription of the user input.
It happens quite often, so I don't think it's an edge case.
The UniMRCP server logs contain the following lines:
2023-02-27 15:01:41:951005 [INFO] Speech Detector State Transition NO-INPUT -> IN-PROGRESS [1170 ms] <ef0dedaf887245f8>
2023-02-27 15:01:41:952432 [INFO] Start Input Timer [12000 ms] <ef0dedaf887245f8>
2023-02-27 15:01:41:952650 [INFO] Enable Output Audio: encoding=1, sampling-rate=8000 <ef0dedaf887245f8@gdf>
2023-02-27 15:01:41:952657 [INFO] Init Streaming Request: encoding=1, sampling-rate=8000, language=pl-PL, single-utterance=0, word-info=0 <ef0dedaf887245f8@gdf>
2023-02-27 15:01:41:952661 [INFO] Set Model [default] <ef0dedaf887245f8@gdf>
2023-02-27 15:01:43:077523 [INFO] Received Response: status [1] recog result [1] detect intent response [0] <ef0dedaf887245f8@gdf>
2023-02-27 15:01:43:077557 [INFO] Recognition Result: transcript [imię] confidence [0.00] final [0] end-offset [1:320] <ef0dedaf887245f8@gdf>
2023-02-27 15:01:43:077577 [INFO] Set Result Flag [1000 ms] <ef0dedaf887245f8>
2023-02-27 15:01:43:077612 [INFO] Process START-OF-INPUT Event <ef0dedaf887245f8@speechrecog> [4]
2023-02-27 15:01:43:077641 [INFO] Send MRCPv2 Data 10.168.8.111:1544 <-> 10.168.8.83:55484 [115 bytes]
MRCP/2.0 115 START-OF-INPUT 4 IN-PROGRESS
Channel-Identifier: ef0dedaf887245f8@speechrecog
Input-Type: speech
2023-02-27 15:01:43:206761 [INFO] Received Response: status [1] recog result [1] detect intent response [0] <ef0dedaf887245f8@gdf>
2023-02-27 15:01:43:206808 [INFO] Recognition Result: transcript [Kim jest] confidence [0.00] final [0] end-offset [1:440] <ef0dedaf887245f8@gdf>
2023-02-27 15:01:43:579055 [INFO] Received Response: status [1] recog result [1] detect intent response [0] <ef0dedaf887245f8@gdf>
2023-02-27 15:01:43:579095 [INFO] Recognition Result: transcript [imię] confidence [0.00] final [0] end-offset [1:800] <ef0dedaf887245f8@gdf>
2023-02-27 15:01:45:640933 [INFO] Speech Detector State Transition IN-PROGRESS -> COMPLETE [3690 ms] <ef0dedaf887245f8>
2023-02-27 15:01:45:640964 [INFO] Detector Stats: leading-silence=300 ms, input=2820 ms, trailing-silence=1000 ms <ef0dedaf887245f8>
2023-02-27 15:01:45:641204 [INFO] Input Complete [success] size=65920 bytes, dur=4120 ms <ef0dedaf887245f8@gdf>
As you can see, the log contains transcript [imię], but I'm not receiving that input in the detect-speech event (the body is empty).
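In case it helps anyone compare against their own logs, here is a minimal Python sketch for pulling the Recognition Result entries out of a log like the one above. It assumes the exact line format shown in this excerpt; the names RESULT_RE, extract_results and SAMPLE are only illustrative and are not part of UniMRCP or the plugins.

```python
import re

# Assumption: "Recognition Result" lines look exactly like the ones in the
# excerpt above. Other plugin versions may log a different format.
RESULT_RE = re.compile(
    r"Recognition Result: transcript \[(?P<transcript>.*?)\] "
    r"confidence \[(?P<confidence>[\d.]+)\] "
    r"final \[(?P<final>\d)\] "
    r"end-offset \[(?P<offset>[^\]]*)\] "
    r"<(?P<channel>[^>]+)>"
)


def extract_results(log_text):
    """Return one dict per Recognition Result entry found in log_text."""
    results = []
    # finditer scans the whole text, so entries are found even when two
    # log records end up on the same physical line.
    for match in RESULT_RE.finditer(log_text):
        results.append({
            "channel": match.group("channel"),
            "transcript": match.group("transcript"),
            "confidence": float(match.group("confidence")),
            "final": match.group("final") == "1",
        })
    return results


# The three Recognition Result lines copied from the excerpt above.
SAMPLE = """\
2023-02-27 15:01:43:077557 [INFO] Recognition Result: transcript [imię] confidence [0.00] final [0] end-offset [1:320] <ef0dedaf887245f8@gdf>
2023-02-27 15:01:43:206808 [INFO] Recognition Result: transcript [Kim jest] confidence [0.00] final [0] end-offset [1:440] <ef0dedaf887245f8@gdf>
2023-02-27 15:01:43:579095 [INFO] Recognition Result: transcript [imię] confidence [0.00] final [0] end-offset [1:800] <ef0dedaf887245f8@gdf>
"""

results = extract_results(SAMPLE)
for entry in results:
    print(entry["channel"], entry["transcript"], "final" if entry["final"] else "interim")
```

On the excerpt above this finds three entries for the same channel, and all three are logged with final [0].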
How can this be fixed?