Adjusting parameters on the UMSazureSR plugin

37 views
Skip to first unread message

Fidel Gil

unread,
Jan 4, 2023, 10:03:22 AM1/4/23
to UniMRCP
Hello,

Hello I am using MRCP server 1.7.0 with the Azure SR plugin, I have installed and run the diagnostic test of using the bsr1 scenario from the umc-addons.

I have created a copy of the bsr1 scenario and changed it to use a custom audio file. And here comes my problem.

When I set the audio file to be a small audio file, say a single sentence, I get the transcript of it as expected. When I set the audio File to be something larger in my case a 20 second recording, the server gets as reply the following:

MRCP/2.0 243 RECOGNITION-COMPLETE 1 COMPLETE
Channel-Identifier: f56e798f646c4b7f@speechrecog
Completion-Cause: 001 no-match

While on the umc side I get the following output:

[WARN]   No NLSML data available

I have modified the umsazuresr.xml file to not apply single utterances as can be seen here:

   <ws-streaming-recognition
      language="de-DE"
      max-alternatives="1"
      alternatives-below-threshold="false"
      sort-alternatives="false"
      confidence-format="auto"
      confidence-precision="2"
      results-indent="2"
      results-format="standard"
      tag-format="default"
      input-format="default"
      start-of-input="service-originated"
      skip-unsupported-grammars="true"
      transcription-grammar="transcribe"
      grammar-param-separator=";"
      auth-validation-period="480"
      auth-request-timeout="30"
      inter-result-timeout="0"
      input-token="Lexical"
      instance-token="ITN"
      connect-timeout="0"
      auto-reconnect="false"
      max-audio-data-chunks="0"
      max-connection-duration="0"
      graceful-ws-close="false"
      single-utterance="false"
   />
Asides of that while reading the usage guide:
 https://docs.unispeech.io/en/ums/ms-azure/speech-recog/usage

I read that the endpoint could be part of the issue so I changed it from interactive to dicatation. So I specified the endpoint on the cognitive.subscription.key file 


If anyone could point me on the right direction on how to do the speech recognition in continious mode, that would be awesome. Also I am not well versed on the UNImrcp Server logic so it could be that there is some default I have missed 

Kind Regards
Fidel Gil

Arsen Chaloyan

unread,
Jan 26, 2023, 6:48:58 PM1/26/23
to uni...@googlegroups.com
Hi Gil,

Setting the single-utterance attribute to false in the configuration shall be sufficient, unless the parameter is overridden in the recognition request. The server logs located in the /opt/unimrcp/log directory with the sample utterance that you use would be required to identify the problem.

--
You received this message because you are subscribed to the Google Groups "UniMRCP" group.
To unsubscribe from this group and stop receiving emails from it, send an email to unimrcp+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/unimrcp/5d604d9b-2423-48aa-940d-7f4dc4b2229en%40googlegroups.com.


--
Arsen Chaloyan
Author of UniMRCP
http://www.unimrcp.org
Reply all
Reply to author
Forward
0 new messages