Unable to override streaming recognition language for umsgdf

266 views
Skip to first unread message

Anchit Dave

unread,
Apr 23, 2020, 2:46:10 AM4/23/20
to UniMRCP
Hello Team,

We are planning to have a multilingual Bot but the challenge we are facing is that we are unable to override the default recognition language.
Here is the default language set in umsgdf:
   <streaming-recognition
      single-utterance="true"
      interim-results="true"
      start-of-input="service-originated"
      language="hi"
      max-alternatives="1"
      project-id=""
      skip-unsupported-grammars="true"
      transcription-grammar="transcribe"
      generate-output-audio="false"
   />

We are using FreeSWITCH's play and detect speech module to interact with the unimrcp server. We have added a parameter "language" just like we add the parameter projected. 
session.execute("play_and_detect_speech","say:<speak><prosody rate="moderate">Welcome to MetroHomes,Hindi or English</prosody></speak> detect:unimrcp:google-mrcp-v2 {language=en,start-input-timers=false,no-input-timeout=50000,recognition-timeout=15000}builtin:speech/transcribe?language=en;projectid=translator-xpqlcp")

Can you please guide us, What am I doing wrong?

Thank you

Doug Rylaarsdam

unread,
Apr 24, 2020, 1:59:11 PM4/24/20
to UniMRCP
Hi, can you share the UniMRCP server logs showing for example the DEFINE-GRAMMAR and RECOGNIZE requests? That might confirm how the detect arguments are being sent in MRCP.

Anchit Dave

unread,
Apr 26, 2020, 6:06:40 PM4/26/20
to UniMRCP
Hello Doug,

Sorry for the late reply

Here is the unimrcp server log:
2020-04-26 21:43:28:742632 [INFO]   Receive SIP Event [nua_i_invite] Status 100 Trying [SIP-Agent-GDF]
2020-04-26 21:43:28:742660 [INFO]   Receive SIP Event [nua_i_state] Status 100 Trying [SIP-Agent-GDF]
2020-04-26 21:43:28:742668 [NOTICE] SIP Call State  [received]
2020-04-26 21:43:28:742681 [INFO]   Create Session 0x7f73f0004798 <new> [gdf]
2020-04-26 21:43:28:742689 [INFO]   Remote SDP 0x7f73f0004798 <new>
v=0
o=rtr.cust.mtro.co 8026584194003876396 8341444783553101867 IN IP4 172.16.0.10
s=-
c=IN IP4 172.16.0.10
t=0 0
m=application 9 TCP/MRCPv2 1
a=setup:active
a=connection:new
a=resource:speechrecog
a=cmid:1
m=audio 14517 RTP/AVP 0 8 96
a=rtpmap:0 PCMU/8000
a=rtpmap:8 PCMA/8000
a=rtpmap:96 L16/8000
a=sendonly
a=mid:1

2020-04-26 21:43:28:742787 [NOTICE] Add Session <a86c59fbb5464fbc>
2020-04-26 21:43:28:742809 [INFO]   Receive Offer 0x7f73f0004798 <a86c59fbb5464fbc> [c:1 a:1 v:0]
2020-04-26 21:43:28:742817 [INFO]   Found MRCP Engine [Google-DF-1] for Resource [speechrecog] 0x7f73f0004798 <a86c59fbb5464fbc>
2020-04-26 21:43:28:742887 [INFO]   Add Pending Control Channel <a86c59fbb5464fbc@speechrecog> [1]
2020-04-26 21:43:28:751888 [INFO]   Enable RTP Session 172.16.0.10:5228
2020-04-26 21:43:28:751914 [INFO]   Open RTP Receiver 172.16.0.10:5228 <- 172.16.0.10:14517 playout [50 ms] bounds [0 - 600 ms] adaptive [1] skew detection [1]
2020-04-26 21:43:28:751923 [INFO]   Media Path 0x7f73f0004798 Source->[PCMU/8000/1]->Decoder->[LPCM/8000/1]->Bridge->[LPCM/8000/1]->Sink
2020-04-26 21:43:28:752022 [INFO]   Open <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:28:752142 [NOTICE] GDF Usage: 1/1/3
2020-04-26 21:43:28:752367 [INFO]   Send Answer 0x7f73f0004798 <a86c59fbb5464fbc> [c:1 a:1 v:0] Status OK
2020-04-26 21:43:28:752422 [INFO]   Local SDP 0x7f73f0004798 <a86c59fbb5464fbc>
v=0
o=UniMRCPServer 0 0 IN IP4 172.16.0.10
s=-
c=IN IP4 172.16.0.10
t=0 0
m=application 1544 TCP/MRCPv2 1
a=setup:passive
a=connection:new
a=channel:a86c59fbb5464fbc@speechrecog
a=cmid:1
m=audio 5228 RTP/AVP 0
a=rtpmap:0 PCMU/8000
a=recvonly
a=mid:1

2020-04-26 21:43:28:753339 [NOTICE] Accepted TCP/MRCPv2 Connection 172.16.0.10:1544 <-> 172.16.0.10:32950
2020-04-26 21:43:28:753305 [INFO]   Receive SIP Event [nua_i_state] Status 200 OK [SIP-Agent-GDF]
2020-04-26 21:43:28:753441 [NOTICE] SIP Call State 0x7f73f0004798 [completed]
2020-04-26 21:43:28:753451 [INFO]   Receive SIP Event [nua_i_ack] Status 200 OK [SIP-Agent-GDF]
2020-04-26 21:43:28:753458 [INFO]   Receive SIP Event [nua_i_state] Status 200 OK [SIP-Agent-GDF]
2020-04-26 21:43:28:753463 [NOTICE] SIP Call State 0x7f73f0004798 [ready]
2020-04-26 21:43:28:753468 [INFO]   Receive SIP Event [nua_i_active] Status 200 Call active [SIP-Agent-GDF]
2020-04-26 21:43:28:762424 [INFO]   Receive MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [343 bytes]
MRCP/2.0 343 RECOGNIZE 1
Channel-Identifier: a86c59fbb5464fbc@speechrecog
Content-Type: text/uri-list
Cancel-If-Queue: false
Recognition-Timeout: 15000
Start-Input-Timers: false
No-Input-Timeout: 50000
Vendor-Specific-Parameters: language=en-US
Content-Length: 68

builtin:speech/transcribe?language=en-US;projectid=translator-xpqlcp
2020-04-26 21:43:28:762463 [INFO]   Assign Control Channel <a86c59fbb5464fbc@speechrecog> to Connection 172.16.0.10:1544 <-> 172.16.0.10:32950 [0] -> [1]
2020-04-26 21:43:28:762513 [INFO]   Process RECOGNIZE Request <a86c59fbb5464fbc@speechrecog> [1]
2020-04-26 21:43:28:762600 [INFO]   Init Speech Detector: frame-size=160, max-frame-count=1320, output-frame-count=20, vad-mode=2, noinput-timeout=50000 ms, input-timeout=15000 ms, start-timeout=50 ms, complete-timeout=10000 ms, incomplete-timeout=10000 ms, leading-silence=3000 ms, trailing-silence=3000 ms, interim-results=1, start-of-input=external <a86c59fbb5464fbc>
2020-04-26 21:43:28:762632 [INFO]   Create gRPC Stream <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:28:762917 [INFO]   Process RECOGNIZE Response <a86c59fbb5464fbc@speechrecog> [1]
2020-04-26 21:43:28:762937 [INFO]   State Transition IDLE -> RECOGNIZING <a86c59fbb5464fbc@speechrecog>
2020-04-26 21:43:28:762997 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [83 bytes]
MRCP/2.0 83 1 200 IN-PROGRESS
Channel-Identifier: a86c59fbb5464fbc@speechrecog


2020-04-26 21:43:28:772535 [INFO]   Receive SIP Event [nua_i_invite] Status 100 Trying [SIP-Agent-GDF]
2020-04-26 21:43:28:772549 [INFO]   Receive SIP Event [nua_i_state] Status 100 Trying [SIP-Agent-GDF]
2020-04-26 21:43:28:772555 [NOTICE] SIP Call State  [received]
2020-04-26 21:43:28:772568 [INFO]   Create Session 0x7f73f0008c08 <new> [gdf]
2020-04-26 21:43:28:772575 [INFO]   Remote SDP 0x7f73f0008c08 <new>
v=0
o=rtr.cust.mtro.co 3014491319362233072 3596740157236489695 IN IP4 172.16.0.10
s=-
c=IN IP4 172.16.0.10
t=0 0
m=application 9 TCP/MRCPv2 1
a=setup:active
a=connection:existing
a=resource:speechsynth
a=cmid:1
m=audio 14519 RTP/AVP 0 8 96
a=rtpmap:0 PCMU/8000
a=rtpmap:8 PCMA/8000
a=rtpmap:96 L16/8000
a=recvonly
a=mid:1

2020-04-26 21:43:28:772640 [NOTICE] Add Session <bd4e9c473fb44232>
2020-04-26 21:43:28:772655 [INFO]   Receive Offer 0x7f73f0008c08 <bd4e9c473fb44232> [c:1 a:1 v:0]
2020-04-26 21:43:28:772662 [INFO]   Found MRCP Engine [Google-SS-1] for Resource [speechsynth] 0x7f73f0008c08 <bd4e9c473fb44232>
2020-04-26 21:43:28:772697 [INFO]   Add Pending Control Channel <bd4e9c473fb44232@speechsynth> [1]
2020-04-26 21:43:28:781870 [INFO]   Enable RTP Session 172.16.0.10:5230
2020-04-26 21:43:28:781894 [INFO]   Open RTP Transmitter 172.16.0.10:5230 -> 172.16.0.10:14519
2020-04-26 21:43:28:781901 [INFO]   Media Path 0x7f73f0008c08 Source->[LPCM/8000/1]->Bridge->[LPCM/8000/1]->Encoder->[PCMU/8000/1]->Sink
2020-04-26 21:43:28:781967 [INFO]   Open <bd4e9c473fb44232@gss>
2020-04-26 21:43:28:782080 [NOTICE] GSS Usage: 1/1/3
2020-04-26 21:43:28:782346 [INFO]   Send Answer 0x7f73f0008c08 <bd4e9c473fb44232> [c:1 a:1 v:0] Status OK
2020-04-26 21:43:28:782374 [INFO]   Local SDP 0x7f73f0008c08 <bd4e9c473fb44232>
v=0
o=UniMRCPServer 0 0 IN IP4 172.16.0.10
s=-
c=IN IP4 172.16.0.10
t=0 0
m=application 1544 TCP/MRCPv2 1
a=setup:passive
a=connection:existing
a=channel:bd4e9c473fb44232@speechsynth
a=cmid:1
m=audio 5230 RTP/AVP 0
a=rtpmap:0 PCMU/8000
a=sendonly
a=mid:1

2020-04-26 21:43:28:782998 [INFO]   Receive SIP Event [nua_i_state] Status 200 OK [SIP-Agent-GDF]
2020-04-26 21:43:28:783011 [NOTICE] SIP Call State 0x7f73f0008c08 [completed]
2020-04-26 21:43:28:783281 [INFO]   Receive SIP Event [nua_i_ack] Status 200 OK [SIP-Agent-GDF]
2020-04-26 21:43:28:783295 [INFO]   Receive SIP Event [nua_i_state] Status 200 OK [SIP-Agent-GDF]
2020-04-26 21:43:28:783301 [NOTICE] SIP Call State 0x7f73f0008c08 [ready]
2020-04-26 21:43:28:783306 [INFO]   Receive SIP Event [nua_i_active] Status 200 Call active [SIP-Agent-GDF]
2020-04-26 21:43:28:792042 [INFO]   Receive MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [247 bytes]
MRCP/2.0 247 SPEAK 1
Channel-Identifier: bd4e9c473fb44232@speechsynth
Content-Type: application/ssml+xml
Voice-Name: hi-IN-Wavenet-A
Content-Length: 88

<speak><prosody rate="moderate">Welcome to MetroHomes,Hindi or English</prosody></speak>
2020-04-26 21:43:28:792070 [INFO]   Assign Control Channel <bd4e9c473fb44232@speechsynth> to Connection 172.16.0.10:1544 <-> 172.16.0.10:32950 [0] -> [2]
2020-04-26 21:43:28:792123 [INFO]   Process SPEAK Request <bd4e9c473fb44232@speechsynth> [1]
2020-04-26 21:43:28:792191 [INFO]   Set Voice Name [hi-IN-Wavenet-A] <bd4e9c473fb44232@gss>
2020-04-26 21:43:28:792207 [INFO]   Start Async Synth: encoding=1, sampling-rate=8000, language=hi-IN <bd4e9c473fb44232@gss>
<speak><prosody rate="moderate">Welcome to MetroHomes,Hindi or English</prosody></speak>
2020-04-26 21:43:29:302773 [INFO]   Process SPEAK Response <bd4e9c473fb44232@speechsynth> [1]
2020-04-26 21:43:29:302803 [NOTICE] State Transition IDLE -> SPEAKING <bd4e9c473fb44232@speechsynth>
2020-04-26 21:43:29:302878 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [83 bytes]
MRCP/2.0 83 1 200 IN-PROGRESS
Channel-Identifier: bd4e9c473fb44232@speechsynth


2020-04-26 21:43:29:541857 [INFO]   Speech Detector State Transition NO-INPUT -> IN-PROGRESS [780 ms] <a86c59fbb5464fbc>
2020-04-26 21:43:29:541872 [INFO]   Start Input Timer [15000 ms] <a86c59fbb5464fbc>
2020-04-26 21:43:29:541897 [INFO]   Init Streaming Request: encoding=1, sampling-rate=8000, language=hi, single-utterance=1, word-info=0 <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:32:011896 [NOTICE] Speech Complete <bd4e9c473fb44232@gss>
2020-04-26 21:43:32:012018 [INFO]   Process SPEAK-COMPLETE Event <bd4e9c473fb44232@speechsynth> [1]
2020-04-26 21:43:32:012040 [NOTICE] State Transition SPEAKING -> IDLE <bd4e9c473fb44232@speechsynth>
2020-04-26 21:43:32:012103 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [122 bytes]
MRCP/2.0 122 SPEAK-COMPLETE 1 COMPLETE
Channel-Identifier: bd4e9c473fb44232@speechsynth
Completion-Cause: 000 normal


2020-04-26 21:43:32:133655 [INFO]   Receive SIP Event [nua_i_bye] Status 200 Session Terminated [SIP-Agent-GDF]
2020-04-26 21:43:32:133775 [INFO]   Receive SIP Event [nua_i_state] Status 200 Session Terminated [SIP-Agent-GDF]
2020-04-26 21:43:32:133790 [NOTICE] SIP Call State 0x7f73f0008c08 [terminated]
2020-04-26 21:43:32:133817 [INFO]   Receive SIP Event [nua_i_terminated] Status 200 Session Terminated [SIP-Agent-GDF]
2020-04-26 21:43:32:133833 [INFO]   Deactivate Session 0x7f73f0008c08 <bd4e9c473fb44232>
2020-04-26 21:43:32:133912 [INFO]   Terminate Session 0x7f73f0008c08 <bd4e9c473fb44232>
2020-04-26 21:43:32:134005 [INFO]   Close <bd4e9c473fb44232@gss>
2020-04-26 21:43:32:134008 [INFO]   Remove Control Channel <bd4e9c473fb44232@speechsynth> [1]
2020-04-26 21:43:32:134065 [NOTICE] GSS Usage: 0/1/3
2020-04-26 21:43:32:141961 [INFO]   Close RTP Transmitter 172.16.0.10:5230 -> 172.16.0.10:14519 [s:135 o:21600]
2020-04-26 21:43:32:142084 [INFO]   Remove RTP Session 172.16.0.10:5230
2020-04-26 21:43:32:142432 [INFO]   Receive MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [86 bytes]
MRCP/2.0 86 START-INPUT-TIMERS 2
Channel-Identifier: a86c59fbb5464fbc@speechrecog


2020-04-26 21:43:32:142663 [NOTICE] Remove Session <bd4e9c473fb44232>
2020-04-26 21:43:32:142702 [INFO]   Session Terminated 0x7f73f0008c08 <bd4e9c473fb44232>
2020-04-26 21:43:32:142743 [NOTICE] Destroy Session <bd4e9c473fb44232>
2020-04-26 21:43:32:142764 [INFO]   Process START-INPUT-TIMERS Request <a86c59fbb5464fbc@speechrecog> [2]
2020-04-26 21:43:32:142808 [INFO]   Start No-Input Timer [50000 ms] <a86c59fbb5464fbc>
2020-04-26 21:43:32:142834 [INFO]   Process START-INPUT-TIMERS Response <a86c59fbb5464fbc@speechrecog> [2]
2020-04-26 21:43:32:142881 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [80 bytes]
MRCP/2.0 80 2 200 COMPLETE
Channel-Identifier: a86c59fbb5464fbc@speechrecog


2020-04-26 21:43:33:956267 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:33:956295 [INFO]   Recognition Result: transcript [हिंदी] confidence [0.00] final [0] end-offset [7:10] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:33:956299 [INFO]   Set Result Flag [10000 ms] <a86c59fbb5464fbc>
2020-04-26 21:43:33:956325 [INFO]   Process START-OF-INPUT Event <a86c59fbb5464fbc@speechrecog> [1]
2020-04-26 21:43:33:956349 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [115 bytes]
MRCP/2.0 115 START-OF-INPUT 1 IN-PROGRESS
Channel-Identifier: a86c59fbb5464fbc@speechrecog
Input-Type: speech


2020-04-26 21:43:34:362279 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:34:362328 [INFO]   Recognition Result: transcript [हिंदी] confidence [0.00] final [0] end-offset [7:580] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:34:954065 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:34:954116 [INFO]   Recognition Result: end-of-utterance <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:34:956994 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:34:957036 [INFO]   Recognition Result: transcript [हिंदी] confidence [0.96] final [1] end-offset [8:30] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:35:805795 [INFO]   Received Response: status [1] id [7e517a3b-75aa-4de5-a8c2-e6ba58fb96f9-eec93b43] recog result [0] query result [1] webhook status [1] output audio [0 bytes] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:35:806104 [INFO]   Query Result: {
 "queryText": "हिंदी",
 "action": "input.unknown",
 "parameters": {},
 "allRequiredParamsPresent": true,
 "fulfillmentText": "What was that?",
 "fulfillmentMessages": [
  {
   "text": {
    "text": [
     "What was that?"
    ]
   }
  }
 ],
 "outputContexts": [
  {
   "name": "projects/translator-xpqlcp/agent/sessions/a86c59fbb5464fbc/contexts/__system_counters__",
   "lifespanCount": 1,
   "parameters": {
    "no-input": 0,
    "no-match": 1
   }
  }
 ],
 "intent": {
  "name": "projects/translator-xpqlcp/agent/intents/0897899d-32c8-4426-b35e-a12780ca0630",
  "displayName": "Default Fallback Intent",
  "isFallback": true
 },
 "intentDetectionConfidence": 1,
 "languageCode": "en"
}
 <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:35:806119 [INFO]   Webhook Status: code [0] message [] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:35:806228 [INFO]   Received Response: status [0] id [7e517a3b-75aa-4de5-a8c2-e6ba58fb96f9-eec93b43] recog result [0] query result [1] webhook status [1] output audio [0 bytes] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:35:806430 [INFO]   Query Result: {
 "queryText": "हिंदी",
 "action": "input.unknown",
 "parameters": {},
 "allRequiredParamsPresent": true,
 "fulfillmentText": "What was that?",
 "fulfillmentMessages": [
  {
   "text": {
    "text": [
     "What was that?"
    ]
   }
  }
 ],
 "outputContexts": [
  {
   "name": "projects/translator-xpqlcp/agent/sessions/a86c59fbb5464fbc/contexts/__system_counters__",
   "lifespanCount": 1,
   "parameters": {
    "no-input": 0,
    "no-match": 1
   }
  }
 ],
 "intent": {
  "name": "projects/translator-xpqlcp/agent/intents/0897899d-32c8-4426-b35e-a12780ca0630",
  "displayName": "Default Fallback Intent",
  "isFallback": true
 },
 "intentDetectionConfidence": 1,
 "languageCode": "en"
}
 <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:35:806460 [INFO]   Webhook Status: code [0] message [] <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:35:811914 [INFO]   Input Complete [stopped] size=146240 bytes, dur=9320 ms <a86c59fbb5464fbc@gdf>
2020-04-26 21:43:35:812095 [INFO]   Process RECOGNITION-COMPLETE Event <a86c59fbb5464fbc@speechrecog> [1]
2020-04-26 21:43:35:812118 [INFO]   State Transition RECOGNIZING -> RECOGNIZED <a86c59fbb5464fbc@speechrecog>
2020-04-26 21:43:35:812188 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [1022 bytes]
MRCP/2.0 1189 RECOGNITION-COMPLETE 1 COMPLETE
Channel-Identifier: a86c59fbb5464fbc@speechrecog
Completion-Cause: 000 success
Content-Type: application/x-nlsml
Content-Length: 1002

<?xml version="1.0"?><result><interpretation grammar="builtin:speech/transcribe" confidence="1"><instance><query_text>हिंदी</query_text><action>input.unknown</action><parameters></parameters><all_required_params_present>true</all_required_params_present><fulfillment_text>What was that?</fulfillment_text><fulfillment_messages><text><text>What was that?</text></text></fulfillment_messages><output_contexts><name>projects/translator-xpqlcp/agent/sessions/a86c59fbb5464fbc/contexts/__system_counters__</name><lifespan_count>1</lifespan_count><parameters><no_input>0</no_input><no_match>1</no_match></parameters></output_contexts><intent><name>projects/translator-xpqlcp/agent/intents/0897899d-32c8-4426-b35e-a12780ca0630</name><display_name>Default Fallback Intent</display_name><is_fallback>true</is_fallback></intent><intent
2020-04-26 21:43:35:812319 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:32950 [167 bytes]
_detection_confidence>1</intent_detection_confidence><language_code>en</language_code></instance><input mode="speech">हिंदी</input></interpretation></result>
2020-04-26 21:43:35:862025 [INFO]   Receive SIP Event [nua_i_invite] Status 100 Trying [SIP-Agent-GDF]
2020-04-26 21:43:35:862079 [INFO]   Receive SIP Event [nua_i_state] Status 100 Trying [SIP-Agent-GDF]

Here is FreeSWITCH execute command: 
play_and_detect_speech(say:<speak><prosody rate="moderate">Welcome to MetroHomes,Hindi or English</prosody></speak> detect:unimrcp:google-mrcp-v2 {language=en-US,start-input-timers=false,no-input-timeout=50000,recognition-timeout=15000}builtin:speech/transcribe?language=en-US;projectid=translator-xpqlcp)

I suppose that since the Streaming request has the language "hi" which is set as default in umsgdf.conf we get a response in "hi-IN" but the required is "en-US" as you can see from the parameters set in execute command (i.e. language=en-US),
2020-04-26 21:43:29:541897 [INFO]   Init Streaming Request: encoding=1, sampling-rate=8000, language=hi, single-utterance=1, word-info=0 <a86c59fbb5464fbc@gdf>

Please guide us.

Thank you so much for your help
Anchit Dave

Doug Rylaarsdam

unread,
Apr 27, 2020, 12:25:12 AM4/27/20
to UniMRCP
Hi Anchit,

Thanks for posting the server logs. I haven't been able to reproduce, but I'm using a slightly different setup (GSR instead of GDF, and an SRGS grammar instead of the builtin).

One thing to check, on the construction of the builtin uri:

builtin:speech/transcribe?language=en-US;projectid=translator-xpqlcp

Can you try this form? 

builtin:speech/transcribe?language=en-US&projectid=translator-xpqlcp

Anchit Dave

unread,
Apr 27, 2020, 1:38:34 AM4/27/20
to UniMRCP
Hi Doug,

Thank you for your response. I tried by changing the URI but no luck.
After replacing ';' with '&', The speech won't get detected in any language. If I bring it back to the earlier form i.e. with ";" it detects but not in "en-US".

2020-04-27 05:33:21:006490 [INFO]   Receive SIP Event [nua_i_invite] Status 100 Trying [SIP-Agent-GDF]
2020-04-27 05:33:21:006544 [INFO]   Receive SIP Event [nua_i_state] Status 100 Trying [SIP-Agent-GDF]
2020-04-27 05:33:21:006559 [NOTICE] SIP Call State  [received]
2020-04-27 05:33:21:006583 [INFO]   Create Session 0x7f73f0004798 <new> [gdf]
2020-04-27 05:33:21:006597 [INFO]   Remote SDP 0x7f73f0004798 <new>
v=0
o=rtr.cust.mtro.co 9207152300165791324 2196405641786193050 IN IP4 172.16.0.10
s=-
c=IN IP4 172.16.0.10
t=0 0
m=application 9 TCP/MRCPv2 1
a=setup:active
a=connection:new
a=resource:speechrecog
a=cmid:1
m=audio 14589 RTP/AVP 0 8 96
a=rtpmap:0 PCMU/8000
a=rtpmap:8 PCMA/8000
a=rtpmap:96 L16/8000
a=sendonly
a=mid:1

2020-04-27 05:33:21:006765 [NOTICE] Add Session <57ead346fd034b34>
2020-04-27 05:33:21:006806 [INFO]   Receive Offer 0x7f73f0004798 <57ead346fd034b34> [c:1 a:1 v:0]
2020-04-27 05:33:21:006822 [INFO]   Found MRCP Engine [Google-DF-1] for Resource [speechrecog] 0x7f73f0004798 <57ead346fd034b34>
2020-04-27 05:33:21:006935 [INFO]   Add Pending Control Channel <57ead346fd034b34@speechrecog> [1]
2020-04-27 05:33:21:011890 [INFO]   Enable RTP Session 172.16.0.10:5330
2020-04-27 05:33:21:011931 [INFO]   Open RTP Receiver 172.16.0.10:5330 <- 172.16.0.10:14589 playout [50 ms] bounds [0 - 600 ms] adaptive [1] skew detection [1]
2020-04-27 05:33:21:011948 [INFO]   Media Path 0x7f73f0004798 Source->[PCMU/8000/1]->Decoder->[LPCM/8000/1]->Bridge->[LPCM/8000/1]->Sink
2020-04-27 05:33:21:012052 [INFO]   Open <57ead346fd034b34@gdf>
2020-04-27 05:33:21:012276 [NOTICE] GDF Usage: 1/1/3
2020-04-27 05:33:21:012685 [INFO]   Send Answer 0x7f73f0004798 <57ead346fd034b34> [c:1 a:1 v:0] Status OK
2020-04-27 05:33:21:012726 [INFO]   Local SDP 0x7f73f0004798 <57ead346fd034b34>
v=0
o=UniMRCPServer 0 0 IN IP4 172.16.0.10
s=-
c=IN IP4 172.16.0.10
t=0 0
m=application 1544 TCP/MRCPv2 1
a=setup:passive
a=connection:new
a=channel:57ead346fd034b34@speechrecog
a=cmid:1
m=audio 5330 RTP/AVP 0
a=rtpmap:0 PCMU/8000
a=recvonly
a=mid:1

2020-04-27 05:33:21:014370 [NOTICE] Accepted TCP/MRCPv2 Connection 172.16.0.10:1544 <-> 172.16.0.10:36792
2020-04-27 05:33:21:014554 [INFO]   Receive SIP Event [nua_i_state] Status 200 OK [SIP-Agent-GDF]
2020-04-27 05:33:21:014576 [NOTICE] SIP Call State 0x7f73f0004798 [completed]
2020-04-27 05:33:21:014586 [INFO]   Receive SIP Event [nua_i_ack] Status 200 OK [SIP-Agent-GDF]
2020-04-27 05:33:21:014600 [INFO]   Receive SIP Event [nua_i_state] Status 200 OK [SIP-Agent-GDF]
2020-04-27 05:33:21:014609 [NOTICE] SIP Call State 0x7f73f0004798 [ready]
2020-04-27 05:33:21:014618 [INFO]   Receive SIP Event [nua_i_active] Status 200 Call active [SIP-Agent-GDF]
2020-04-27 05:33:21:016172 [INFO]   Receive MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:36792 [343 bytes]
MRCP/2.0 343 RECOGNIZE 1
Channel-Identifier: 57ead346fd034b34@speechrecog
Content-Type: text/uri-list
Cancel-If-Queue: false
Recognition-Timeout: 15000
Start-Input-Timers: false
No-Input-Timeout: 50000
Vendor-Specific-Parameters: language=en-US
Content-Length: 68

builtin:speech/transcribe?language=en-US&projectid=translator-xpqlcp
2020-04-27 05:33:21:016238 [INFO]   Assign Control Channel <57ead346fd034b34@speechrecog> to Connection 172.16.0.10:1544 <-> 172.16.0.10:36792 [0] -> [1]
2020-04-27 05:33:21:016281 [INFO]   Process RECOGNIZE Request <57ead346fd034b34@speechrecog> [1]
2020-04-27 05:33:21:016349 [INFO]   Init Speech Detector: frame-size=160, max-frame-count=1320, output-frame-count=20, vad-mode=2, noinput-timeout=50000 ms, input-timeout=15000 ms, start-timeout=50 ms, complete-timeout=10000 ms, incomplete-timeout=10000 ms, leading-silence=3000 ms, trailing-silence=3000 ms, interim-results=1, start-of-input=external <57ead346fd034b34>
2020-04-27 05:33:21:016447 [INFO]   Create gRPC Stream <57ead346fd034b34@gdf>
2020-04-27 05:33:21:016903 [INFO]   Process RECOGNIZE Response <57ead346fd034b34@speechrecog> [1]
2020-04-27 05:33:21:016928 [INFO]   State Transition IDLE -> RECOGNIZING <57ead346fd034b34@speechrecog>
2020-04-27 05:33:21:017008 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:36792 [83 bytes]
MRCP/2.0 83 1 200 IN-PROGRESS
Channel-Identifier: 57ead346fd034b34@speechrecog


2020-04-27 05:33:21:026446 [INFO]   Receive SIP Event [nua_i_invite] Status 100 Trying [SIP-Agent-GDF]
2020-04-27 05:33:21:026493 [INFO]   Receive SIP Event [nua_i_state] Status 100 Trying [SIP-Agent-GDF]
2020-04-27 05:33:21:026507 [NOTICE] SIP Call State  [received]
2020-04-27 05:33:21:026531 [INFO]   Create Session 0x7f73f0008c08 <new> [gdf]
2020-04-27 05:33:21:026545 [INFO]   Remote SDP 0x7f73f0008c08 <new>
v=0
o=rtr.cust.mtro.co 4294408801613369614 1534749027568805804 IN IP4 172.16.0.10
s=-
c=IN IP4 172.16.0.10
t=0 0
m=application 9 TCP/MRCPv2 1
a=setup:active
a=connection:existing
a=resource:speechsynth
a=cmid:1
m=audio 14591 RTP/AVP 0 8 96
a=rtpmap:0 PCMU/8000
a=rtpmap:8 PCMA/8000
a=rtpmap:96 L16/8000
a=recvonly
a=mid:1

2020-04-27 05:33:21:026692 [NOTICE] Add Session <319384d7968946bc>
2020-04-27 05:33:21:026722 [INFO]   Receive Offer 0x7f73f0008c08 <319384d7968946bc> [c:1 a:1 v:0]
2020-04-27 05:33:21:026736 [INFO]   Found MRCP Engine [Google-SS-1] for Resource [speechsynth] 0x7f73f0008c08 <319384d7968946bc>
2020-04-27 05:33:21:026791 [INFO]   Add Pending Control Channel <319384d7968946bc@speechsynth> [1]
2020-04-27 05:33:21:031873 [INFO]   Enable RTP Session 172.16.0.10:5332
2020-04-27 05:33:21:031897 [INFO]   Open RTP Transmitter 172.16.0.10:5332 -> 172.16.0.10:14591
2020-04-27 05:33:21:031907 [INFO]   Media Path 0x7f73f0008c08 Source->[LPCM/8000/1]->Bridge->[LPCM/8000/1]->Encoder->[PCMU/8000/1]->Sink
2020-04-27 05:33:21:031951 [INFO]   Open <319384d7968946bc@gss>
2020-04-27 05:33:21:032074 [NOTICE] GSS Usage: 1/1/3
2020-04-27 05:33:21:032340 [INFO]   Send Answer 0x7f73f0008c08 <319384d7968946bc> [c:1 a:1 v:0] Status OK
2020-04-27 05:33:21:032377 [INFO]   Local SDP 0x7f73f0008c08 <319384d7968946bc>
v=0
o=UniMRCPServer 0 0 IN IP4 172.16.0.10
s=-
c=IN IP4 172.16.0.10
t=0 0
m=application 1544 TCP/MRCPv2 1
a=setup:passive
a=connection:existing
a=channel:319384d7968946bc@speechsynth
a=cmid:1
m=audio 5332 RTP/AVP 0
a=rtpmap:0 PCMU/8000
a=sendonly
a=mid:1

2020-04-27 05:33:21:033238 [INFO]   Receive SIP Event [nua_i_state] Status 200 OK [SIP-Agent-GDF]
2020-04-27 05:33:21:033269 [NOTICE] SIP Call State 0x7f73f0008c08 [completed]
2020-04-27 05:33:21:033482 [INFO]   Receive SIP Event [nua_i_ack] Status 200 OK [SIP-Agent-GDF]
2020-04-27 05:33:21:033553 [INFO]   Receive SIP Event [nua_i_state] Status 200 OK [SIP-Agent-GDF]
2020-04-27 05:33:21:033574 [NOTICE] SIP Call State 0x7f73f0008c08 [ready]
2020-04-27 05:33:21:033581 [INFO]   Receive SIP Event [nua_i_active] Status 200 Call active [SIP-Agent-GDF]
2020-04-27 05:33:21:035144 [INFO]   Receive MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:36792 [247 bytes]
MRCP/2.0 247 SPEAK 1
Channel-Identifier: 319384d7968946bc@speechsynth
Content-Type: application/ssml+xml
Voice-Name: hi-IN-Wavenet-A
Content-Length: 88

<speak><prosody rate="moderate">Welcome to MetroHomes,Hindi or English</prosody></speak>
2020-04-27 05:33:21:035171 [INFO]   Assign Control Channel <319384d7968946bc@speechsynth> to Connection 172.16.0.10:1544 <-> 172.16.0.10:36792 [0] -> [2]
2020-04-27 05:33:21:035221 [INFO]   Process SPEAK Request <319384d7968946bc@speechsynth> [1]
2020-04-27 05:33:21:035270 [INFO]   Set Voice Name [hi-IN-Wavenet-A] <319384d7968946bc@gss>
2020-04-27 05:33:21:035283 [INFO]   Start Async Synth: encoding=1, sampling-rate=8000, language=hi-IN <319384d7968946bc@gss>
<speak><prosody rate="moderate">Welcome to MetroHomes,Hindi or English</prosody></speak>
2020-04-27 05:33:21:870827 [INFO]   Process SPEAK Response <319384d7968946bc@speechsynth> [1]
2020-04-27 05:33:21:870853 [NOTICE] State Transition IDLE -> SPEAKING <319384d7968946bc@speechsynth>
2020-04-27 05:33:21:870913 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:36792 [83 bytes]
MRCP/2.0 83 1 200 IN-PROGRESS
Channel-Identifier: 319384d7968946bc@speechsynth


2020-04-27 05:33:23:301914 [INFO]   Speech Detector State Transition NO-INPUT -> IN-PROGRESS [2290 ms] <57ead346fd034b34>
2020-04-27 05:33:23:301965 [INFO]   Start Input Timer [15000 ms] <57ead346fd034b34>
2020-04-27 05:33:23:302063 [INFO]   Init Streaming Request: encoding=1, sampling-rate=8000, language=hi, single-utterance=1, word-info=0 <57ead346fd034b34@gdf>
2020-04-27 05:33:24:571855 [NOTICE] Speech Complete <319384d7968946bc@gss>
2020-04-27 05:33:24:571955 [INFO]   Process SPEAK-COMPLETE Event <319384d7968946bc@speechsynth> [1]
2020-04-27 05:33:24:571965 [NOTICE] State Transition SPEAKING -> IDLE <319384d7968946bc@speechsynth>
2020-04-27 05:33:24:572009 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:36792 [122 bytes]
MRCP/2.0 122 SPEAK-COMPLETE 1 COMPLETE
Channel-Identifier: 319384d7968946bc@speechsynth
Completion-Cause: 000 normal


2020-04-27 05:33:24:692545 [INFO]   Receive SIP Event [nua_i_bye] Status 200 Session Terminated [SIP-Agent-GDF]
2020-04-27 05:33:24:692581 [INFO]   Receive SIP Event [nua_i_state] Status 200 Session Terminated [SIP-Agent-GDF]
2020-04-27 05:33:24:692584 [NOTICE] SIP Call State 0x7f73f0008c08 [terminated]
2020-04-27 05:33:24:692592 [INFO]   Receive SIP Event [nua_i_terminated] Status 200 Session Terminated [SIP-Agent-GDF]
2020-04-27 05:33:24:692603 [INFO]   Deactivate Session 0x7f73f0008c08 <319384d7968946bc>
2020-04-27 05:33:24:692612 [INFO]   Terminate Session 0x7f73f0008c08 <319384d7968946bc>
2020-04-27 05:33:24:692629 [INFO]   Close <319384d7968946bc@gss>
2020-04-27 05:33:24:692633 [NOTICE] GSS Usage: 0/1/3
2020-04-27 05:33:24:692655 [INFO]   Remove Control Channel <319384d7968946bc@speechsynth> [1]
2020-04-27 05:33:24:694832 [INFO]   Receive MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:36792 [86 bytes]
MRCP/2.0 86 START-INPUT-TIMERS 2
Channel-Identifier: 57ead346fd034b34@speechrecog


2020-04-27 05:33:24:694857 [INFO]   Process START-INPUT-TIMERS Request <57ead346fd034b34@speechrecog> [2]
2020-04-27 05:33:24:694900 [INFO]   Start No-Input Timer [50000 ms] <57ead346fd034b34>
2020-04-27 05:33:24:694913 [INFO]   Process START-INPUT-TIMERS Response <57ead346fd034b34@speechrecog> [2]
2020-04-27 05:33:24:694925 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:36792 [80 bytes]
MRCP/2.0 80 2 200 COMPLETE
Channel-Identifier: 57ead346fd034b34@speechrecog


2020-04-27 05:33:24:701814 [INFO]   Close RTP Transmitter 172.16.0.10:5332 -> 172.16.0.10:14591 [s:135 o:21600]
2020-04-27 05:33:24:701831 [INFO]   Remove RTP Session 172.16.0.10:5332
2020-04-27 05:33:24:702031 [NOTICE] Remove Session <319384d7968946bc>
2020-04-27 05:33:24:702040 [INFO]   Session Terminated 0x7f73f0008c08 <319384d7968946bc>
2020-04-27 05:33:24:702054 [NOTICE] Destroy Session <319384d7968946bc>
2020-04-27 05:33:27:019379 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:27:019462 [INFO]   Recognition Result: transcript [इन] confidence [0.00] final [0] end-offset [6:360] <57ead346fd034b34@gdf>
2020-04-27 05:33:27:019478 [INFO]   Set Result Flag [10000 ms] <57ead346fd034b34>
2020-04-27 05:33:27:019526 [INFO]   Process START-OF-INPUT Event <57ead346fd034b34@speechrecog> [1]
2020-04-27 05:33:27:019652 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:36792 [115 bytes]
MRCP/2.0 115 START-OF-INPUT 1 IN-PROGRESS
Channel-Identifier: 57ead346fd034b34@speechrecog
Input-Type: speech


2020-04-27 05:33:27:023212 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:27:023238 [INFO]   Recognition Result: transcript [हिंदी] confidence [0.00] final [0] end-offset [6:450] <57ead346fd034b34@gdf>
2020-04-27 05:33:27:620179 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:27:620209 [INFO]   Recognition Result: transcript [हिंदी] confidence [0.00] final [0] end-offset [7:20] <57ead346fd034b34@gdf>
2020-04-27 05:33:28:413519 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:28:413572 [INFO]   Recognition Result: end-of-utterance <57ead346fd034b34@gdf>
2020-04-27 05:33:28:416730 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:28:416776 [INFO]   Recognition Result: transcript [हिंदी] confidence [0.96] final [1] end-offset [7:710] <57ead346fd034b34@gdf>
2020-04-27 05:33:28:688673 [INFO]   Received Response: status [0] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:28:688738 [INFO]   Recognition Result: transcript [हिंदी] confidence [0.96] final [1] end-offset [7:710] <57ead346fd034b34@gdf>
2020-04-27 05:33:28:688987 [WARN]   gRPC Status: com.google.apps.framework.request.NotFoundException: No DesignTimeAgent found for project 'voicebots-264410'. <57ead346fd034b34@gdf>
2020-04-27 05:33:28:689078 [INFO]   Create gRPC Stream <57ead346fd034b34@gdf>
2020-04-27 05:33:28:881854 [INFO]   Init Streaming Request: encoding=1, sampling-rate=8000, language=hi, single-utterance=1, word-info=0 <57ead346fd034b34@gdf>
2020-04-27 05:33:38:826974 [INFO]   Received Response: status [1] id [] recog result [1] query result [0] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:38:827030 [INFO]   Recognition Result: end-of-utterance <57ead346fd034b34@gdf>
2020-04-27 05:33:38:890935 [INFO]   Received Response: status [1] id [] recog result [0] query result [1] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:38:891059 [INFO]   Query Result: {
 "languageCode": "hi"
}
 <57ead346fd034b34@gdf>
2020-04-27 05:33:38:891715 [INFO]   Received Response: status [0] id [] recog result [0] query result [1] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:38:891816 [INFO]   Query Result: {
 "languageCode": "hi"
}
 <57ead346fd034b34@gdf>
2020-04-27 05:33:38:892048 [INFO]   Create gRPC Stream <57ead346fd034b34@gdf>
2020-04-27 05:33:39:081996 [INFO]   Init Streaming Request: encoding=1, sampling-rate=8000, language=hi, single-utterance=1, word-info=0 <57ead346fd034b34@gdf>
2020-04-27 05:33:42:011939 [INFO]   Speech Detector State Transition IN-PROGRESS -> COMPLETE [18710 ms] <57ead346fd034b34>
2020-04-27 05:33:42:011953 [INFO]   Detector Stats: leading-silence=2160 ms, input=16060 ms, trailing-silence=10000 ms <57ead346fd034b34>
2020-04-27 05:33:42:012069 [INFO]   Input Complete [timedout] size=348320 bytes, dur=21770 ms <57ead346fd034b34@gdf>
2020-04-27 05:33:42:144102 [INFO]   Received Response: status [1] id [] recog result [0] query result [1] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:42:144234 [INFO]   Query Result: {
 "languageCode": "hi"
}
 <57ead346fd034b34@gdf>
2020-04-27 05:33:42:144549 [INFO]   Received Response: status [0] id [] recog result [0] query result [1] webhook status [0] output audio [0 bytes] <57ead346fd034b34@gdf>
2020-04-27 05:33:42:144652 [INFO]   Query Result: {
 "languageCode": "hi"
}
 <57ead346fd034b34@gdf>
2020-04-27 05:33:42:144845 [INFO]   Process RECOGNITION-COMPLETE Event <57ead346fd034b34@speechrecog> [1]
2020-04-27 05:33:42:144886 [INFO]   State Transition RECOGNIZING -> RECOGNIZED <57ead346fd034b34@speechrecog>
2020-04-27 05:33:42:144999 [INFO]   Send MRCPv2 Data 172.16.0.10:1544 <-> 172.16.0.10:36792 [390 bytes]
MRCP/2.0 390 RECOGNITION-COMPLETE 1 COMPLETE
Channel-Identifier: 57ead346fd034b34@speechrecog
Completion-Cause: 000 success
Content-Type: application/x-nlsml
Content-Length: 205

<?xml version="1.0"?><result><interpretation grammar="builtin:speech/transcribe" confidence="0"><instance><language_code>hi</language_code></instance><input mode="speech"></input></interpretation></result>
2020-04-27 05:33:42:177683 [INFO]   Receive SIP Event [nua_i_invite] Status 100 Trying [SIP-Agent-GDF]
2020-04-27 05:33:42:177739 [INFO]   Receive SIP Event [nua_i_state] Status 100 Trying [SIP-Agent-GDF]

It does transcribe but in "hi-IN" and not in "en-US". as you can see from this part of the log:
2020-04-27 05:33:28:688738 [INFO]   Recognition Result: transcript [हिंदी] confidence [0.96] final [1] end-offset [7:710] <57ead346fd034b34@gdf>

After adding '&', the bot which had the language as "hi-IN also stopped working".


On Thursday, April 23, 2020 at 12:16:10 PM UTC+5:30, Anchit Dave wrote:

Doug Rylaarsdam

unread,
Apr 29, 2020, 12:22:04 AM4/29/20
to UniMRCP
Hi Anchit, Thanks for trying although it probably wasn't a good suggestion since I was thinking of the format for builtin:dtmf/digits?minlength=2&maxlength=6.

Arsen Chaloyan

unread,
May 2, 2020, 3:17:44 PM5/2/20
to UniMRCP
In order to specify a language per recognition request, you may use either of the two methods documented in Section 4.3.


The use of

builtin:speech/transcribe?language=en-US&projectid=translator-xpqlcp

is logical and consistent to GSR, but is not available for GDF, which is mostly an oversight. On the other hand, this method has never been declared available for GDF.

Anyway, the most straightforward way would be using the standard MRCP header field Speech-Language.

As a side note, parameters passed to a built-in grammar are supposed to be separated by ';' character but not '&'.

--
You received this message because you are subscribed to the Google Groups "UniMRCP" group.
To unsubscribe from this group and stop receiving emails from it, send an email to unimrcp+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/unimrcp/da900013-ee9b-45b2-88c7-f06730f6dbc1%40googlegroups.com.


--
Arsen Chaloyan
Author of UniMRCP
http://www.unimrcp.org

Anchit Dave

unread,
May 11, 2020, 6:16:52 AM5/11/20
to UniMRCP
Hello Arsen, 

Thank you for your assistance. But can you help me by letting me know how should I go about and use standard MRCP header field Speech-Language from FreeSWITCH.
A simple example would be just enough

Thank you


On Thursday, April 23, 2020 at 12:16:10 PM UTC+5:30, Anchit Dave wrote:

Arsen Chaloyan

unread,
May 11, 2020, 10:08:30 PM5/11/20
to UniMRCP
Hello Anchit,

You may try to specify the speech language as follows given your example.

session.execute("play_and_detect_speech","say:<speak><prosody rate="moderate">Welcome to MetroHomes,Hindi or English</prosody></speak> detect:unimrcp:google-mrcp-v2 {speech-language=en-US,start-input-timers=false,no-input-timeout=50000,recognition-timeout=15000}builtin:speech/transcribe?projectid=translator-xpqlcp")


--
You received this message because you are subscribed to the Google Groups "UniMRCP" group.
To unsubscribe from this group and stop receiving emails from it, send an email to unimrcp+u...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages