Thanks Arsen,
We are taking your advice and am looking to make it work with LexV2 and have been looking at the x-amz-lex parameters. It's not clear whether these settings however manages to get to the Speech-To-Text element via Lex or if these are only effective within the Lex bot itself post Speech-to-Text (which renders some of these settings effectively useless).
In order to help quantify this as well as to validate a production deployment architecture, we attempted to put in place a PoC system to allow us to try out the x-amz-lex settings.
For our deployment needs against the Cisco CVP 11.6 IVR platform, we have had to get internet access (i.e. access to AWS from Unimrcp) via a proxy server. However we encountered a strange issue with the UMS LexV2 plugin where the code appears to still attempt to use a direct connection? I didn't think it was at first because the disconnected message is logged within ms of the create connection message but my firewall guys said they saw some https traffic that wasn't routed via the proxy (hence in their denied access logs). I did do a pcap trace and noted that the disconnect wasn't from any SIP messaging from the VVB side either and replicated the test with the same servers but without needing or using a proxy (in which case instead of a Disconnected message, I got a Connected message instead and recognition etc continues as expected).
2022-01-25 13:42:49:578017 [INFO] Start Conversation botId [F2Z6WALK24] aliasId [IJP9KZXNDD] locale [en-GB] <43fb12e5736644e0@lex>
2022-01-25 13:42:49:578278 [INFO] Process RECOGNIZE Response <43fb12e5736644e0@speechrecog> [103]
2022-01-25 13:42:49:578286 [INFO] State Transition IDLE -> RECOGNIZING <43fb12e5736644e0@speechrecog>
2022-01-25 13:42:49:578307 [INFO] Send MRCPv2 Data
172.21.101.14:1544 <->
172.21.102.168:59662 [85 bytes]
MRCP/2.0 85 103 200 IN-PROGRESS^M
Channel-Identifier: 43fb12e5736644e0@speechrecog^M
^M
2022-01-25 13:42:49:578371 [INFO] Create HTTP/2 connection [https://runtime-v2-lex.eu-west-2.amazonaws.com:443] <43fb12e5736644e0>
2022-01-25 13:42:49:589515 [INFO] Disconnected <43fb12e5736644e0>2022-01-25 13:42:49:589535 [INFO] Delete H2 session <43fb12e5736644e0>
2022-01-25 13:42:49:589587 [DEBUG] Stop Input <43fb12e5736644e0@lex>
Am I reading/interpreting that log entry correctly? I have doubled checked the start of that unimrcpserver log file to confirm that we are using and setting the proxy details (which were fine with umspolly as that manages to send and receive back the Polly TTS audio files).
2022-01-25 13:29:19:240863 [DEBUG] Load Streaming Recognition Attribute: proxy-scheme = http
2022-01-25 13:29:19:240871 [DEBUG] Load Streaming Recognition Attribute: proxy-port = 8080
2022-01-25 13:29:19:240874 [DEBUG] Load Streaming Recognition Attribute: proxy-host = 86.54.150.60
2022-01-25 13:29:19:240878 [DEBUG] Load Streaming Recognition Attribute: alias = dummy
2022-01-25 13:29:19:240881 [DEBUG] Load Streaming Recognition Attribute: bot-name = dummy
2022-01-25 13:29:19:240884 [DEBUG] Load Streaming Recognition Attribute: region = eu-west-2
2022-01-25 13:29:19:240887 [DEBUG] Load Streaming Recognition Attribute: generate-output-audio = false
2022-01-25 13:29:19:240891 [DEBUG] Load Streaming Recognition Attribute: grammar-param-separator = ,
2022-01-25 13:29:19:240894 [DEBUG] Load Streaming Recognition Attribute: transcription-grammar = transcribe
2022-01-25 13:29:19:240897 [DEBUG] Load Streaming Recognition Attribute: skip-empty-results = true
2022-01-25 13:29:19:240900 [DEBUG] Load Streaming Recognition Attribute: skip-unsupported-grammars = true
2022-01-25 13:29:19:240903 [DEBUG] Load Streaming Recognition Attribute: language = en_GB
2022-01-25 13:29:19:240906 [DEBUG] Load Streaming Recognition Attribute: start-of-input = service-originated
Kind regards,
Daniel