Thank you very much for the reply Michael.
We would probably fine to have the two channels with separate transcriptions, especially if we could use timestamps to show how they align.
Our Asterisk based system already as call recording, but we had thought that UniMRCP's integration with AWS or Google would allow real-time transcription of those recordings (ie while a person is still talking).
The unispeech website says "You can process audio in batch or in near real-time. Using a secure connection, you can send a live audio stream to the service, and receive a stream of text in response.", as per:
So what is this near real-time functionality? If we have to start listening, stop listening, send that for transcription, then I wouldn't call that near real-time. The mention of "receive a stream of text" makes it sound like you can receive text while the person is still speaking.
Have I misunderstood what the website says?
Thanks again for your help.