Streaming from ARI snoop channel for Speech recognition

Mani3274 · September 3, 2018, 5:54am

Hi all,

I’d like to stream audio in realtime to an external speech engine (Google, Watson). I am using ARI API’s to get the recoding path of the audio file and I am reading continuously from that file, while asterisk is still writing to it parallel.

Is this a good approach to stream audio for realtime transcription. Is anybody doing realtime transcription from Asterisk. I know there are multiple approaches for speech transcription, If it is possible can you share your approach. I am talking about streaming to cloud based speech engines like Google, Watson, nuance etc.

gilles.verriez · January 11, 2019, 3:10pm

Hi,

I’m trying to do the exact same thing you had problem on.
I’m currently fighting for getting the stream audio from Node.js ARI-Client and I can’t figure out how to do it.
I can imagine you could find a way to solve your problem, but whatever happened, if you could share your work it could be a great help for me!

Thank you in advance.

jcolp · January 11, 2019, 3:52pm

ARI itself does not currently provide a mechanism for getting the audio stream.

gilles.verriez · January 11, 2019, 4:31pm

Hi, thank you for this quick answer.
Does ARI provide one for getting the audio file instead?

jcolp · January 11, 2019, 4:38pm

ARI itself no, but I’ve heard that it may be possible to configure the HTTP server itself to allow downloading of such files. I don’t have any experience with it though.

gilles.verriez · January 11, 2019, 4:56pm

Maybe found a solution
This example allow to send POST request with a file stored on the disk : https://gist.github.com/alepez/9205394
On my environment, records are stored at this location /var/spool/asterisk/recording
So I just had to replace “filename” variable with ‘/var/spool/asterisk/recording/’+recording.name+’.wav’

yeya · April 26, 2020, 3:11pm

to get a live data, I think you need to start here

https://docs.asterisk.org/Development/Reference-Information/Asterisk-Framework-and-API-Examples/External-Media-and-ARI

danjenkins · April 28, 2020, 9:27am

@yeya yeah and heres an example project on how to use it to connect it to dialogflow - theres another in the nimble ape github org that takes audio and sends it to google.

This one and it’s associated ARI bridge project actually uses snoop

harmonyts · June 3, 2020, 6:44am

@[danjenkins] Could please inform us (or anyone else) if I we can do the following?

We have an asterisk that receives calls from several users at the same time. For each phone call we create a sound file and record only the Tx audio from the caller. After hang-up, we transfer the audio file to the google cloud and we use the STT API to receive the transcription text.

Could we made this with realtime streaming for each call channel separately?
Also, if we need that for 20 simultaneous calls, what processor/memory resources we will need?

thank you in advance

danjenkins · June 3, 2020, 10:00am

Yes, 100% possible with streaming.

Resources - very little because youre just passing media from A to B

harmonyts · June 3, 2020, 10:16am

dialogflow is a good solution for this ?

danjenkins · June 4, 2020, 8:18am

If you just want to do transcription… just use google’s speech to text engine. You can find an example of how to do that over at https://github.com/nimbleape/dana-tsg-rtp-stt-audioserver

harmonyts · June 4, 2020, 10:39am

Thank you very much for your reply. I’ll check it.

ToniNavarro · October 21, 2020, 4:19pm

Hi Dan, thanks for sharing your code with community. It is so inspiring.
I am just trying to install your stt audioserver solution but I have this error:

root@raspbx:/home/audioserver# ls
config Dockerfile index.js lib package.json yarn.lock
root@raspbx:/home/audioserver# yarn start
yarn run v1.22.10
$ node index.js
internal/modules/cjs/loader.js:638
throw err;
^

Error: Cannot find module ‘config’
at Function.Module._resolveFilename (internal/modules/cjs/loader.js:636:15)
at Function.Module._load (internal/modules/cjs/loader.js:562:25)
at Module.require (internal/modules/cjs/loader.js:692:17)
at require (internal/modules/cjs/helpers.js:25:18)
at Object. (/home/audioserver/index.js:2:16)
at Module._compile (internal/modules/cjs/loader.js:778:30)
at Object.Module._extensions…js (internal/modules/cjs/loader.js:789:10)
at Module.load (internal/modules/cjs/loader.js:653:32)
at tryModuleLoad (internal/modules/cjs/loader.js:593:12)
at Function.Module._load (internal/modules/cjs/loader.js:585:3)
error Command failed with exit code 1.
info Visit https://yarnpkg.com/en/docs/cli/run for documentation about this command.

After a whole afternoon digging in stackoverflow and this forum, I am not able to realize what is wrong.
Do you know what could be happening?

Thanks in advance and sorry for inconveniences

ToniNavarro · October 22, 2020, 10:44am

Forget my last question. I already realized what was the problem. I am newbie in JS and the name threw me off

Thanks a lot

Topic		Replies	Views
ARI-CLIENT SPEECH TO TEXT STREAMING Asterisk APIs	3	1105	October 18, 2023
Using custom TTS from external server with ARI Asterisk APIs	7	1982	March 5, 2020
Hello, I want to stream both the parties audio separately to a web socket for real time transcription and diarization(speaker labelling). I am able to record the audio separately using monitor for both agent and costumer but i want to steam the audio Asterisk APIs	11	804	August 10, 2024
Transcribe Audio to File - Realtime Asterisk APIs	3	1009	May 27, 2023
Stream audio to Speech to Text Asterisk Support	2	1359	June 3, 2019

Streaming from ARI snoop channel for Speech recognition

Related topics