Record, transcribe, TTS between callers

Vieri · March 24, 2025, 12:28am

Hi,

When one caller calls another, is it possible to preprocess both the caller’s and the callee’s audio streams so that each is fed to a transcribing ASR (Vosk or Whisper) and then to a TTS, with the synthesized audio replacing the original audio in both directions?

Maybe with Stasis() but any ideas / pointers?

Thanks

UPDATE: the transcription and TTS would need to be ongoing, a bit like Record() in an infinite loop with silence detection: on silence → run ASR + TTS and play back the result.
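
As a rough illustration of the "on silence → run ASR + TTS" loop described in the update, the segmentation step could look something like the sketch below. It assumes the audio arrives as fixed-size frames of 16-bit signed little-endian PCM; the function names, the RMS threshold, and the frame count are made up for the example and would need tuning. The ASR and TTS calls themselves are left out.

```python
import math
import struct
from typing import Iterable, Iterator, List


def frame_rms(frame: bytes) -> float:
    """RMS energy of one frame of 16-bit signed little-endian PCM."""
    count = len(frame) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack("<%dh" % count, frame[: count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


def split_on_silence(
    frames: Iterable[bytes],
    threshold: float = 500.0,      # assumed value, tune for the real audio
    max_silent_frames: int = 25,   # assumed value, e.g. 25 x 20 ms = 0.5 s
) -> Iterator[bytes]:
    """Yield one utterance each time speech is followed by enough silence."""
    buffered: List[bytes] = []
    silent_run = 0
    heard_speech = False
    for frame in frames:
        if frame_rms(frame) >= threshold:
            # Speech frame: keep it and reset the silence counter.
            heard_speech = True
            silent_run = 0
            buffered.append(frame)
        elif heard_speech:
            # Silence after speech: keep it until the pause is long enough.
            silent_run += 1
            buffered.append(frame)
            if silent_run >= max_silent_frames:
                yield b"".join(buffered)
                buffered, silent_run, heard_speech = [], 0, False
        # Silence before any speech is dropped.
    if heard_speech:
        # Flush a trailing utterance when the stream ends mid-speech.
        yield b"".join(buffered)
```

Each yielded chunk would then be handed to the ASR, the resulting text to the TTS, and the synthesized audio played to the other party.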

jcolp · March 24, 2025, 10:04am

ARI + Dialling + Bridges + External Media + Bidirectional Audio.

All the pieces are present to do it using ARI, but you have to put them together.
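
One possible way to put those pieces together, sketched as a list of ARI REST requests rather than a working application. The layout is an assumption, not something spelled out above: each party gets its own mixing bridge shared with its own External Media channel, so neither party hears the other directly, and an external media service (not shown) carries the ASR + TTS output from one leg to the other. The helper names, the fixed IDs, and the `slin16` format choice are illustrative only.

```python
from typing import Dict, List, Tuple

# (HTTP method, ARI path, query parameters)
Request = Tuple[str, str, Dict[str, str]]


def plan_leg(app: str, leg: str, channel_id: str, media_host: str,
             fmt: str = "slin16") -> List[Request]:
    """Requests that isolate one party in a bridge with an External Media channel."""
    bridge_id = f"{leg}-bridge"   # hypothetical fixed IDs; use unique ones per call
    media_id = f"{leg}-media"
    return [
        ("POST", "/ari/bridges", {"type": "mixing", "bridgeId": bridge_id}),
        ("POST", "/ari/channels/externalMedia", {
            "app": app,
            "channelId": media_id,
            "external_host": media_host,   # host:port of the media service
            "format": fmt,
        }),
        # addChannel accepts a comma-separated list of channel IDs.
        ("POST", f"/ari/bridges/{bridge_id}/addChannel",
         {"channel": f"{channel_id},{media_id}"}),
    ]


def plan_call(app: str, caller_channel_id: str, callee_endpoint: str,
              caller_media_host: str, callee_media_host: str) -> List[Request]:
    """Ordered requests for a two-leg call with no direct audio path between parties."""
    callee_channel_id = "callee-leg"   # hypothetical ID for the dialled channel
    steps = plan_leg(app, "caller", caller_channel_id, caller_media_host)
    # Dial the callee into the same Stasis application.
    steps.append(("POST", "/ari/channels", {
        "endpoint": callee_endpoint,
        "app": app,
        "channelId": callee_channel_id,
    }))
    # In a real application these would wait for the callee's StasisStart event.
    steps += plan_leg(app, "callee", callee_channel_id, callee_media_host)
    return steps
```

For example, `plan_call("relay", "1234.1", "PJSIP/bob", "127.0.0.1:40000", "127.0.0.1:40002")` yields seven requests. The media service would take the audio it receives from the caller's External Media channel, run ASR + TTS on it, and send the synthesized audio into the callee's External Media channel, and do the same in the other direction.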

system · April 23, 2025, 10:05am

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.
