Assistance Needed with Asterisk Audio Routing for Speech-to-Text and Text-to-Speech Application

I face a simillar issue when the duration of the audio being injected is less than 5sec. Audio of 10 sec goes fine. What is the duration of audio that you are injecting ?