Category: speech-recognition

    import nemo
    import ffmpeg
    import nemo.collections.asr as nemo_asr

    model = nemo_asr.models.EncDecCTCModel.from_pretrained(model_name="QuartzNet15x5Base-En")
    files = ["D:archivemedical speech transcription and intentMedical Speech, Transcription, and Intentrecordingstest49120_1853182_11719913.wav"]
    transcription = model.transcribe(paths2audio_files=files)

I am getting a warning and a runtime error.

Warning:

    warn("Couldn't find ffmpeg or avconv - defaulting to ffmpeg, but may not work", RuntimeWarning)

Runtime error:

    An attempt has been made to start a ..

I want to determine exactly when speech starts and ends in an audio file. First, I am using the speech_recognition library to determine the speech content of the audio file:

    import speech_recognition as sr

    filename = './dir_name/1.wav'
    r = sr.Recognizer()
    with sr.AudioFile(filename) as source:
        audio_data = r.record(source)
        text = r.recognize_google(audio_data, language="en-US")
        print(text)

Running this ..
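One rough way to approach the start/end question, independent of the recognizer, is energy thresholding: split the audio into short frames, compute each frame's RMS level, and report the first and last frames that are loud enough. The sketch below is only an illustration under several assumptions: the function name `find_speech_bounds` and its parameters are made up for this example, the input is assumed to be a 16-bit mono WAV file, and "loud" is treated as a stand-in for "speech", which will not hold for noisy recordings.

```python
import math
import struct
import wave


def find_speech_bounds(path, frame_ms=20, threshold_ratio=0.1):
    """Return (start_sec, end_sec) of the loud region in a 16-bit mono WAV, or None.

    Hypothetical helper: it finds loud audio, not speech specifically.
    """
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2 or wf.getnchannels() != 1:
            raise ValueError("this sketch only handles 16-bit mono WAV files")
        rate = wf.getframerate()
        raw = wf.readframes(wf.getnframes())

    # Decode little-endian signed 16-bit samples.
    samples = struct.unpack("<%dh" % (len(raw) // 2), raw)
    frame_len = max(1, rate * frame_ms // 1000)

    # RMS energy of each fixed-length frame.
    energies = []
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energies.append(math.sqrt(sum(s * s for s in frame) / len(frame)))

    if not energies or max(energies) == 0:
        return None  # empty or completely silent file

    # A frame counts as active if it reaches a fraction of the loudest frame.
    threshold = threshold_ratio * max(energies)
    active = [i for i, e in enumerate(energies) if e >= threshold]

    start = active[0] * frame_len / rate
    end = min((active[-1] + 1) * frame_len, len(samples)) / rate
    return start, end
```

Calling `find_speech_bounds('./dir_name/1.wav')` would return a pair of timestamps in seconds. The resolution is limited to the frame length, and the threshold ratio usually needs tuning per recording.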

Python needs to automatically recognize the language of the audio file being loaded and print the text from the audio file in that language when the user clicks the Transcribe button. Is this possible, and what should the function look like? Please help.

    from flask import Flask, render_template, request, redirect
    import speech_recognition as ..

    import speech_recognition as SRG
    import time
    store = SRG.Recognizer()
    with SRG.Microphone() as s:
        print("Speak...")
        audio_input = store.record(s, duration=1)
        try:
            text_output = store.recognize_google(audio_input)
            print(text_output)
        except:
            print("Couldn't process the audio input.")
        if(store.recognize_google(audio_input) == "hello") :
            print("Hello Sir")

Hello, I can't figure out how to stop the detection when it does not recognize anything in line 6 ..
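One thing visible in the snippet above is that `recognize_google` is called a second time outside the `try` block, so a failed recognition there is not caught. A possible pattern, sketched below, is to recognize once and only compare the result when recognition succeeded. This is a sketch, not the asker's code: `respond_to_speech` is a hypothetical helper, the recognizer is passed in as a plain callable so the example runs without a microphone, and the case-insensitive comparison is an added assumption.

```python
def respond_to_speech(recognize, audio):
    """Recognize once; return a reply, or None if nothing usable was heard.

    `recognize` is any callable that takes audio and returns text, for example
    the `recognize_google` method of a speech_recognition Recognizer.
    """
    try:
        text = recognize(audio)
    except Exception as error:
        # speech_recognition reports unintelligible audio as UnknownValueError
        # and API failures as RequestError; real code should catch those
        # specifically instead of the broad Exception used in this sketch.
        print("Couldn't process the audio input:", type(error).__name__)
        return None

    print(text)
    # Assumption: ignore case and surrounding whitespace when matching.
    if text.strip().lower() == "hello":
        return "Hello Sir"
    return None
```

With the code from the question, this could be called as `respond_to_speech(store.recognize_google, audio_input)`, which performs a single recognition request per recording.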

This is my code and I am getting this error:

    import speech_recognition as sr

    r = sr.Recognizer()
    file = sr.AudioFile('E:/music/jack.wav')
    with file as source:
        audio_file = r.record(source, duration=20)
    print(r.recognize_google(source))

And the error looks like this:

    [Running] python -u "e:\Visual studio code\file-1.py"
    Traceback (most recent call last):
      File "e:\Visual studio code\file-1.py", line 8, in <module>
        print(r.recognize_google(source))
      File "C:\Users\asus\AppData\Roaming\Python\Python39\site-packages\speech_recognition\__init__.py", line ..
