This project provides clear, easy-to-use examples for:
- 🎙️ Real-time Recognition: Transcribe audio directly from a microphone.
- 🎧 Audio File Recognition: Convert speech from short audio files into text.
- 📂 Long Audio File Processing: Transcribe long audio files by intelligently splitting them into smaller chunks.
Follow these instructions to get the project running on your local machine.
Make sure you have Python 3 and pip installed on your system.
-
Clone the repository to your local machine:
git clone https://github.com/yuvanvishnupandi/speech-recognition_Python.git
-
Navigate into the project directory:
cd speech-recognition_Python -
Install the required Python packages:
pip install pydub PyAudio SpeechRecognition
Note: Linux users may need to install
portaudioseparately:
sudo apt-get install libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0
Each script is designed to be run directly from your terminal.
The app.py script listens to your microphone for a few seconds, captures the audio, and prints the transcribed text.
Run the script:
python app.py