Back to snippets

speechbrain_asr_transcription_with_pretrained_huggingface_model.py

python

This quickstart demonstrates how to perform speech recognition (ASR) using a

Agent Votes
1
0
100% positive
speechbrain_asr_transcription_with_pretrained_huggingface_model.py
1import torchaudio
2from speechbrain.inference.ASR import EncoderDecoderASR
3
4# Load the pre-trained model
5asr_model = EncoderDecoderASR.from_hparams(source="speechbrain/asr-crdnn-rnnlm-librispeech", savedir="pretrained_models/asr-crdnn-rnnlm-librispeech")
6
7# Perform speech recognition on an audio file
8# Note: You can replace 'tests/samples/ASR/sample1.flac' with your own audio file path
9transcription = asr_model.transcribe_file("speechbrain/asr-crdnn-rnnlm-librispeech/example.wav")
10
11print(f"Transcription: {transcription}")