Ranjit
/

odia_whisper_small_v3.0

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Ranjit commited on Jun 9, 2023

Commit

e8df31c

·

1 Parent(s): c250bd8

Update README.md

Files changed (1) hide show

README.md +0 -32

README.md CHANGED Viewed

@@ -65,35 +65,3 @@ The following hyperparameters were used during training:
-import gradio as gr
-import torch
-from transformers import WhisperForConditionalGeneration, WhisperTokenizer
-torch.backends.cudnn.enabled = True
-# Load the speech-to-text model from Hugging Face
-model_name = "Ranjit/Whisper_v2.0"
-task = "transcribe"
-tokenizer = WhisperTokenizer.from_pretrained(model_name, task=task)
-model = WhisperForConditionalGeneration.from_pretrained(model_name).to("cuda")
-# Define a function to transcribe speech to text
-def transcribe_audio(audio):
-    input_values = tokenizer(audio, return_tensors="pt").input_values.to("cuda")
-    logits = model(input_values).logits
-    predicted_ids = torch.argmax(logits, dim=-1)
-    transcription = tokenizer.batch_decode(predicted_ids)[0]
-    return transcription
-# Create the Gradio interface
-gradio_interface = gr.Interface(
-    fn=transcribe_audio,
-    inputs="microphone",
-    outputs="text",
-    capture_session=True,  # Leverage GPU acceleration
-    title="Speech-to-Text",
-    description="Transcribe speech to text using a Wav2Vec2 model.",
-    theme="default",
-)
-gradio_interface.launch(share=True)


65
66
67