Whisper Engines Collection Compiled engines for running Whisper with TRT LLM for much faster inference. • 219 items • Updated 25 days ago
baseten/btest-llama3.1-70b-instruct-NVIDIA-H100-80GB-HBM3-0.15.0-TP1-fp8-checkpoint Updated 25 days ago • 8