On my device, I was able to achieve a 64.04x speedup over WASM! š¤Æ How much does WebGPU speed up ML models running locally in your browser? Try it out and share your results! š
Introducing Phi-3 WebGPU, a private and powerful AI chatbot that runs 100% locally in your browser, powered by š¤ Transformers.js and onnxruntime-web!
š On-device inference: no data sent to a server ā”ļø WebGPU-accelerated (> 20 t/s) š„ Model downloaded once and cached