import gradio as gr gr.load("models/ahmedgongi/Llama_devops2_merged_4bit").launch()