split up llama model loading so config can be loaded from base config and models can be loaded from a path
2520ecd
winglian
commited on