Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
jerryzh168
/
llama3-8b-autoquant
like
0
Text Generation
Transformers
PyTorch
llama
text-generation-inference
Inference Endpoints
torchao
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
llama3-8b-autoquant
1 contributor
History:
31 commits
jerryzh168
Upload compile_artifacts.pt2 with huggingface_hub
85dd6c4
verified
19 days ago
.gitattributes
1.63 kB
Upload compile_artifacts.pt2 with huggingface_hub
19 days ago
README.md
5.17 kB
Upload LlamaForCausalLM
30 days ago
compile_artifacts.pt2
81.6 MB
LFS
Upload compile_artifacts.pt2 with huggingface_hub
19 days ago
config.json
927 Bytes
Upload LlamaForCausalLM
19 days ago
generation_config.json
177 Bytes
Upload LlamaForCausalLM
30 days ago
pytorch_model-00001-of-00002.bin
pickle
Detected Pickle imports (22)
"torch.IntStorage"
,
"torchao.dtypes.affine_quantized_tensor.AffineQuantizedTensor"
,
"torch.serialization._get_layout"
,
"torch.CharStorage"
,
"torch.BFloat16Storage"
,
"torchao.quantization.quant_primitives.ZeroPointDomain"
,
"torchao.quantization.autoquant.AQInt4G64WeightOnlyQuantizedLinearWeight"
,
"torch._tensor._rebuild_from_type_v2"
,
"torch.int8"
,
"torchao.dtypes.utils.PlainLayout"
,
"torch._utils._rebuild_wrapper_subclass"
,
"torchao.dtypes.uintx.tensor_core_tiled_layout.TensorCoreTiledAQTTensorImpl"
,
"torch.int32"
,
"torch._utils._rebuild_tensor_v2"
,
"torchao.dtypes.uintx.plain_layout.PlainAQTTensorImpl"
,
"torchao.dtypes.uintx.tensor_core_tiled_layout.TensorCoreTiledLayout"
,
"torch.bfloat16"
,
"torch.device"
,
"collections.OrderedDict"
,
"torch.LongStorage"
,
"torchao.quantization.autoquant.AQInt8DynamicallyQuantizedLinearWeight"
,
"torchao.quantization.quant_api._int8_symm_per_token_reduced_range_quant"
How to fix it?
4.99 GB
LFS
Upload LlamaForCausalLM
19 days ago
pytorch_model-00002-of-00002.bin
pickle
Detected Pickle imports (22)
"torch.serialization._get_layout"
,
"torchao.dtypes.affine_quantized_tensor.AffineQuantizedTensor"
,
"torch.int32"
,
"torch.bfloat16"
,
"torchao.dtypes.uintx.plain_layout.PlainAQTTensorImpl"
,
"torch.IntStorage"
,
"torchao.dtypes.utils.PlainLayout"
,
"torch.LongStorage"
,
"torch._tensor._rebuild_from_type_v2"
,
"torch._utils._rebuild_wrapper_subclass"
,
"torchao.dtypes.uintx.tensor_core_tiled_layout.TensorCoreTiledAQTTensorImpl"
,
"torch.int8"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.device"
,
"torchao.quantization.quant_api._int8_symm_per_token_reduced_range_quant"
,
"torchao.dtypes.uintx.tensor_core_tiled_layout.TensorCoreTiledLayout"
,
"torchao.quantization.autoquant.AQInt4G64WeightOnlyQuantizedLinearWeight"
,
"torchao.quantization.quant_primitives.ZeroPointDomain"
,
"torchao.quantization.autoquant.AQInt8DynamicallyQuantizedLinearWeight"
,
"torch.CharStorage"
,
"torch.BFloat16Storage"
,
"collections.OrderedDict"
How to fix it?
2.89 GB
LFS
Upload LlamaForCausalLM
19 days ago
pytorch_model.bin.index.json
28 kB
Upload LlamaForCausalLM
19 days ago
special_tokens_map.json
301 Bytes
Upload tokenizer
30 days ago
tokenizer.json
17.2 MB
LFS
Upload tokenizer
30 days ago
tokenizer_config.json
50.6 kB
Upload tokenizer
30 days ago