Update config.json
#115 opened 4 months ago
by
mohdazlah
Need a little guidance accessing https://huggingface.co/spaces/stevenijacobs/AI4Reading using an API. I'm trying to setup a resource to help students with learning disabilities.
#114 opened 4 months ago
by
stevenijacobs
Add missing space in prompt template
5
#113 opened 4 months ago
by
Rocketknight1
UPDATE README.md
#112 opened 4 months ago
by
Kryslynn93
tokenizer offset_mapping is incorrect
1
#111 opened 4 months ago
by
Aflt98
KeyError: 'llama'
2
#110 opened 4 months ago
by
ronnief1
OutOfMemoryError: CUDA out of memory
2
#109 opened 4 months ago
by
sieudd
Issue with accessing gated repo
6
#107 opened 5 months ago
by
vdcapriles
Deploy error (RuntimeError: weight lm_head.weight does not exist)
1
#106 opened 5 months ago
by
steveleancommerce
"TypeError: Object of type Undefined is not JSON serializable" when tokenizing tool_call inputs
3
#104 opened 5 months ago
by
ztgeng
Formats for prompting the model using Hugging face
3
#103 opened 5 months ago
by
javalenzuela
Request: DOI
#102 opened 5 months ago
by
guicozmaciel
Time Module issue or Model?
1
#101 opened 5 months ago
by
rkapuaala
Issues with Tools use and Chat templates
#99 opened 5 months ago
by
pyrator
Upgrading Linux Dist
#98 opened 5 months ago
by
rkapuaala
Clone Repository
1
#96 opened 5 months ago
by
clearcash
llama3.1 gguf format
3
#95 opened 5 months ago
by
davidomars
how can i use git clone Meta-Llama-3.1-8B-Instruct
1
#93 opened 5 months ago
by
xiangsuyu
Asking for Pro subscription
6
#92 opened 5 months ago
by
Mayo133
update rope_scaling
#91 opened 5 months ago
by
Arunjith
Update for correct tool use system prompt
3
#90 opened 5 months ago
by
ricklamers
What call() function parameters besides "query" can be used by the model when doing brave_search and wolfram_alpha tool calls?
#89 opened 5 months ago
by
sszymczyk
What form of the built-in brave_search and wolfram_alpha tool call output is expected by the model?
3
#88 opened 5 months ago
by
sszymczyk
ValueError
1
#87 opened 5 months ago
by
Bmurug3
Request: DOI
1
#86 opened 5 months ago
by
sanjeev929
Request: DOI
1
#85 opened 5 months ago
by
moh996
The model repeatedly outputs a large amount of text and does not comply with the instructs.
10
#84 opened 5 months ago
by
baremetal
Llama repo access not aproved yet
#83 opened 5 months ago
by
APaul1
Throwing Error for AutoModelForSequence Classification
1
#82 opened 5 months ago
by
deshwalmahesh
GSM8K Evaluation Result: 84.5 vs. 76.95
17
#81 opened 5 months ago
by
tanliboy
Deploying Llama3.1 to Nvidia T4 instance (sagemaker endpoints)
4
#80 opened 5 months ago
by
mleiter
Variable answer is getting predicted for same prompt
#79 opened 5 months ago
by
sjainlucky
Efficiency low after adding the adapter_model.safetensors with base model
#78 opened 5 months ago
by
antony-pk
Minimum gpu ram capacity
12
#77 opened 5 months ago
by
bob-sj
Tokenizer padding token
1
#76 opened 5 months ago
by
Rish1
new tokenizer contains the cutoff date and today date by default
4
#74 opened 5 months ago
by
yuchenlin
New bee questions
2
#73 opened 5 months ago
by
rkapuaala
Add `base_model` metadata
#72 opened 5 months ago
by
sbrandeis
Full SFT training caused lose its foundational capabilities
10
#71 opened 5 months ago
by
sinlew
Wrong number of tensors; expected 292, got 291
6
#69 opened 5 months ago
by
KingBadger
Fine tuned Meta-Llama-3.1-8B-Instruct deployment on AWS Sagemaker fails
2
#68 opened 5 months ago
by
byamasuwhatnowis
Quick Fix: Rope Scaling or Rope Type Error
4
#67 opened 5 months ago
by
deepaksiloka
Can't reproduce MATH performance
1
#66 opened 5 months ago
by
jpiabrantes
Banned for Iranian People
13
#65 opened 5 months ago
by
MustafaLotfi
Inference endpoint deployment for 'meta-llama/Meta-Llama-3.1-8B-Instruct' fails
6
#62 opened 5 months ago
by
Keertiraj
Meta-Llama-3.1-8B-Instruct deployment on AWS Sagemaker fails
3
#61 opened 5 months ago
by
Keertiraj
Error Loading the original model file consolidated.00.pth from local
2
#60 opened 5 months ago
by
chanduvkp
Unable to deploy Meta-Llama-3.1-8B-Instruct model on Sagemaker
3
#58 opened 5 months ago
by
axs531622