Commit History
a213d99  fix eval regression caused in 13f7efaf74fcd3c4514277ccb71914c589873f6a (tmm1)
fbf49a4  is_causal fix for evals? (winglian)
ee26281  fix evals (#447) (winglian)
343ac84  fix check for flash attn branching (#377) (winglian)
2bb0b78  Attention mask and position id fixes for packing (#285) (winglian)
10405b9  Update XFormers Attention Monkeypatch to handle Llama-2 70B (GQA) (#339) (ssmi153)
312a9fa  move flash-attn monkey patch alongside the others (tmm1)
a032c9f  fix sdp attention to use the flash/mem-efficient context manager (winglian)
b1f4f7a  Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var (theobjectivedad)
553a86b  Adding logging enhancement (theobjectivedad)
974dc00  Fix set mem_id for inference and refactor (Nanobit)
a6190c8  Clean up landmark patching (Nanobit)
919727b  Refactor landmark attention patch (Nanobit)
a03a7d7  add support to extend context with xpos rope (winglian)
2a801b0  Fix grad checkpoint and outputs param (Nanobit)
55b8542  Feat: Add landmark attention (Nanobit)
c56818b  don't worry about dupes (winglian)
ad0ea6a  black formatting (winglian)
6cb2310  copy xformers attn from ooba since we removed dep on alpaca_lora_4bit (winglian)