Commit History
load the tokenizer separately from the model (32e6fe9, winglian)
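The commit above decouples the tokenizer load from the model load, so a config can point each at a different path. A minimal sketch of that idea; the `tokenizer_config` and `base_model` key names are illustrative assumptions, not necessarily axolotl's exact config schema:

```python
def resolve_tokenizer_path(cfg: dict) -> str:
    # Prefer an explicit tokenizer override, falling back to the model path.
    # Key names here are illustrative, not axolotl's exact schema.
    return cfg.get("tokenizer_config") or cfg["base_model"]


# The resolved path would then be passed to AutoTokenizer.from_pretrained(...),
# independently of the AutoModel*.from_pretrained(...) call.
```

Keeping the two loads separate means the tokenizer can come from a different repo or local path than the model weights.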
qlora and 4bit check so we are able to merge and unload (1987e5c, winglian)
fix merge conflict failure, black format (7b5e762, winglian)
fixes to make qlora actually work (34c99f9, winglian)
fix tokenizer loading, got openllama 3b working (e396654, winglian)
stray s (f523a08, winglian)
cfg.cfg fix, also de-dupe lora module list (676d7da, winglian)
fix tuple add to list (a8771b0, winglian)
attempt to find linear modules for qlora (ffd1043, winglian)
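The "attempt to find linear modules for qlora" commit points at a common QLoRA setup step: scanning the model for `nn.Linear` layers to build the LoRA `target_modules` list (de-duplicated, per the earlier de-dupe commit). A hedged sketch of the pattern, not axolotl's exact implementation:

```python
import torch.nn as nn


def find_linear_module_names(model: nn.Module) -> list[str]:
    # Collect the leaf names of all Linear layers, de-duplicated and sorted,
    # e.g. {"q_proj", "k_proj", ...} for a LLaMA-style model.
    names = set()
    for full_name, module in model.named_modules():
        if isinstance(module, nn.Linear):
            names.add(full_name.split(".")[-1])
    # The LM head is typically excluded from LoRA target modules.
    names.discard("lm_head")
    return sorted(names)
```

The resulting list can be handed to a LoRA config as its target modules when the user does not specify them explicitly.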
apply black formatting (ce34d64, winglian)
Merge branch 'main' of github.com:OpenAccess-AI-Collective/axolotl into dev (ce694e2, winglian)
remove un-needed code, add validation (1f5d83e, winglian)
fix: handles AutoTokenizer from untrusted source (88ad05d, Valentin De Matos, unverified)
more qlora support (e8aacfb, winglian)
prepare does all this already for qlora? (b9d07aa, winglian)
integrate qlora? maybe? (3b4d055, winglian)
tokenization fixes (4ea9a66, winglian)
optionally be able to specify alpaca or chat style prompts (1d5ab84, winglian)
Set `half` using `cfg.fp16` for 4bit (641f801, Nanobit, unverified)
support for replit lm (8c2f3cb, winglian)
Add `lora_modules_to_save` (2c73c81, Nanobit, unverified)
more fixes (bdbca8f, winglian)
more fixes (42410c7, winglian)
fix torch_dtype for model load (aef00b6, winglian)
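The torch_dtype fix above concerns mapping config precision flags to the dtype passed at model load time. A hypothetical sketch; the `bf16`/`fp16` key names are assumptions, not axolotl's exact schema:

```python
import torch


def resolve_torch_dtype(cfg: dict) -> torch.dtype:
    # Map config precision flags to the dtype handed to from_pretrained's
    # torch_dtype argument; key names are illustrative.
    if cfg.get("bf16"):
        return torch.bfloat16
    if cfg.get("fp16"):
        return torch.float16
    return torch.float32
```

Getting this wrong typically surfaces as dtype-mismatch errors at the first forward pass, or as a model silently loaded in fp32 at twice the expected memory.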
optimize dataloading to use cache, fix model token embedding sizes (aa3c3f9, winglian)
black formatting (2bc1a5b, winglian)
various fixes (7a490a4, winglian)
testing mpt triton (e2e68c3, winglian)
add support for trust_remote_code for mpt models (a125693, winglian)
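The MPT commit above is about `trust_remote_code`: MPT-style checkpoints ship their own modeling code inside the repo, which Hugging Face `from_pretrained` refuses to execute unless explicitly told to. A small sketch of threading that flag through from a config; the `trust_remote_code` config key is an illustrative assumption:

```python
def remote_code_kwargs(cfg: dict) -> dict:
    # Only opt in for repos you trust: the downloaded modeling code runs
    # locally with your privileges. Config key name is illustrative.
    return {"trust_remote_code": bool(cfg.get("trust_remote_code", False))}


# Usage sketch (not run here):
#   AutoModelForCausalLM.from_pretrained(cfg["base_model"], **remote_code_kwargs(cfg))
#   AutoTokenizer.from_pretrained(cfg["base_model"], **remote_code_kwargs(cfg))
```

Defaulting the flag to off keeps the untrusted-source behavior fixed earlier in the history as the safe baseline.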
refactor inference, warn if model is frozen (247825b, winglian)
fix adam bnb optimizer grouped parameters, fix peft model 8bit conversion logic, black formatting (7748f3d, winglian)
support llama-adapter zero init attention (2255bb7, winglian)
fsdp config dict fix, todo list, add torchdistx support (ad2b48c, winglian)
8bit and deepspeed changes (9190ada, winglian)
don't load models in 8bit unless they are using an adapter, also fix tokenizer load in exceptional case (6dfdd2d, winglian)
fix sharegpt tokenization, refactor tokenization debugging (5159d00, winglian)
fix dataset handling, support galactica (4a17a4c, winglian)
tweaks to data loading, 8 bit adam, accelerate and deepspeed (097d367, winglian)
shuffle and split dataset after save/load (4f2584f, winglian)
fix sharegpt handling from hf, don't worry about loading llama if using earlier transformers release (8d43785, winglian)
various bugfixes (94f5e41, winglian)
fix bug when model_type not explicitly passed (bb991fd, winglian)
improve inference (d653859, winglian)
quickstart instructions for starting from runpod (#5) (0a472e1, winglian, unverified)
attempt xformers hijack attention (8746b70, winglian)
WIP large refactor to make finetune script a little more manageable (#3) (6045345, winglian, unverified)