Vocabulary issues #1
opened by Lambent
I think when I merged the Lumen adapter in, I inadvertently did it in fp16, and for some reason that shrank the embedding layer and vocabulary size, shedding the 399 tokens it had room for but did not use. So it's currently incompatible with same-ancestor models in an odd way. It seems functional, but that's still unfortunate.
... Actually no, it's when I 'reinstructed' it. So I'm unsure what exactly introduced the error.
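For anyone hitting the same mismatch, here is a minimal sketch of how one might compare the merged checkpoint's embedding size against a same-ancestor model and pad the vocabulary back, assuming standard transformers checkpoints; the paths below are placeholders, not the actual repos.

```python
from transformers import AutoModelForCausalLM

# Placeholder paths for the merged model and a same-ancestor reference.
merged_path = "path/to/merged-model"
reference_path = "path/to/same-ancestor-model"

merged = AutoModelForCausalLM.from_pretrained(merged_path)
reference = AutoModelForCausalLM.from_pretrained(reference_path)

# Compare the number of embedding rows (vocab size) in each checkpoint.
merged_vocab = merged.get_input_embeddings().weight.shape[0]
reference_vocab = reference.get_input_embeddings().weight.shape[0]
print(f"merged: {merged_vocab} rows, reference: {reference_vocab} rows")

# If the merge/reinstruct step dropped the unused padding rows, resize the
# embeddings (and tied output head) back to the reference size. The restored
# rows stay unused by the tokenizer, but the shapes become compatible again.
if merged_vocab != reference_vocab:
    merged.resize_token_embeddings(reference_vocab)
    merged.save_pretrained("path/to/merged-model-fixed")
```

This only restores shape compatibility for merging with same-ancestor models; it does not recover whatever values the dropped rows originally held, though in this case they were unused anyway.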