AI & ML interests

None defined yet.

Recent Activity

SmerkyGย  updated a model about 2 months ago
RWKV/v6-Finch-7B-World3-HF
SmerkyGย  updated a model about 2 months ago
RWKV/v6-Finch-7B-World3-HF
SmerkyGย  new activity 2 months ago
RWKV/v6-Finch-7B-HF:Update README.md
View all activity

RWKV's activity

BlinkDLย 
posted an update about 1 month ago
view post
Post
2881
RWKV-7 "Goose" 0.4B trained w/ ctx4k automatically extrapolates to ctx32k+, and perfectly solves NIAH ctx16k ๐Ÿคฏ 100% RNN and attention-free. Only trained on the Pile. No finetuning. Replicable training runs. tested by our community: https://github.com/Jellyfish042/LongMamba
SmerkyGย 
in RWKV/v6-Finch-7B-HF 2 months ago

Update README.md

#1 opened 5 months ago by
SmerkyG
BlinkDLย 
posted an update 3 months ago
view post
Post
5394
RWKV-6-world-v3 (+3.1T tokens) is our best multilingual 7B model as of now: BlinkDL/rwkv-6-world

It's 100% RNN and attention-free. MMLU 54.2% (previous world-v2.1 = 47.9%. note: without eval-boosting tricks such as annealing).

RWKV-7-world-v4 soon :)
BlinkDLย 
posted an update 4 months ago
view post
Post
5521
RWKV-7 "Goose" preview rc2 => Peak RNN architecture?๐Ÿ˜ƒWill try to squeeze more performance for the final release. Preview code & model: https://github.com/BlinkDL/RWKV-LM/tree/main/RWKV-v7
  • 2 replies
ยท
xianbaoย 
posted an update 5 months ago
view post
Post
1910
With the open-weight release of CogVideoX-5B from THUDM, i.e. GLM team, the Video Generation Model (how about calling it VGM) field has officially became the next booming "LLM"

What does the landscape look like? What are other video generation models? This collection below is all your need.

xianbao/video-generation-models-66c350163c74f60f5c412af6

The above video is generated by @a-r-r-o-w with CogVideoX-5B, taken from a nice lookout for the field!
ybelkadaย 
posted an update 5 months ago
ybelkadaย 
posted an update 6 months ago