Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 24 days ago • 328
qwen-nekomata Collection The nekomata model series is based on the Qwen series and has been continually pre-trained on Japanese-specific corpora. • 8 items • Updated Dec 5, 2024 • 5
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 170