Collections

Discover the best community collections!

Collections including paper arxiv:2405.07883
Cognition
Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend.
Tokenizer Adaptation
Collection of research on tokenizers' adaptation to specific domains and/or languages. Special focus on sequence compression directions
Papers
Large Language Model (LLM) and NLP related papers.
multilingual
Collection by 19 days ago
Tokenizer
Collection by Jul 20, 2024