Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published about 1 month ago • 45
Training Task Experts through Retrieval Based Distillation Paper • 2407.05463 • Published Jul 7, 2024 • 8
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published Oct 21, 2024 • 44
viswavi/multilingual-datafinder-huggingface-prompt-queries Feature Extraction • Updated Aug 6, 2023 • 10
viswavi/datafinder-scibert-nl-queries-dataset-description-only Feature Extraction • Updated Aug 4, 2023 • 9