Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 19 days ago • 117
NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training Paper • 2412.02030 • Published Dec 2, 2024 • 18
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Paper • 2412.03517 • Published Dec 4, 2024 • 18
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 121
Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models Paper • 2411.07232 • Published Nov 11, 2024 • 63
Agent S: An Open Agentic Framework that Uses Computers Like a Human Paper • 2410.08164 • Published Oct 10, 2024 • 24
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation Paper • 2409.18964 • Published Sep 27, 2024 • 26
MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model Paper • 2408.10198 • Published Aug 19, 2024 • 32
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views Paper • 2408.10195 • Published Aug 19, 2024 • 12
Diaries of Open Source. Part 5!

🤯 Contextual KTO Mistral PairRM: this model combines iterative KTO, the SnorkelAI DPO dataset, AllenAI PairRM for ranking, and Mistral as the base model. It is a very strong model, with Claude 3 quality on AlpacaEval 2.0.
Final model: ContextualAI/Contextual_KTO_Mistral_PairRM
Dataset: snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
Leaderboard: https://tatsu-lab.github.io/alpaca_eval/
Base model: mistralai/Mistral-7B-Instruct-v0.2

🤏 tinyBenchmarks: Quick and cheap LLM evaluation!
Code: https://github.com/felipemaiapolo/tinyBenchmarks
Paper: tinyBenchmarks: evaluating LLMs with fewer examples (2402.14992)
Data: tinyBenchmarks/tinyMMLU

🎨 Transformers.js 2.16 includes StableLM, speaker verification and diarization, and better chat templating. Try some fun demos!
- Xenova/video-object-detection
- Xenova/cross-encoder-web
- Xenova/the-tokenizer-playground

🏴‍☠️ Abacus Liberated-Qwen1.5-72B, a Qwen 72B-based model that strongly follows system prompts
Model: abacusai/Liberated-Qwen1.5-72B

👀 Design2Code: a benchmark for converting webpage screenshots to code
Data: SALT-NLP/Design2Code
Project: https://salt-nlp.github.io/Design2Code/
Paper: Design2Code: How Far Are We From Automating Front-End Engineering? (2403.03163)

🌎 Data and models around the world
- One of the biggest Italian datasets: https://hf.co/datasets/manalog/UsenetArchiveIT
- IndicLLMSuite, the largest pre-training and instruction fine-tuning dataset collection across 22 Indic languages: ai4bharat/indicllmsuite-65ee7d225c337fcfa0991707
- Hebrew-Gemma-11B, the best base Hebrew model: yam-peleg/Hebrew-Gemma-11B
- Komodo-7B, a family of LLMs for multiple Indonesian languages: Yellow-AI-NLP/komodo-7b-base

You can find the previous part at https://huggingface.co/posts/osanseviero/127895284909100