KW's picture

58 1044

KW

kevineen

·

AI & ML interests

None yet

Recent Activity

liked a dataset about 9 hours ago

unitreerobotics/LAFAN1_Retargeting_Dataset

liked a model about 17 hours ago

sentence-transformers/all-MiniLM-L6-v2

upvoted a paper about 20 hours ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

View all activity

Organizations

kevineen's activity

upvoted a paper about 20 hours ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 21 days ago • 136

upvoted a paper 3 days ago

1.58-bit FLUX

Paper • 2412.18653 • Published 10 days ago • 63

upvoted a collection 4 days ago

YuLan-Mini

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 6 days ago • 10

upvoted a paper 9 days ago

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published 11 days ago • 23

upvoted a paper 11 days ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published 29 days ago • 54

upvoted 2 papers 15 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 15 days ago • 334

AniDoc: Animation Creation Made Easier

Paper • 2412.14173 • Published 16 days ago • 49

upvoted a collection 27 days ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated 4 days ago • 78

upvoted 2 collections about 1 month ago

LLM-jp-3 Pre-trained Models

Pre-trained models in the LLM-jp-3 model series • 4 items • Updated 11 days ago • 5

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 68

upvoted a paper about 2 months ago

xLSTM: Extended Long Short-Term Memory

Paper • 2405.04517 • Published May 7, 2024 • 12

upvoted an article 2 months ago

Article

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Oct 22, 2024

• 49

upvoted a collection 3 months ago

Gemma-APS Release

Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated 21 days ago • 20

upvoted 3 papers 3 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 168

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Paper • 2410.02757 • Published Oct 3, 2024 • 36

MuCodec: Ultra Low-Bitrate Music Codec

Paper • 2409.13216 • Published Sep 20, 2024 • 23

upvoted a collection 3 months ago

Kurage

Multipurpose RAG models for many languages • 13 items • Updated Oct 10, 2024 • 2

upvoted a paper 4 months ago

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19, 2024 • 48

upvoted a collection 4 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 452

upvoted a paper 4 months ago

Ruri: Japanese General Text Embeddings

Paper • 2409.07737 • Published Sep 12, 2024 • 7