mmx31 (David Clayton)

liked a Space 8 days ago

Running on Zero

94

🌍

MV Adapter I2MV SDXL

Generate multi-view images from a single image

liked a model 9 days ago

DoctorDiffusion/Absynth-2.0

Updated 10 days ago • 8

liked a model 10 days ago

tencent/Hunyuan3D-2

Image-to-3D • Updated 8 days ago • 26.3k • 686

liked a model 12 days ago

ostris/Flex.1-alpha

Text-to-Image • Updated 13 days ago • 16.6k • 326

liked a model 17 days ago

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 5 days ago • 191k • 891

liked 3 models 20 days ago

reacted to merve's post with ❤️ 20 days ago

Post

3614

What a beginning to this year in open ML 🤠
Let's unwrap! merve/jan-10-releases-677fe34177759de0edfc9714

Multimodal 🖼️
> ByteDance released SA2VA: a family of vision LMs that can take image, video, text and visual prompts
> moondream2 is out with new capabilities like outputting structured data and gaze detection!
> Dataset: Alibaba DAMO lab released multimodal textbook — 22k hours worth of samples from instruction videos 🤯
> Dataset: SciCap captioning on scientific documents benchmark dataset is released along with the challenge!

LLMs 💬
> Microsoft released Phi-4, sota open-source 14B language model 🔥
> Dolphin is back with Dolphin 3.0 Llama 3.1 8B 🐬🐬
> Prime-RL released Eurus-2-7B-PRIME a new language model trained using PRIME alignment
> SmallThinker-3B is a new small reasoning LM based on Owen2.5-3B-Instruct 💭
> Dataset: QWQ-LONGCOT-500K is the dataset used to train SmallThinker, generated using QwQ-32B-preview 📕
> Dataset: @cfahlgren1 released React Code Instructions: a dataset of code instruction-code pairs 📕
> Dataset: Qwen team is on the roll, they just released CodeElo, a dataset of code preferences 👩🏻‍💻

Embeddings 🔖
> @MoritzLaurer released zero-shot version of ModernBERT large 👏
> KaLM is a new family of performant multilingual embedding models with MIT license built using Qwen2-0.5B

Image/Video Generation ⏯️
> NVIDIA released Cosmos, a new family of diffusion/autoregressive World Foundation Models generating worlds from images, videos and texts 🔥
> Adobe released TransPixar: a new text-to-video model that can generate assets with transparent backgrounds (a first!)
> Dataset: fal released cosmos-openvid-1m Cosmos-tokenized OpenVid-1M with samples from OpenVid-1M

Others
> Prior Labs released TabPFNv2, the best tabular transformer is out for classification and regression
> Metagene-1 is a new RNA language model that can be used for pathogen detection, zero-shot embedding and genome understanding

liked a model 20 days ago

ByteDance/Sa2VA-8B

Image-Text-to-Text • Updated 17 days ago • 4.3k • 44

liked a model 22 days ago

Efficient-Large-Model/Sana_1600M_4Kpx_BF16

Text-to-Image • Updated 21 days ago • 1.25k • 26

reacted to nyuuzyou's post with 🔥 about 1 month ago

Post

2256

🎨 KLING AI Dataset - nyuuzyou/klingai

A collection of 12,782 AI-generated media items featuring:
- High-quality image and video generations at various resolutions
- Complete metadata including user IDs, prompts, and generation parameters
- Content generated using text-to-image, text-to-video, and image-to-video modalities
- Full generation settings and technical parameters

liked a dataset about 1 month ago

nyuuzyou/klingai

Viewer • Updated Dec 28, 2024 • 12.8k • 125 • 10

reacted to nyuuzyou's post with 👍 about 1 month ago

Post

1322

🎮 GoodGame.ru Clips Dataset - nyuuzyou/goodgame

A collection of 39,280 video clips metadata from GoodGame.ru streaming platform featuring:

- Complete clip information including direct video URLs and thumbnails
- Streamer details like usernames and avatars
- Engagement metrics such as view counts
- Game categories and content classifications
- Released under Creative Commons Zero (CC0) license

This extensive clips collection provides a valuable resource for developing and evaluating video-based AI applications, especially in Russian gaming and streaming contexts.