NVILA: Efficient Frontier Visual Language Models • Paper • arXiv:2412.04468 • Published Dec 5, 2024 • 57 upvotes
Cautious Optimizers: Improving Training with One Line of Code • Paper • arXiv:2411.16085 • Published Nov 25, 2024 • 15 upvotes
Hymba • Collection • A series of hybrid small language models • 2 items • 25 upvotes
Star Attention: Efficient LLM Inference over Long Sequences • Paper • arXiv:2411.17116 • Published Nov 26, 2024 • 48 upvotes
Hymba: A Hybrid-head Architecture for Small Language Models • Paper • arXiv:2411.13676 • Published Nov 20, 2024 • 40 upvotes
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference • Paper • arXiv:2410.21465 • Published Oct 28, 2024 • 11 upvotes
COAT: Compressing Optimizer States and Activations for Memory-Efficient FP8 Training • Paper • arXiv:2410.19313 • Published Oct 25, 2024 • 19 upvotes
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation • Paper • arXiv:2410.01680 • Published Oct 2, 2024 • 33 upvotes
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction • Paper • arXiv:2409.18124 • Published Sep 26, 2024 • 32 upvotes
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models • Paper • arXiv:2409.17481 • Published Sep 26, 2024 • 47 upvotes
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction • Paper • arXiv:2409.17422 • Published Sep 25, 2024 • 25 upvotes
Llama 3.2 • Collection • Hosts the Transformers-format and original repos of the Llama 3.2 and Llama Guard 3 models • 15 items • Updated Dec 6, 2024 • 552 upvotes
MagpieLM • Collection • Aligning LMs with a fully open recipe and synthetic data generated from open-source LMs • 9 items • 15 upvotes
RADIO • Collection • Foundation vision models that combine multiple models (CLIP, DINOv2, SAM, etc.) • 6 items • 5 upvotes
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 187 upvotes
Learning to Move Like Professional Counter-Strike Players • Paper • arXiv:2408.13934 • Published Aug 25, 2024 • 23 upvotes
Nemotron in vLLM • Collection • Nemotron models converted and/or quantized to work well in vLLM • 7 items • Updated Jul 25, 2024 • 1 upvote
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation • Paper • arXiv:2408.12528 • Published Aug 22, 2024 • 51 upvotes