-
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
Paper • 2406.17294 • Published • 10 -
TokenPacker: Efficient Visual Projector for Multimodal LLM
Paper • 2407.02392 • Published • 21 -
Understanding Alignment in Multimodal LLMs: A Comprehensive Study
Paper • 2407.02477 • Published • 21 -
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Paper • 2407.03320 • Published • 93
ZyangLee
ZyangLee
AI & ML interests
None yet
Recent Activity
commented
a paper
29 days ago
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context
Learning via MCTS
Organizations
None yet
Collections
2
models
None public yet
datasets
None public yet