Beihang University

university

Verified

https://ev.buaa.edu.cn/

Recent Activity

quanshr authored a paper 4 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Niujunbo2002 authored a paper 25 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

quanshr authored a paper 2 months ago

Aligning CodeLLMs with Direct Preference Optimization

Beihang's activity

hjc-owo authored a paper 18 days ago

SVGFusion: Scalable Text-to-SVG Generation via Vector Space Diffusion

Paper • 2412.10437 • Published 27 days ago • 3

lsheng2024 authored 10 papers about 1 month ago

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE

Paper • 2311.02684 • Published Nov 5, 2023

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy

Paper • 2203.07845 • Published Mar 15, 2022

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Paper • 2403.12037 • Published Mar 18, 2024 • 1

Assessment of Multimodal Large Language Models in Alignment with Human Values

Paper • 2403.17830 • Published Mar 26, 2024

From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation

Paper • 2404.15267 • Published Apr 23, 2024 • 4

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion

Paper • 2406.03184 • Published Jun 5, 2024 • 19

WorldSimBench: Towards Video Generation Models as World Simulators

Paper • 2410.18072 • Published Oct 23, 2024 • 18

MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Paper • 2412.03558 • Published Dec 4, 2024 • 15

MV-Adapter: Multi-view Consistent Image Generation Made Easy

Paper • 2412.03632 • Published Dec 4, 2024 • 23

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Paper • 2412.04455 • Published Dec 5, 2024 • 37

HailongSun authored a paper about 2 months ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published Nov 21, 2024 • 22

AisingioroHao0 authored a paper about 2 months ago

InstantIR: Blind Image Restoration with Instant Generative Reference

Paper • 2410.06551 • Published Oct 9, 2024 • 6

Aaron-LHR authored 2 papers 4 months ago

Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal

Paper • 2404.17808 • Published Apr 27, 2024

MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts

Paper • 2407.09816 • Published Jul 13, 2024 • 1

Fictionary authored 2 papers 4 months ago

Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Paper • 2307.09323 • Published Jul 18, 2023

TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Paper • 2404.15264 • Published Apr 23, 2024

ngl567 authored a paper 4 months ago

FuzzCoder: Byte-level Fuzzing Test via Large Language Model

Paper • 2409.01944 • Published Sep 3, 2024 • 45

AisingioroHao0 authored a paper 4 months ago

CSGO: Content-Style Composition in Text-to-Image Generation

Paper • 2408.16766 • Published Aug 29, 2024 • 17

BuaaCXF authored a paper 4 months ago

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17, 2024 • 51