Centre for Vision, Speech and Signal Processing - University of Surrey

university

https://www.surrey.ac.uk/centre-vision-speech-signal-processing

cvssp_research

Activity Feed Request to join this org

AI & ML interests

Audio, Vision

Recent Activity

Xubo-Liu authored a paper about 1 month ago

Scaling Transformers for Low-Bitrate High-Quality Speech Coding

Xiatian-Zhu authored a paper about 1 month ago

FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion

Xiatian-Zhu authored a paper 4 months ago

Recognize Any Regions

View all activity

cvssp's activity

Xubo-Liu

authored a paper about 1 month ago

Scaling Transformers for Low-Bitrate High-Quality Speech Coding

Paper • 2411.19842 • Published Nov 29, 2024 • 10

Xiatian-Zhu

authored a paper about 1 month ago

FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion

Paper • 2411.18552 • Published Nov 27, 2024 • 17

Xiatian-Zhu

authored 4 papers 4 months ago

Recognize Any Regions

Paper • 2311.01373 • Published Nov 2, 2023 • 1

AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis

Paper • 2406.08920 • Published Jun 13, 2024 • 7

Gaussian Splatting with Localized Points Management

Paper • 2406.04251 • Published Jun 6, 2024

FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation

Paper • 2409.03525 • Published Sep 5, 2024 • 12

haoheliu

authored a paper 6 months ago

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Paper • 2407.14329 • Published Jul 19, 2024 • 4

Xiatian-Zhu

authored a paper 6 months ago

PartCraft: Crafting Creative Objects by Parts

Paper • 2407.04604 • Published Jul 5, 2024 • 4

haoheliu

authored 2 papers 8 months ago

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Paper • 2405.00233 • Published Apr 30, 2024 • 13

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 29

sanchit-gandhi

updated 7 models 9 months ago

anindyamondal

authored 3 papers 9 months ago

OmniCount: Multi-label Object Counting with Semantic-Geometric Priors

Paper • 2403.05435 • Published Mar 8, 2024 • 1

Actor-agnostic Multi-label Action Recognition with Multi-modal Query

Paper • 2307.10763 • Published Jul 20, 2023

Time-varying Signals Recovery via Graph Neural Networks

Paper • 2302.11313 • Published Feb 22, 2023

AI & ML interests

Recent Activity

Team members 13

cvssp's activity