Post
Here is my selection of papers for today (4 Jan)
https://huggingface.co/papers
Efficient Hybrid Zoom using Camera Fusion on Mobile Phones
Incremental FastPitch: Chunk-based High Quality Text to Speech
CoMoSVC: Consistency Model-based Singing Voice Conversion
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
aMUSEd: An Open MUSE Reproduction
Image Sculpting: Precise Object Editing with 3D Geometry Control
A Vision Check-up for Language Models
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
GPT-4V(ision) is a Generalist Web Agent, if Grounded
https://huggingface.co/papers
Efficient Hybrid Zoom using Camera Fusion on Mobile Phones
Incremental FastPitch: Chunk-based High Quality Text to Speech
CoMoSVC: Consistency Model-based Singing Voice Conversion
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
aMUSEd: An Open MUSE Reproduction
Image Sculpting: Precise Object Editing with 3D Geometry Control
A Vision Check-up for Language Models
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
GPT-4V(ision) is a Generalist Web Agent, if Grounded