Papers
arxiv:2307.06962

Copy Is All You Need

Published on Jul 13, 2023
ยท Submitted by akhaliq on Jul 16, 2023
#1 Paper of the day
Authors:
,

Abstract

The dominant text generation models compose the output by sequentially selecting words from a fixed vocabulary. In this paper, we formulate text generation as progressively copying text segments (e.g., words or phrases) from an existing text collection. We compute the contextualized representations of meaningful text segments and index them using efficient vector search toolkits. The task of text generation is then decomposed into a series of copy-and-paste operations: at each time step, we seek suitable text spans from the text collection rather than selecting from a standalone vocabulary. Experiments on the standard language modeling benchmark (WikiText-103) show that our approach achieves better generation quality according to both automatic and human evaluations. Besides, its inference efficiency is comparable to token-level autoregressive models thanks to the reduction of decoding steps. We also show that our approach allows for effective domain adaptation by simply switching to domain-specific text collection without extra training. Finally, we observe that our approach attains additional performance gains by simply scaling up to larger text collections, again without further training.Our source codes are publicly available at \url{https://github.com/gmftbyGMFTBY/Copyisallyouneed.}

Community

ๅพˆๅฎž็”จ็š„ๆŠ€ๆœฏ๏ผŒๅฐคๅ…ถๆ˜ฏๅœจ็‰นๅฎš้ข†ๅŸŸ้œ€่ฆ่พ“ๅ‡บๅ—้™็š„ๆ–‡ๆœฌๅ†…ๅฎนๅœบๆ™ฏไธ‹๏ผŒไพ‹ๅฆ‚ๆณ•ๅพ‹ๆกๆ–‡ใ€ๅŒปๅญฆๆœฏ่ฏญ็ญ‰็ญ‰ใ€‚ไธ่ฟ‡ไธบไป€ไนˆๆˆ‘ไผšๆƒณ่ตทๆ›พ็ป็š„โ€œ่ƒŒ่ฏตๆฏ›่ฏญๅฝ•โ€ๅŽ†ๅฒๆ—ถๆœŸๅ‘ข[doge]

@librarian-bot recommend

ยท

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Revolutionizing Text Generation: The Power of Copy-Generator (CoG)

Links ๐Ÿ”—:

๐Ÿ‘‰ Subscribe: https://www.youtube.com/@Arxflix
๐Ÿ‘‰ Twitter: https://x.com/arxflix
๐Ÿ‘‰ LMNT (Partner): https://lmnt.com/

By Arxflix
9t4iCUHx_400x400-1.jpg

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2307.06962 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2307.06962 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2307.06962 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.