Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:1706.03762

Baichuan 2: Open Large-scale Language Models

Paper • 2309.10305 • Published Sep 19, 2023 • 19
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50

Research Papers

Research papers related to NLP.

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50
Self-Attention with Relative Position Representations

Paper • 1803.02155 • Published Mar 6, 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 16
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding

Paper • 2401.12954 • Published Jan 23, 2024 • 29

Learning NLP & LLM in 2024 - Ajinkya Kolhe

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50

Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 37
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 16
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50
Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 243

LLM_architectures

Nemotron-4 15B Technical Report

Paper • 2402.16819 • Published Feb 26, 2024 • 43
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Paper • 2402.19427 • Published Feb 29, 2024 • 53
RWKV: Reinventing RNNs for the Transformer Era

Paper • 2305.13048 • Published May 22, 2023 • 15
Reformer: The Efficient Transformer

Paper • 2001.04451 • Published Jan 13, 2020

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50

Most influential papers

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 16
Universal Language Model Fine-tuning for Text Classification

Paper • 1801.06146 • Published Jan 18, 2018 • 6
Language Models are Few-Shot Learners

Paper • 2005.14165 • Published May 28, 2020 • 12

This collection refers to the foundational papers in the area of NLP.

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 50
MetaGPT: Meta Programming for Multi-Agent Collaborative Framework

Paper • 2308.00352 • Published Aug 1, 2023 • 2
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 16
XLNet: Generalized Autoregressive Pretraining for Language Understanding

Paper • 1906.08237 • Published Jun 19, 2019

Sora参考论文

OpenAI "Video generation models as world simulators"技术报告后面的参考论文，总共32篇。OpenAI的ImageGPT和Dalle3这两篇缺失，链接已补充到note中。

Unsupervised Learning of Video Representations using LSTMs

Paper • 1502.04681 • Published Feb 16, 2015 • 1
Recurrent Environment Simulators

Paper • 1704.02254 • Published Apr 7, 2017 • 2
World Models

Paper • 1803.10122 • Published Mar 27, 2018 • 2
Generating Videos with Scene Dynamics

Paper • 1609.02612 • Published Sep 8, 2016 • 1

Sora Reference Papers

A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora

Unsupervised Learning of Video Representations using LSTMs

Paper • 1502.04681 • Published Feb 16, 2015 • 1
Recurrent Environment Simulators

Paper • 1704.02254 • Published Apr 7, 2017 • 2
World Models

Paper • 1803.10122 • Published Mar 27, 2018 • 2
Generating Videos with Scene Dynamics

Paper • 1609.02612 • Published Sep 8, 2016 • 1

Previous
1
2
3
4
5
6
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs