

Collections including paper arxiv:2402.01613

Towards General Text Embeddings with Multi-stage Contrastive Learning

Paper • 2308.03281 • Published Aug 7, 2023 • 1
NEFTune: Noisy Embeddings Improve Instruction Finetuning

Paper • 2310.05914 • Published Oct 9, 2023 • 14
EELBERT: Tiny Models through Dynamic Embeddings

Paper • 2310.20144 • Published Oct 31, 2023 • 3
Dynamic Word Embeddings for Evolving Semantic Discovery

Paper • 1703.00607 • Published Mar 2, 2017 • 1

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Paper • 2310.05737 • Published Oct 9, 2023 • 4
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models

Paper • 2308.16692 • Published Aug 31, 2023 • 1
Towards General Text Embeddings with Multi-stage Contrastive Learning

Paper • 2308.03281 • Published Aug 7, 2023 • 1
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings

Paper • 2305.11554 • Published May 19, 2023 • 2

eg: Text-to-images for mac

Nomic Embed: Training a Reproducible Long Context Text Embedder

Paper • 2402.01613 • Published Feb 2, 2024 • 14

long context embedding

Nomic Embed: Training a Reproducible Long Context Text Embedder

Paper • 2402.01613 • Published Feb 2, 2024 • 14

Representation Learning

Text and Code Embeddings by Contrastive Pre-Training

Paper • 2201.10005 • Published Jan 24, 2022
Towards General Text Embeddings with Multi-stage Contrastive Learning

Paper • 2308.03281 • Published Aug 7, 2023 • 1
Nomic Embed: Training a Reproducible Long Context Text Embedder

Paper • 2402.01613 • Published Feb 2, 2024 • 14
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training

Paper • 2405.06932 • Published May 11, 2024 • 16

Open Source Long Context Text Embedders

nomic-ai/nomic-embed-text-v1

Sentence Similarity • Updated Sep 26, 2024 • 288k • 478
nomic-ai/nomic-embed-text-v1.5

Sentence Similarity • Updated Nov 18, 2024 • 1.32M • 477
nomic-ai/nomic-embed-text-v1-unsupervised

Sentence Similarity • Updated Aug 2, 2024 • 574 • 13
nomic-ai/nomic-embed-text-v1-ablated

Sentence Similarity • Updated Aug 2, 2024 • 673 • 4

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 145
ReFT: Reasoning with Reinforced Fine-Tuning

Paper • 2401.08967 • Published Jan 17, 2024 • 29
Tuning Language Models by Proxy

Paper • 2401.08565 • Published Jan 16, 2024 • 21
TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 66

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4, 2024 • 90
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 45
SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26, 2024 • 69
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29, 2024 • 48

facebook/seamless-m4t-v2-large

Automatic Speech Recognition • Updated Jan 4, 2024 • 36.7k • 720
openai/whisper-large-v3

Automatic Speech Recognition • Updated Aug 12, 2024 • 4.16M • 3.9k
Nomic Embed: Training a Reproducible Long Context Text Embedder

Paper • 2402.01613 • Published Feb 2, 2024 • 14
