cl-reviewing

non-profit

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

orionweller authored a paper 16 days ago

NevIR: Negation in Neural Information Retrieval

orionweller authored a paper 16 days ago

Learning from Task Descriptions

orionweller authored a paper 16 days ago

MegaWika: Millions of reports and their sources across 50 diverse languages

View all activity

cl-reviewing's activity

orionweller

authored 8 papers 16 days ago

CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation

Paper • 2406.17186 • Published Jun 24, 2024 • 1

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

Paper • 2409.11136 • Published Sep 17, 2024 • 21

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 17 days ago • 116

yanaiela

authored a paper 2 months ago

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback

Paper • 2410.19133 • Published Oct 24, 2024 • 11

yanaiela

authored 5 papers 4 months ago

Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection

Paper • 2004.07667 • Published Apr 16, 2020

Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation

Paper • 2305.16938 • Published May 26, 2023

Text-based NP Enrichment

Paper • 2109.12085 • Published Sep 24, 2021

A Survey on Data Selection for Language Models

Paper • 2402.16827 • Published Feb 26, 2024 • 4

Lexical Generalization Improves with Larger Models and Longer Training

Paper • 2210.12673 • Published Oct 23, 2022

yanaiela

authored a paper 5 months ago

Data Contamination Report from the 2024 CONDA Shared Task

Paper • 2407.21530 • Published Jul 31, 2024 • 10

orionweller

authored a paper 9 months ago

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Paper • 2403.15246 • Published Mar 22, 2024 • 9

yanaiela

authored 2 papers 11 months ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 61

OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 82

yanaiela

authored 2 papers about 1 year ago

Paloma: A Benchmark for Evaluating Language Model Fit

Paper • 2312.10523 • Published Dec 16, 2023 • 12

What's In My Big Data?

Paper • 2310.20707 • Published Oct 31, 2023 • 10

AI & ML interests

Recent Activity

Team members 2

cl-reviewing's activity