Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models • Paper • arXiv:2406.04271 • Published Jun 6, 2024
Preference Datasets for DPO • Collection • Curated preference datasets for DPO fine-tuning aimed at intent alignment of LLMs • 7 items
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation • Paper • arXiv:2312.14187 • Published Dec 20, 2023
Gemini: A Family of Highly Capable Multimodal Models • Paper • arXiv:2312.11805 • Published Dec 19, 2023
Orca 2: Teaching Small Language Models How to Reason • Paper • arXiv:2311.11045 • Published Nov 18, 2023
Nemotron 3 8B • Collection • The Nemotron 3 8B family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items
Eureka: Human-Level Reward Design via Coding Large Language Models • Paper • arXiv:2310.12931 • Published Oct 19, 2023
In-Context Pretraining: Language Modeling Beyond Document Boundaries • Paper • arXiv:2310.10638 • Published Oct 16, 2023
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models • Paper • arXiv:2309.12307 • Published Sep 21, 2023
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis • Paper • arXiv:2307.12856 • Published Jul 24, 2023