Unchun Yang's picture

Unchun Yang

ucyang

·

https://ucyang.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

liked a dataset 2 days ago

cognitivecomputations/WizardLM_alpaca_evol_instruct_70k_unfiltered

upvoted a paper 3 days ago

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

View all activity

Organizations

ucyang's activity

upvoted a paper about 20 hours ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 75

upvoted a paper 3 days ago

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Paper • 2309.12284 • Published Sep 21, 2023 • 19

upvoted a paper 4 days ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 42

upvoted a collection 7 days ago

Skywork-o1-Open

Skywork o1 open model collections • 3 items • Updated Nov 27, 2024 • 18

upvoted a paper 8 days ago

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 37

upvoted a collection 8 days ago

DeepSeek-Prover

DeepSeek-V1-and-V1.5-Series • 7 items • Updated Aug 16, 2024 • 20

upvoted a paper 9 days ago

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published 21 days ago • 11

upvoted 5 collections 9 days ago

DeepSeek-VL2

4 items • Updated 17 days ago • 34

DeepSeek-V3

2 items • Updated 9 days ago • 91

Gukbap-Series-LLM

General Korean LLM • 4 items • Updated Oct 25, 2024 • 2

Granite 3.1 Language Models

A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 17 days ago • 45

AI PC: Text Generation

Text generation LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. • 186 items • Updated Aug 28, 2024 • 4

upvoted 2 collections 10 days ago

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 10 items • Updated 24 days ago • 50

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated 3 days ago • 33

upvoted a paper 13 days ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 12

upvoted an article 13 days ago

Article

Selective fine-tuning of Language Models with Spectrum

By

•

Sep 3, 2024

• 30

upvoted a paper 13 days ago

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Paper • 2407.21787 • Published Jul 31, 2024 • 12

upvoted a paper 14 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 28 days ago • 123

upvoted 2 collections 14 days ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated 4 days ago • 78

Stable Diffusion 3.5

6 items • Updated Oct 29, 2024 • 118