Ken Tsui PRO

kenhktsui

AI & ML interests

ML Engineer Lead. Researcher on Small Language Model - Building Classifiers to Find High Quality Data/ Reasoning Benchmark/ Synthetic Data

Recent Activity

liked a model 5 days ago

Qwen/Qwen2.5-Math-PRM-7B

published an article 6 days ago

Embodied AI == Unlimited Training Data

upvoted a paper 10 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

View all activity

Articles

Embodied AI == Unlimited Training Data

6 days ago

• 2

∞🧙🏼‍♂️AnyClassifier - Generating Synthetic Data For Text Classification

Aug 19, 2024

• 8

Low Latency CPU Based Educational Value Classifier With Generic Educational Value

Jun 12, 2024

• 9

Organizations

kenhktsui's activity

liked a model 5 days ago

Qwen/Qwen2.5-Math-PRM-7B

Text Classification • Updated 2 days ago • 2.25k • 41

published an article 6 days ago

Article

Embodied AI == Unlimited Training Data

•

6 days ago

• 2

upvoted a paper 10 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 11 days ago • 232

liked a model 11 days ago

microsoft/phi-4

Text Generation • Updated 11 days ago • 124k • 1.44k

liked a model 19 days ago

Qwen/Qwen2.5-Math-7B-Instruct

Text Generation • Updated Sep 23, 2024 • 31.5k • 45

liked a dataset 19 days ago

tasksource/PRM800K

Preview • Updated May 31, 2023 • 113 • 22

updated a collection 20 days ago

LongTalk

Collection

A Very Long Chain-of-Thought Dataset for Reasoning Model Post-Training • 5 items • Updated 20 days ago

updated a model 20 days ago

kenhktsui/llama3.1-8b-instruct-thinking-sft-merged-gguf

Updated 20 days ago • 41 • 1

liked a model 20 days ago

kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged-gguf

Updated 20 days ago • 91 • 1

updated a model 20 days ago

kenhktsui/llama3.1-8b-instruct-thinking-sft-merged

Text Generation • Updated 20 days ago • 30

liked 3 datasets 20 days ago

updated a collection 20 days ago

LongTalk

Collection

A Very Long Chain-of-Thought Dataset for Reasoning Model Post-Training • 5 items • Updated 20 days ago

updated 2 models 20 days ago

kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged

Text Generation • Updated 20 days ago • 35

kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged-gguf

Updated 20 days ago • 91 • 1

updated a dataset 20 days ago

kenhktsui/longtalk-cot-v0.1

Viewer • Updated 20 days ago • 61.2k • 175 • 11