tyzhu (Tongyao)

Collections 1

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16, 2024 • 40
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse

Paper • 2409.11242 • Published Sep 17, 2024 • 5
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

Paper • 2409.11136 • Published Sep 17, 2024 • 21
On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 12

models 211

tyzhu/tiny_LLaMA_1b_4k_intramask_eng_thai_mixed_4k_iter-160000-ckpt-step-20000_hf

Updated Sep 11, 2024 • 2

datasets 810

tyzhu/id_cc_pool

Viewer • Updated 12 days ago • 72.5M

tyzhu/proweb

Viewer • Updated 12 days ago • 46.3M • 146

tyzhu/cmmlu_filtered

Updated Oct 7, 2024 • 31

tyzhu/lmind_nq_train6000_eval6489_v1_docidx_v3

Viewer • Updated Jun 4, 2024 • 76.7k • 32

tyzhu/flan_max_300_added

Viewer • Updated Apr 3, 2024 • 1.46M • 34

tyzhu/lmind_nq_train6000_eval6489_v1_doc_qa_v3

Viewer • Updated Mar 31, 2024 • 82.7k • 31

tyzhu/lmind_nq_train6000_eval6489_v1_recite_qa_v3

Viewer • Updated Mar 31, 2024 • 82.7k • 36

tyzhu/lmind_nq_train6000_eval6489_v1_reciteonly_qa_v3

Viewer • Updated Mar 31, 2024 • 71.8k • 29

tyzhu/lmind_nq_train6000_eval6489_v1_reciteonly_qa_v3_v3

Viewer • Updated Mar 31, 2024 • 71.8k • 28

tyzhu/lmind_nq_train6000_eval6489_v1_reciteonly_qa_v2

Viewer • Updated Mar 31, 2024 • 71.8k • 39

Tongyao PRO

AI & ML interests

Organizations

Collections 1

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

On the Diagram of Thought

models 211

tyzhu/llama3.2_3b_8k_intramask_cc_8k_iter-400000-ckpt-step-100000_hf

tyzhu/tiny_LLaMA_1b_16k_intramask_cc_16k_iter-480000-ckpt-step-60000_hf

tyzhu/tiny_LLaMA_1b_16k_intramask_cc_16k_iter-320000-ckpt-step-40000_hf

tyzhu/tiny_LLaMA_3b_8k_dm8_cc_8k

tyzhu/tiny_LLaMA_3b_8k_cc_merged_v2_8k

tyzhu/tiny_LLaMA_3b_2k_cc_merged_v2_2k

tyzhu/tiny_LLaMA_3b_2k_intramask_cc_2k

tyzhu/tiny_LLaMA_3b_8k_cc_8k

tyzhu/tiny_LLaMA_3b_2k_cc_2k

tyzhu/tiny_LLaMA_1b_4k_intramask_eng_thai_mixed_4k_iter-160000-ckpt-step-20000_hf

datasets 810

tyzhu/id_cc_pool

tyzhu/proweb

tyzhu/cmmlu_filtered

tyzhu/lmind_nq_train6000_eval6489_v1_docidx_v3

tyzhu/flan_max_300_added

tyzhu/lmind_nq_train6000_eval6489_v1_doc_qa_v3

tyzhu/lmind_nq_train6000_eval6489_v1_recite_qa_v3

tyzhu/lmind_nq_train6000_eval6489_v1_reciteonly_qa_v3

tyzhu/lmind_nq_train6000_eval6489_v1_reciteonly_qa_v3_v3

tyzhu/lmind_nq_train6000_eval6489_v1_reciteonly_qa_v2

Tongyao PRO

AI & ML interests

Organizations

Collections 1

models 211 Sort: Recently updated

datasets 810 Sort: Recently updated

models 211

datasets 810