Chen Cui
cuichenx
AI & ML interests
None yet
Recent Activity
Updated a model 5 days ago: deepseek-ai/DeepSeek-V3
New activity 5 days ago: deepseek-ai/DeepSeek-V3: `aux_loss_alpha` should be 1e-4 instead of 1e-3?
Updated a model 5 days ago: deepseek-ai/DeepSeek-V3-Base
cuichenx's activity
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#61 opened 5 days ago by cuichenx
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#60 opened 5 days ago by cuichenx