daiwei chen
daiweichen
AI & ML interests
representation learning, foundation models, preference learning
Recent Activity
liked a dataset, about 1 month ago: HannahRoseKirk/prism-alignment
liked a model, about 1 month ago: google/gemma-2-2b-it
new activity, 2 months ago: meta-llama/Llama-3.2-1B: Attention doesn't work for all layers except for the first layer
daiweichen's activity
Attention doesn't work for all layers except for the first layer
#79 opened 2 months ago by daiweichen