mengfanxu
fxmeng
AI & ML interests
None yet
Recent Activity
commented on
a paper
about 2 months ago
TransMLA: Multi-head Latent Attention Is All You Need
commented on
a paper
about 2 months ago
TransMLA: Multi-head Latent Attention Is All You Need
updated
a collection
about 2 months ago
CLOVER-Commonsense-148k
Organizations
None yet
Collections
8
models
55
fxmeng/PiSSA-llama-7b-commonsense-148k
Updated
•
11
fxmeng/PiSSA-Llama-3-8b-commonsense-148k
Updated
•
10
fxmeng/PiSSA-Llama-2-7b-commonsense-148k
Updated
•
11
fxmeng/PiSSA-llama-13b-commonsense-148k
Updated
•
13
fxmeng/CLOVER-llama-3-8b-commonsense-148k
Updated
•
9
fxmeng/CLOVER-llama-2-7b-commonsense-148k
Updated
•
8
fxmeng/CLOVER-llama-13b-commonsense-148k
Updated
•
8
fxmeng/CLOVER-llama-7b-commonsense-148k
Updated
•
7
fxmeng/TransMLA_qwen2.5_0.5b_instruct
Updated
fxmeng/TransMLA_llama3.2_1b_instruct
Updated
datasets
9
fxmeng/pissa-dataset
Viewer
•
Updated
•
844k
•
1.36k
•
3
fxmeng/big-bench-hard-continue-finetuning
Viewer
•
Updated
•
10.3k
•
117
fxmeng/commonsense_filtered
Viewer
•
Updated
•
170k
•
397
•
1
fxmeng/MetaMath-GSM240K
Viewer
•
Updated
•
240k
•
67
•
1
fxmeng/MetaMath-MATH155K
Viewer
•
Updated
•
155k
•
42
fxmeng/CodeFeedback-Python105K
Viewer
•
Updated
•
105k
•
374
•
6
fxmeng/llava_finetune_336x336
Preview
•
Updated
•
37
fxmeng/llava_pretrain_336x336
Preview
•
Updated
•
30
fxmeng/WizardLM_evol_instruct_V2_143k
Viewer
•
Updated
•
143k
•
55
•
2