WalidBouss/proj_only_llava_deepseek_r1_distill_llama3_8b_reasoning_visual_cot_1000_samples_10_epochs Updated 9 days ago
WalidBouss/proj_only_llava_deepseek_r1_distill_llama3_8b_reasoning_visual_cot_1000_samples_10_epochs Updated 9 days ago
WalidBouss/proj_only_llava_deepseek_r1_distill_llama3_8b_reasoning_visual_cot_1000_samples Updated 9 days ago • 5
WalidBouss/proj_only_llava_deepseek_r1_distill_llama3_8b_reasoning_visual_cot_10000_samples Updated 9 days ago • 4
WalidBouss/proj_only_llava_deepseek_r1_distill_llama3_8b_reasoning_visual_cot_10000_samples Updated 9 days ago • 4
WalidBouss/proj_only_llava_deepseek_r1_distill_llama3_8b_reasoning_visual_cot_1000_samples Updated 9 days ago • 5
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 105 items • Updated 23 days ago • 97
view article Article Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages May 24, 2024 • 25
LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity Paper • 2404.03214 • Published Apr 4, 2024 • 2
LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity Paper • 2404.03214 • Published Apr 4, 2024 • 2