-
Jamba: A Hybrid Transformer-Mamba Language Model
Paper • 2403.19887 • Published • 109 -
sDPO: Don't Use Your Data All at Once
Paper • 2403.19270 • Published • 41 -
ViTAR: Vision Transformer with Any Resolution
Paper • 2403.18361 • Published • 55 -
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Paper • 2403.18814 • Published • 47
Phuong Pham
mp1704
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Pensez: Less Data, Better Reasoning -- Rethinking French LLM
liked
a model
3 days ago
sesame/csm-1b
liked
a dataset
4 days ago
glaiveai/reasoning-v1-20m
Organizations
Collections
1
models
15
mp1704/tora_7b_sft_ckpt_200
Text Generation
•
Updated
•
6
mp1704/tora_7b_pt
Text Generation
•
Updated
•
5
mp1704/gpt-neo-sft-v2.1
Text Generation
•
Updated
•
11
mp1704/gpt-neo-sft-v2
Text Generation
•
Updated
•
7
mp1704/gpt-neo-sft
Text Generation
•
Updated
•
5
mp1704/gpt-neo-pt
Text Generation
•
Updated
•
6
mp1704/gemma_2b_sft
Text Generation
•
Updated
•
6
mp1704/gemma_2b_pt
Text Generation
•
Updated
•
5
mp1704/qwen_1.8b_sft_full_3
Text Generation
•
Updated
•
15
mp1704/qwen_1.8b_sft_full_2
Feature Extraction
•
Updated
•
6