- PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
  Paper • 2410.05265 • Published • 30
- MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents
  Paper • 2410.03450 • Published • 36
- MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
  Paper • 2410.08196 • Published • 46
- Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
  Paper • 2410.07303 • Published • 18
XXSg559
AI & ML interests
None yet
Recent Activity
upvoted an article 3 days ago
Our Transformers Code Agent beats the GAIA benchmark!
updated a model 9 days ago
XXSg559/Qwen2.5-1.5B-Instruct-thinking-function_calling-V0
published a model 9 days ago
XXSg559/Qwen2.5-1.5B-Instruct-thinking-function_calling-V0
Organizations
None yet
Collections 1
Models 6

XXSg559/Qwen2.5-1.5B-Instruct-thinking-function_calling-V0
XXSg559/q-Taxi-v3 • Reinforcement Learning
XXSg559/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning
XXSg559/ppo-Huggy • Reinforcement Learning • 64
XXSg559/sft_output
XXSg559/SmolLM2-FT-MyDataset • Text Generation • 11