Distillation-Guided Policy Optimization for Preserving Agentic RAG Capabilities
AI & ML interests
AI, ML, CV, NLP, Robotics
Recent Activity
View all activity
models 11
omron-sinicx/DGPO-qwen2.5-0.5b
Text Generation • 0.6B • Updated
omron-sinicx/SearchR1-ppo-qwen2.5-3b-instruct
3B • Updated • 58
omron-sinicx/Qwen2.5-0.5B-Instruct-sft
0.5B • Updated • 33
omron-sinicx/Qwen2.5-0.5B-Instruct-kd
0.5B • Updated • 191
omron-sinicx/SearchR1-ppo-qwen2.5-7b-instruct
8B • Updated
omron-sinicx/SearchR1-ppo-llama3.1-8b-instruct
8B • Updated
omron-sinicx/Llama-3.2-1B-Instruct-kd
1B • Updated
omron-sinicx/sbsfigures-chartqa-pix2struct
Updated • 1
omron-sinicx/sbsfigures-chartqa-donut
Updated • 1
omron-sinicx/sbsfigures-pretrain-donut
Updated
datasets 12
omron-sinicx/scipostlayouttree
Updated • 22
omron-sinicx/MaterialFigBench
Viewer • Updated • 115 • 55
omron-sinicx/MaterialBENCH
Updated • 26
omron-sinicx/AlignBench
Viewer • Updated • 102k • 1.96k • 1
omron-sinicx/sbsfigures
Viewer • Updated • 4.17M • 562 • 4
omron-sinicx/scipostgen
Preview • Updated • 8
omron-sinicx/PMC-Vid
Preview • Updated • 20
omron-sinicx/synthetic_language
Preview • Updated • 7
omron-sinicx/wiki2023_plus
Viewer • Updated • 7.4k • 77 • 1
omron-sinicx/scipostlayout_v2
Preview • Updated • 20 • 7