What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
liked a model 4 days ago: google/gemma-3-27b-it
liked a model 5 days ago: Qwen/Qwen2.5-VL-7B-Instruct
authored a paper 21 days ago: Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Collections: 2
Models: 18
JackBAI/aitw-general-digiq-agent
JackBAI/aitw-webshop-digiq-agent
JackBAI/llava-v1.5-7b-sfted-pad-inputtext
JackBAI/CRATE-GPT-12L-Pile-600000steps
JackBAI/webshop-off2on-filteredbc • 1
JackBAI/general-off2on-filteredbc
JackBAI/general-off2on-digirl • 1
JackBAI/webshop-off2on-digirl • 2
JackBAI/crate-3l-l0-sae-1x
JackBAI/crate-1l-l0-sae-1x
Datasets: 6
JackBAI/autoui-zeroshot-trajectories • 96
JackBAI/pile_uncopyrighted_bin • 6
JackBAI/bert_pretrain_datasets • 80.5M • 1.58k • 1
JackBAI/redbajama-sampled • 24.3M • 5.09k
JackBAI/merged_roberta_dataset • 35
JackBAI/chatgpt-woi-finetune • 89 • 3