|
# See and Think: Embodied Agent in Virtual Environment |
|
|
|
Zhonghan Zhao<sup>1\*</sup> , Wenhao Chai<sup>\*2❤</sup>, Xuan Wang<sup>1\*</sup>, Li Boyi<sup>1</sup>, Shengyu Hao<sup>1</sup>, Shidong Cao<sup>1</sup>, Tian Ye<sup>3</sup>, Jenq-Neng Hwang<sup>2</sup>, Gaoang Wang<sup>1✉</sup> |
|
|
|
<sup>1</sup> Zhejiang University <sup>2</sup> University of Washington <sup>3</sup> Hong Kong University of Science and Technology (GZ) |
|
|
|
<sup>*</sup>Equal contribution <sup>❤</sup>Project lead <sup>✉</sup>Corresponding author |
|
|
|
|
|
|
|
![STEVE, named after the protagonist of the game Minecraft, is our proposed framework aims to build an embodied agent based on the vision model and LLMs within an open world.](https://rese1f.github.io/STEVE/static/images/teaser.png) |
|
|
|
STEVE, named after the protagonist of the game Minecraft, is our proposed framework aims to build an embodied agent based on the vision model and LLMs within an open world. |
|
|
|
Link: [See and Think: Embodied Agent in Virtual Environment](https://rese1f.github.io/STEVE/) |