File size: 1,033 Bytes
4912bd3
 
22cef4c
4912bd3
 
22cef4c
 
 
4912bd3
 
22cef4c
 
4912bd3
 
c9d2023
4912bd3
c9d2023
d69db16
c9d2023
4912bd3
c9d2023
4912bd3
 
 
 
c9d2023
4912bd3
c9d2023
d69db16
 
4912bd3
c9d2023
d69db16
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
---
language:
- en
thumbnail: null
tags:
- image-classification
- multimodal
- tensorflow
license: mit
datasets:
- flex_civ_vi_screens
pipeline_tag: audio-text-to-text
library_name: tensorflow
---

# Multimodal Classification Model (BM-v1)

This model combines text and image inputs to predict player moves from in-game screenshots for the popular 4X Civilization VI. In use, screenshot inputs are provided and text inputs generated using an LLM.

## Model Details

- **Developed by:** BeakerStreet
- **Model type:** Multimodal Classification Model
- **Language(s):** English
- **License:** MIT

## Uses

Predicts the likely moves a player will make from a complete sample space of all (observed) player moves, based on a provided screenshot and associated text. Can be fine-tuned to specifically predict types of move (scouting, build orders, settle/doesn't settle)

### Direct Use

Predicts the likely moves a player will make, from a complete sample space of all player moves, based on a provided screenshot and associated text.