Interleaved text and images at inference time

#62

by pbarker - opened Dec 9, 2024

The training scripts make it very clear how to train on interleaved images and text by adding the <image> token. However, it's not clear how to do this at inference time.

pbarker changed discussion status to closed Dec 11, 2024
