How do I choose the token when predicting?

#5
by cungnlp - opened

Can someone tell me whether, when predicting, I should take the second token in the output? As far as I know, a Transformer produces an output of the same length as its input (block_size). But with Qwen2-Audio, I think the first half of the tokens represents the audio? Any help would be appreciated. Thank you very much.
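To make the question concrete, here is a minimal sketch of how next-token selection generally works in a decoder-only (causal) language model. It assumes greedy decoding and uses a toy logits matrix in plain Python; it is not Qwen2-Audio's actual code. The helper `pick_next_token` and the sizes `num_audio`, `num_text`, and `vocab_size` are hypothetical names chosen for illustration. The point it shows: the model returns one row of logits per input position, row t scores the token at position t+1, and only the last row is read during generation, no matter how many earlier positions hold audio features.

```python
def pick_next_token(logits):
    """Greedy next-token choice for a causal LM.

    logits: list of seq_len rows, each a list of vocab_size scores.
    Only the LAST row is used, because row t scores the token at t+1.
    """
    last_row = logits[-1]
    return max(range(len(last_row)), key=lambda i: last_row[i])


# Hypothetical toy sizes (assumptions, not Qwen2-Audio's real values):
# 4 positions filled by audio features, followed by 3 text positions.
num_audio, num_text, vocab_size = 4, 3, 10
seq_len = num_audio + num_text

# One row of logits per input position, so the output length equals seq_len.
logits = [[0.0] * vocab_size for _ in range(seq_len)]

# Give the rows at audio positions some scores; they are ignored when decoding.
for t in range(num_audio):
    logits[t][t] = 9.0

# Make token id 7 the most likely continuation at the final position.
logits[-1][7] = 5.0

next_token = pick_next_token(logits)
print(len(logits), next_token)  # prints: 7 7
```

Under these assumptions, the rows at the audio positions never need to be read for prediction; the new token is appended to the sequence and the same step repeats.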
