What are these models?

#1
by Nap - opened

Any idea what these LoRAs are for?

Alibaba-PAI org

Improving a certain indicator through reinforcement learning

can you elaborate on the type of indicator, as well as method of reinforcement? Thank you for your research and work :)

Alibaba-PAI org

@Nap @SynysterSocks Thanks for your interest! We've updated README.

Improving a certain indicator through reinforcement learning

一看代码风格,等号怎么都是对齐的??!再一看issue,活捉b导😁

Sign up or log in to comment