What are these models?
#1
by
Nap
- opened
Any idea what these LoRAs are for?
Improving a certain indicator through reinforcement learning
can you elaborate on the type of indicator, as well as method of reinforcement? Thank you for your research and work :)
Improving a certain indicator through reinforcement learning
一看代码风格,等号怎么都是对齐的??!再一看issue,活捉b导😁