Understanding the RL part of the source code
#12
by
smathieu
- opened
Hello!
I'm reading the source code with the paper and I do not see the part that implements the C-RLFT into the loss optimization. Sorry if it's obvious, I'm a beginner in reinforcement learning!