How should I design a reward function for a NLP problem where two models interoperate?

Asked Apr 16 '20 at 12:29

Active Apr 16 '20 at 14:12

Viewed 103 times

I would like to design a reward function. I am training two models from the first model that classify set of texts (paragraphs and keywords) and I also got some hidden states. The second model is trying to generate keywords for those paragraphs.

I want to use those hidden states from the first model to give rewards for key phrases that are generated from the second model. I want to know how can I implement this reward function since I have never used it before.

edited Apr 16 '20 at 14:12

nbro

42,615
12
119
217

asked Apr 16 '20 at 12:29

No Na

How should I design a reward function for a NLP problem where two models interoperate?

0 Answers0