4

I read about the different model series in GPT3.5 here - https://platform.openai.com/docs/models/gpt-3-5

At the beginning of the page, it mentions to look at https://platform.openai.com/docs/model-index-for-researchers to understand the difference between model series InstructGPT and GPT3.5.

But, on that page, it says InstructGPT is a part of the GPT3.5 series. So, what's the difference between GPT3.5 and InstructGPT?

nbro
  • 42,615
  • 12
  • 119
  • 217
Arya
  • 41
  • 2

1 Answers1

2

InstructGPTs are fine tuned GPT3.5 models. They have been tuned with different techniques to be good at following instructions and chatting. Examples of fine-tuning techniques are 1. Supervised fine-tuning on human demonstrations or 2. RLHF using PPO to maximise a reward model trained from comparisons by humans.

Rexcirus
  • 1,309
  • 9
  • 22