I've been researching different frameworks for hierarchical RL (mainly options, HAMs, and MAXQ) and noticed that both options and HAMs have names that relate to how they function. I can't seem to find anything stating how MAXQ got its name and I was wondering if anyone knew what the name was referencing.
Asked
Active
Viewed 53 times