참고
[1] Hong et al., “Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement Learning”, ICLR, 2022
[2] Wang et al., “NerveNet: Learning Structured Policy with Graph Neural Networks”, ICLR, 2018
[3] Huang et al., “One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control”, ICML, 2020
[4] Li et al., “Deeper Insights into Graph Convolutional Networks for Semi-Supervised Learning”, AAAI, 2018
[5] Kurin et al., “My Body is a Cage: The Role of Morphology in Graph-based Incompatible Control”, ICLR, 2021
[6] Haarnoja et al., “Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor”, ICML, 2018
[7] https://wenlong.page/modular-rl/