Fleer, Sascha: Scaffolding for learning from reinforcement: Improving interaction learning. 2020
Contents
Abstract
Acknowledgement
Declaration
Contents
1 Introduction
1.1 Outline
I The toolbox
2 Reinforcement learning — a paradigm of human-inspired artificial intelligence
2.1 The basic description of a reinforcement learning problem
2.2 The Bellman equation
2.3 Optimal value functions
2.4 Q-learning using linear function approximators
2.4.1 Approximating the state-action space for a set of discrete actions
2.5 Policy gradient methods
2.5.1 The REINFORCE algorithm
2.6 Going deeper with neural networks
2.6.1 Deep Q-learning
2.6.2 Asynchronous models
2.7 Summary
3 The guiding principle of scaffolding
3.1 The concept of scaffolding in educational psychology
3.1.1 Learning how to ride a bicycle: an example of scaffolded learning
3.2 Teaching devices: employing computer-based tools for scaffolding the learning process of humans
3.2.1 Scaffolding the learning of a foreign language with the help of a computer-based tool
3.3 Scaffolding artificial agents by organizing learning on a meta-level
3.3.1 Recruiting and maintaining the learner's attention
3.3.2 Simplifying the task
3.3.3 Modelling and demonstration
3.3.4 Ongoing diagnosis and assessment
3.3.5 Fading support and eventual transfer of responsibility
3.3.6 Summary
3.4 Reformulating scaffolding as a principle for guiding the learning process of machines
3.4.1 Scaffolding in practice: inject meta-knowledge by compiling individual auxiliaries
3.5 A research map for scaffolding in machine learning
3.5.1 Four research questions for scaffolding an artificial agent
3.6 Summary
II Scaffolding: a universal approach for fostering the learning process
4 Scaffolding attention control by exploiting ``perceptive acting''
4.1 The concept of entropy and mutual information in the context of reinforcement learning
4.1.1 Exploiting mutual information as a ranking criterion for action sets
4.2 Applying the concept to complex environments
4.2.1 Estimating the probability distribution of state transitions
4.2.2 Estimating the entropy & mutual information
4.3 Summary
5 Scaffolding attention control by exploiting ``active visual perception''
5.1 The recurrent attention asynchronous advantage actor-critic model
5.1.1 Training
5.2 Summary
6 Scaffolding the learning of efficient haptic exploration using ``active haptic perception''
6.1 From human haptic perception to robotics
6.2 The haptic attention model
6.2.1 Training
6.3 Summary
7 Scaffolding the agent's internal representation through skill transfer
7.1 The combination of a structured curriculum with transfer learning — 4 strategies of skill transfer
7.1.1 Strategy 1 & 2: simple techniques for skill transfer
7.1.2 Strategy 3 & 4: refining skill transfer by analysing the learner's way of perception
7.2 Summary
III Facilitating the learning process of interaction problems: testing the proposed scaffolding approaches
8 A learning domain for mediated interaction
8.1 The general design concept of the simulation world
8.2 Realization of a 2D simulation world with simplified physics
8.3 Perceiving & acting: defining a suitable state and action space for multi-object interaction scenarios
8.3.1 A general set of discrete actions
8.3.2 Perceiving the environment
8.4 Designing suitable learning scenarios
8.5 Learning with a distance-related sensory input
8.5.1 The construction of a linear Q-learner
8.5.2 Creating a deep Q-learner
8.6 Summary
9 A first scaffold for learning the ``Extension-of-Reach Scenario'': determining the best action set
9.1 Experiments
9.2 Results
9.3 Discussion
10 A second scaffold for learning the ``Extension-of-Reach Scenario'': structuring the learning process
10.1 Experiments
10.1.1 Learning and evaluation
10.1.2 Applying the four transfer learning strategies
10.2 Results
10.3 Discussion
11 Scaffolding the learning process through ``active visual perception'': an attention-based approach
11.1 Experiments
11.2 Results
11.3 Discussion
12 A scaffold for enabling ``active haptic perception'': learning efficient haptic exploration
12.1 Designing the simulation world
12.1.1 The three building blocks of the simulation world
12.1.2 Implementing essential control primitives
12.1.3 The classification task
12.1.4 Creation of the dataset
12.2 Implementation of the haptic attention model
12.3 Experiments
12.4 Results
12.5 Discussion
IV Conclusion
13 Summary, conclusion & outlook
13.1 Four scaffolding approaches — a summary
13.2 Conclusion
13.3 Recommendations for future research
Bibliography
Appendices
A Pseudocode
B Used learning parameters
B.1 Linear Q-learning
B.1.1 State representations
B.1.2 Learning the ``Extension-of-Reach Scenario'' using different coordinate systems
B.2 Deep Q-learning
B.3 Recurrent attention advantage actor-critic model
B.4 Haptic attention model
C Floating Myrmex sensor: experimental results
D Supplementary material