[PDF] Top 20 Meta-level reasoning in reinforcement learning

Meta-level reasoning in reinforcement learning

... Reinforcement learning is a technique often used to gener- ate an optimal (or near-optimal) agent in a stochastic environ- ment in the absence of knowledge about the reward function of this ... See full document

61

The association between cognition and academic performance in Ugandan children surviving malaria with neurological involvement.

... performance. In the first univariate regression, each of the three academic performance outcomes (reading, spelling, and arithmetic) was regressed on each of the five cognitive predictors (working memory, ... See full document

7

Moral reasoning in knowledge authoring: An e-learning 4.0 analysis!

... that in order to accomplish a responsible utilization of learning technology, participants need a considerable degree of education concerning social and ethical ...update in order to respond promptly ... See full document

34

A topological reinforcement learning agent for navigation

... a reinforcement learning procedure for mobile robot navigation using a latent- like learning ...Latent learning refers to learning that occurs in the absence of ... See full document

17

On the Approximate Period Problem

... of learning of motor primitives is well ...primitives. In particular, robots need to generalize motor primitives to a different behavior by trial and error without re-learning the ...task. In ... See full document

9

Reasoning in Reference Games: Individual- vs. Population-Level Probabilistic Modeling.

... Of the 42 filler trials, 24 used the displays from the implicature conditions but the target was a) the competitor from the simple condition (six trials), b) the distractor from the simple condition (six trials), or c) ... See full document

25

Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.

... of reinforcement learning, called aspiration learning, phenomenologically behave as conditional ...aspiration level. They rein- force actions that have resulted in satisfactory outcomes and ... See full document

13

Adaptive value-at-risk policy optimization: a deep reinforcement learning approach for minimizing the capital charge

... model-free reinforcement learning framework, is comprised of three components, the state and action spaces, and the reward ...cost, in the shape of the applicable multiplier, a direct function of the ... See full document

82

Reasoning, learning, and creativity: frontal lobe function and human decision-making.

... Consistently, in both experiments, exploring participants behaved without retrieving previously learned stimulus-response ...faster in control than transfer episodes (Figure 7), indicating that unlike the ... See full document

16

Cholinergic pairing with visual activation results in long-term enhancement of visual evoked potentials.

... factor in experience-dependent plasticity allowing cholinergic enhanced stimuli to take over stimuli not associated with cholinergic reinforcement and modifying both cortical processing and representation ... See full document

8

Influence zones: a strategy to enhance reinforcement learning

... a level of signiﬁcance which algorithms are similar, and which ones have the smallest trajectories, a multiple comparison test is ...necessary. In this case, the Tukey’s signiﬁcant-difference (TSD) test ... See full document

14

Reinforcement learning using a continuous time actor-critic framework with spiking neurons.

... agents learning to avoid the immediate hazard of running into the edges of the ...slower learning, as the agents learn to swing and control the pole better and ...long learning process we resorted to ... See full document

21

Robustness analysis of deteriorating reinforced concrete slabs

... as reinforcement area reduction, concrete cracking, deterioration due to expansion around reinforcement bars and bond strength deterioration between reinforcement and ...and reinforcement ... See full document

8

An Advancement To The Security Level Through Galois Field In The Existing Password Based Technique Of Hiding Classified Information In Images

... M, in different positions as shown in ...saved in position 1, else it is saved in position ...bit in a specific order. For example, the first 10 bits are hidden in position 1 or ... See full document

6

An Analysis Of The Difference In Gender Level Of Cassava Production And Access To Land In Abia State Nigeria

... households in comparison with other ...opportunities in the rural ...face in accessing them are rarely fully ...farmers in terms of asset holding, welfare and credit ...production in ... See full document

5

Learning frequent behaviours patterns in intelligent environments for attentiveness level

... located in the scalp catch the brain waves and with the data acquired, it is possible to analyse the brain activity during a ...task. In many studies, the most important component is MMN (mismatch ... See full document

76

ACO in e-Learning: Towards an adaptive learning path

... traditional learning systems follows “one size fits all” approach ...adaptive learning provides an alternative to the traditional approach, where learning objects can be provided dynamically as per ... See full document

5

Liderança Digital: a aprendizagem e os processos de informação e comunicação na sala de aula

... No contexto dos 4 pilares de educação (Delors, 1996) e das 7 competências para o século XXI (Morin, 1999), a escola necessita de educadores que se adaptam, que comunicam, que questionam e que se colocam a si próprios ... See full document

21

Using Recommendation System for E-learning Environments at degree level

... Our investigation shows the problem of the information overload is also present in distance educational environments. The obtained results show most f the users are not willing or can‘t do all of the practises the ... See full document

4

Incontinentia pigmenti: learning disabilities are a fundamental hallmark of the disease.

... heterogeneity in the cognitive phenotype observed in our patient cohort could be a consequence of the IKBKG/NEMO mutation that might produce different phenotypic outcomes also in mental ...of ... See full document

7