Learning That Is Spaced out over Time Engages the Brain Differently

What’s the science?

We use feedback from rewards every day to learn new things. For example, if we are offered a mango and we have enjoyed several mangos previously, over time we might learn to favor mangos. Research on the neural underpinnings of this form of reward-based learning typically focuses on short-term learning (across several minutes). However, we don’t know what happens when learning occurs over several weeks time, as it might in many everyday situations. Gradual learning from reward feedback relies on a dopaminergic system in the brain, but short-term learning paradigms in typical experiments may also rely on short-term (‘working’) memory systems. This week in the Journal of Neuroscience, Wimmer and colleagues used behaviour and functional magnetic resonance imaging (fMRI) to understand the mechanisms underlying reward learning over a period of several weeks versus within a single session in humans.

How did they do it?

The authors completed two similar studies for replication purposes. In the first study, 33 participants completed a behavioural and fMRI experiment, while in the second study, 31 participants completed a behaviour-only experiment. In both studies, the participants' task was to learn whether the best response to a stimulus (scenes presented on a screen) was either ‘Yes’ or ‘No’ (the wording was arbitrary). The stimuli had been randomly assigned by the experimenter as either reward-associated or loss-associated. Reward-associated stimuli resulted in the participant winning $0.35 for ‘Yes’ (on average) and losing $0.05 for ‘No’. Loss-associated stimuli resulted in the participant losing $0.25 when ‘Yes’ was selected and gaining $0.0 when ‘No’ was selected. Feedback was given after each trial, and feedback was probabilistic, meaning there was an 80% chance that the best response would result in the best outcome/payment. In an initial learning session in the lab, participants learned about 8 'spaced' stimuli. Next, three learning sessions for the ‘spaced’ stimuli were done online (over ~ two weeks). In a lab session about two weeks later, participants learned about 8 new 'massed' stimuli for the same number of times as the previously seen 'spaced' stimuli.

wimmerpic1800.png

After learning, participants also rated whether they thought the stimuli were reward-associated or loss-associated. In the first study, fMRI data were also collected after learning was completed. Finally, three weeks later, participants tried to remember and rate whether each stimulus was reward-associated or loss-associated. This was completed online in the first study and in the lab in the second study.

What did they find?

In both the ‘spaced’ and ‘massed’ conditions, participants learned the best 'Yes' or 'No' response quickly, and performance was equivalent at the end of learning for both the ‘massed’ and ‘spaced’ training sessions. However, three weeks after learning, participants remembered the value of the ‘spaced’ stimuli much better. Additionally, during learning, working memory capacity was associated with learning after participants got used to the task (in the ‘massed’ session). These results indicate that the spacing out of learning sessions results in better long-term memory for whether stimuli are reward-associated or loss-associated, and that working memory is used during shorter learning paradigms (the ‘massed’ paradigm). During fMRI, using a searchlight (multivariate pattern analysis) approach, the authors discovered that there were very different patterns of activity for reward-associated versus loss-associated stimuli for ‘spaced’ stimuli but not ‘massed’ stimuli within the medial temporal cortex and prefrontal cortex. Patterns of brain activity were also different between the ‘spaced’ and ‘massed’ conditions in the striatum, a region of the brain known to be involved in reward learning. These results suggest that the neural mechanisms underlying ‘spaced’ and ‘massed’ learning may be partially different.

What’s the impact?

This study is the first to clearly demonstrate the effect of learning over a period of several weeks versus minutes on the maintenance of learning and the neural underpinnings of learning. The results have implications for studies of disorders that may involve changes in reward learning, such as addiction and mood disorders.

wimmerquote1800.png

Wimmer et al., Reward learning over weeks versus minutes increases the neural representation of value in the human brain. Journal of Neuroscience (2018). Access the original scientific publication here.

The Behaviour and Neurobiology Underlying Leadership Decisions

What's the science?

Leadership is critical within our society. Leaders such as teachers, soldiers, politicians and parents, to name a few, are continuously responsible for making decisions that will affect others. One aspect of leadership is the acceptance of responsibility, as leaders are responsible for others in the choices they make. Despite the fact that leadership is central to human societies, we still don’t understand the neurobiology of leadership and why some choose to lead and others chose to follow. This week in Science, Edelson and colleagues use a decision-making task to examine leadership choices and the brain regions involved.

How did they do it?

They developed a behavioural task to examine leadership preferences. The participants performed 2 tasks: a baseline task and a delegation task. In all tasks, the participant’s payoff was dependent on their choices, and therefore assessed preferences related to risk, loss and ambiguity. In the baseline task, they were required to choose whether they would accept a gamble with varying probabilities of loss or gain over several trials. In some trials the exact probability of gains and losses were shown to the participant (to assess risk), while in some trials these probabilities were not shown (i.e. ambiguous and closer to real life decisions). In the delegation task, participants were required to decide on the same gambles, however, they could choose to lead and make a decision on behalf of their group, or defer and follow the choice of the group members. In this task there were two trials, ‘self’ and ‘group’ trials, where the choice of the leader would directly affect the payoff of him or herself or affect the payoff of the group members. The group members as a whole had more information about the probabilities of an outcome, which mimics a real-life scenario where deciding as a group may be advantageous. The authors analyzed baseline choice data to determine whether preferences for risk, loss or ambiguity were associated with leadership scores (obtained using established scales, as well as real life data) and whether there is a shift in these preferences when making choices that impact others. They then used computational modelling and fMRI analysis to understand the preferences for leadership and the brain activity underlying these choices.

What did they find?

In the ‘group’ trials, participants deferred to the decision of the group more often compared to the ‘self’ trials (17.3% increase in deferral rate), demonstrating an overall preference for avoiding responsibility. Responsibility aversion (or avoidance of responsibility) was correlated with leadership scores, where those with lower responsibility aversion had higher leadership scores. Responsibility aversion was not related to individual preferences/ values related to risk, loss and ambiguity, nor was it related to regret, blame, and guilt in social situations (assessed in a separate experiment). They then used a computational model taking into account the participants preferences to try and model these decisions. From participants preferences they were able to derive subjective values (i.e. difference in value between accepting or rejecting a gamble for each participant) of choices to lead or defer. They found that deferral choices were greater when there was a low subjective value difference (i.e. less discriminability between the value of the two choices). In other words, when the participant was more certain of their choice (high discriminability), they were more likely to take on responsibility. They derived ‘deferral thresholds’ whereby a critical level of certainty decides whether they will defer or not in a gamble. Using computational modelling they found that when taking responsibility for others there was a shift in the deferral threshold, indicating a higher demand for certainty of a choice. These changes in deferral threshold were correlated with leadership scores. The greater the shift in the deferral threshold, the more likely an individual was to defer (i.e. not lead). Since individual values about risk, loss and ambiguity were not related to leadership scores, these results suggest that the key to determining whether one will lead or not lies in the shift in the amount of certainty needed (about their decision) now that they are responsible for others.

                                      Brain, Servier Medical Art, image by BrainPost, CC BY-SA 3.0

                                      Brain, Servier Medical Art, image by BrainPost, CC BY-SA 3.0

Brain activation was higher in the middle temporal gyrus during group trials compared to self-trials, and temporo-parietal junction activity was correlated with the decision to defer (i.e. not lead). Activity in several brain regions including the medial prefrontal cortex (involved in self-reflection) was associated with the subjective value difference while activity in the anterior insula was associated with the probability of choosing to lead. The authors fit a dynamic causal model to these four brain regions and found a relationship between individual differences in activity in this brain network, shift in deferral threshold and leadership scores. Temporal gyrus activity was also associated with a reduced or inhibited influence of the medial prefrontal cortex on anterior insula activity. This inhibitory influence of medial prefrontal cortex on the anterior insula was reduced in leaders, suggesting a potential neural mechanism underlying the shift in deferral threshold and responsibility aversion.

What's the impact?

This is the first study to examine the behaviour and neurobiology underlying leadership preferences. We now know that the majority of individuals show responsibility aversion and that this is a critical driver in the decision to lead. Those who were less likely to lead show a greater difference in their demand for certainty when deciding to take responsibility for others. Further, this study provides important insight into the brain regions involved in leadership choices. These findings have important implications for understanding leadership in society and can help to inform leadership decisions and consequences.

Edelson et al., Computational and neurobiological foundations of leadership decisions. Science (2018). Access the original scientific publication here.

Stress Impairs Episodic Memory Retrieval

What's the science?

Stress can impair episodic memory, meaning memory for specific events. In the brain, the hippocampus and frontoparietal network (a network consisting of the frontal and parietal lobes of the brain) are responsible for this type of memory. When remembering, medial temporal lobe structures are responsible for a sense of familiarity ('I've seen that before'), and the hippocampus is responsible for recalling specific details of events ('I saw that yesterday at the park'). Frontoparietal networks are recruited when recalling the memory is difficult - they guide the recollection by directing attention - such as toward cues to help someone recall. Stress can impair the hippocampus, which can then contribute to poor memory retrieval. But how does stress impair these memory circuits in the brain exactly? This week in Cerebral Cortex, Gagnon and colleagues used functional magnetic resonance imaging (fMRI) to understand memory circuits in the human brain during acute stress.

How did they do it?

47 young healthy male (18-35 years) participants completed the experiment (44 completed fMRI scans). Half of the participants were randomly assigned to a stress group, and half to a control group. On the first day, all participants completed a study session in which they memorized the association between random words and random scenes (192 word-scene pairs). On the next day, participants were assigned to either the stress group or the control group, and shown words they had seen before as part of the pairs, in addition to new words, while completing fMRI scanning. Participants were asked to report whether the word was old (seen before) or new, and if it was recalled as old, how sure they were about whether the scene paired with the word was indoor or outdoor. At the beginning of each run (6 runs totalling 252 trials), participants in the stress group were told the run would be a shock run (an orange border appeared on the screen during the run) or a safety run (a teal border appeared on the screen during the run). During the shock runs, participants in the stress group received an electric shock (previously calibrated for each individual to be moderately painful) via electrodes on their ankle (1-2 shocks at some point during the run). Participants in the control group were informed that the run would be an 'orange run' or a 'teal run', however no shocks were received. The authors hypothesized that stress hormones (released during stress) would act on the brain throughout the entire experiment for the stress group, and that the stress group would experience the additional difficulty of being distracted by the stressful prospect of a shock during the shock runs only. Both groups also rated whether they felt positive (happy, safe) or negative (anxious, stressed) during each run of the experiment. [The study also included cortisol testing and weak and strong levels of memory coding - see paper for details].

What did they find?

Stressed participants reported feeling more negative than control participants during shock runs, and more negative than during their own safe runs. Associative memory (recalling scene as indoor versus outdoor) performance was found to be reduced by stress (in the stress group). However, performance when simply discriminating old words from new words was only marginally affected by stress. Surprisingly, stress due to shock runs (versus safe runs) in the stressed group did not impact associative memory (recalling scene indoor vs. outdoor) performance. These results suggest that recall is influenced generally by exposure to stress (stress group vs. control group), but not by the immediate presence of a distracting threat (shock runs vs. safe runs). The authors examined neural activation across trials to study hippocampal activity. A larger increase in the hippocampal BOLD response (fMRI measure of brain activity) was seen for trials on which recall was correct. The relationship between hippocampal activity and associative memory was also found to be disrupted during stress. In the stress group (but not the control group), reduced activation in the hippocampus was found during correct recollection of old words versus correct rejection of new words. During trials in which participants reported they were certain about whether the scene was indoor or outdoor, there was a relationship between frontoparietal engagement and reaction time in the control group, however, this relationship was disrupted in the stressed group. This result indicates that the control group but not the stressed group is able to flexibly recruit these networks.

hippo1800.png

What's the impact?

This study examines the neural mechanisms underlying changes in memory recollection during acute stress using fMRI. Stress can impair episodic memory recollection, and likely does so by disrupting the relationship between hippocampal activity and recollection. The results have important implications for how stress can impact memory, especially in clinical populations in which neural mechanisms may be altered - for example, in post-traumatic stress disorder.

Gagnon et al., Stress Impairs Episodic Retrieval by Disrupting Hippocampal and Cortical Mechanisms of Remembering. Cerebral Cortex (2018). Access the original scientific publication here.