Unique Cortical-Brainstem Activity Underlies Compulsive Alcohol Drinking

November 26, 2019 by Leigh Christopher

What's the science?

A key feature of alcohol use disorders is compulsive drinking—defined as continued drinking regardless of the resulting negative consequences. While most people drink alcohol at some point during their adult life, less than a third develop an alcohol use disorder. But what makes these individuals more vulnerable to compulsive drinking? Scientists currently have a poor understanding of the individual differences in behavior and neural circuitry that drive compulsion. Previous animal studies suggest that the prefrontal cortex, a brain region involved in planning and coordinating our thoughts and actions, plays a crucial role in compulsive behaviors. Prefrontal cortex activity is different in individuals who have consumed alcohol or who have a family history of alcohol use disorders. This week in Science, Siciliano, and colleagues investigated how individual differences in behavior and neural activity in the prefrontal cortex predict the development of compulsive drinking in mice.

How did they do it?

First, the authors took the mice and exposed them to a “binge-induced compulsive task” (BICT), a conditioning task comprising of three different periods. In the first period, the pre-binge, mice had been conditioned to drink from a bottle containing only alcohol. After three days, increasing amounts of the bitter-tasting quinine was added to the alcohol to act as a punishment—or negative consequence—of drinking. In the subsequent 14-day binge drinking period the mice had unlimited access to water and alcohol at certain times. Finally, the post-binge period ran similarly to the pre-binge period, where the mice were presented with alcohol alone for the first three days followed by the alcohol-quinine mix for the next four. Mice were sorted into groups based on their drinking behavior during the post-binge period. Second, the authors compared drinking behavior in the pre-binge period between the newly identified groups. Third, they used cellular-resolution calcium imaging as a proxy for neuronal activity during the BICT to examine whether the activity of the neural connections between the medial prefrontal cortex and the dorsal periaqueductal grey contributed to susceptibility of developing compulsive drinking behaviors. Fourth, they used two different light-sensitive proteins and optic fibers to determine whether mimicking endogenous neuronal activity in this cortical-brainstem pathway could alter drinking behavior. One of the light-sensitive proteins—halorhodopsin—can inhibit cellular activity, while the other light-sensitive protein—channelrhodopsin-2—helps activate cells.

What did they find?

Three groups of mice were identified based on post-binge period drinking behavior: low drinkers (low alcohol intake regardless of if quinine was present or absent), high drinkers (high alcohol intake that ceased when quinine was present), and compulsive drinkers (high alcohol intake even when quinine was present). Second, compulsive drinking mice drank more of the alcohol-quinine mix during the pre-binge drinking period compared to the other two groups. This compulsive drinking behavior was exacerbated after the binge drinking period. Third, the authors observed more inhibitory responses in the neurons connecting the medial prefrontal cortex and the dorsal periaqueductal grey in compulsive drinking mice compared to the low drinking mice. The low drinking mice also exhibited more excitatory neuronal activity between these two brain regions when consuming alcohol. Therefore, the neural response during initial alcohol exposure predicted the future development of compulsive drinking. Finally, they found that inhibiting neuronal activity between the medial prefrontal cortex and the dorsal periaqueductal grey increased quinine intake and that stimulating neuronal activity over the same neurons decreased alcohol intake. The authors concluded that light-induced inhibition prevented punishment signals being sent from the cortex to the brainstem, whereas light-induced stimulation enhanced the punishment.

What's the impact?

This study provides a mechanistic explanation for the individual variance in the susceptibility to compulsive alcohol drinking. These findings are particularly important as this newly discovered cortical-brainstem circuit may help guide efforts in drug discovery to prevent alcohol use disorders. Future research is needed to determine the specific mechanisms underlying the reactivity of this circuit to alcohol.

Siciliano et al. A cortical-brainstem circuit predicts and governs compulsive alcohol drinking. Science (2019). Access the original scientific publication here.

Different Learning Strategies Used During Pavlovian Conditioning

November 26, 2019 by Leigh Christopher

Post by Shireen Parimoo

What's the science?

In Pavlovian conditioning, people form associations between a neutral stimulus (e.g. a bell) and an upcoming unconditioned stimulus (e.g. food). The neutral stimulus later becomes the conditioned stimulus because it elicits the same response as the unconditioned stimulus. People can learn these associations using a value-based or an uncertainty-based strategy. In value-based learning, learning occurs based on the difference between the expected reward and the actual reward received, which is the reward prediction error. In uncertainty-based learning, people learn the probability that a conditioned stimulus will elicit a specific unconditioned stimulus, which generates the state prediction error. There are individual differences in whether people pay more attention to the conditioned stimulus (sign-trackers) or the unconditioned stimulus (goal-trackers). The neural basis of these learning strategies is not yet well understood. This week in Nature Human Behavior, Schad and colleagues used eye-tracking and functional magnetic resonance imaging (fMRI) techniques to investigate the neural substrates of learning strategies used by sign-trackers and goal-trackers.

How did they do it?

Participants were 129 male adults who completed a Pavlovian conditioning task in the fMRI scanner while their eye movements were recorded. They learned associations between visual-auditory cues that predicted monetary reward (appetitive conditioned stimulus; $1, $2), no reward (neutral conditioned stimulus: $0), or loss (aversive conditioned stimulus; -$1, -$2). The authors computed a gaze index to categorize participants as sign-trackers or goal-trackers. The gaze index is the difference between the proportion of fixations made to the unconditioned stimulus and the proportion of fixations made to the conditioned stimulus. A value of 0 indicates that participants made an equal proportion of fixations to both conditioned and unconditioned stimuli, whereas positive and negative values indicate that they made more fixations to the conditioned and the unconditioned stimulus, respectively. To identify sign-trackers, the authors examined the relationship between gaze index and the value of the conditioned stimulus. The top third of the participants who looked more frequently at the conditioned stimulus predicting monetary rewards than at the conditioned stimulus predicting losses were deemed to be sign-trackers. A similar analysis was conducted with the value of the unconditioned stimulus to identify goal-trackers. Eye movement behavior during the conditioning task, including pupil dilation and the number of fixations, was compared across the two groups for the different stimuli.

The authors used computational modeling to determine whether the eye movement patterns of sign-trackers and goal-trackers during the conditioning task reflected value-based or uncertainty-based learning strategies. Value-based learning was assessed in a reinforcement learning model that computes a reward-prediction error value. On the other hand, uncertainty-based learning was assessed in a model that produced a state prediction error value. Finally, the authors examined the neural substrates of the different learning strategies. They used a reinforcement learning model to compute reward prediction error signals in reward-processing regions like the nucleus accumbens in response to the stimuli. Uncertainty-based learning was investigated by computing the state prediction error signal at the onset of the unconditioned stimulus in regions associated with the state prediction error effect, such as the intraparietal sulcus and the lateral prefrontal cortex. The reward prediction and state prediction error effects in the brain were compared between sign-trackers and goal-trackers.

What did they find?

Sign-trackers made more fixations to the appetitive conditioned stimulus associated with a monetary reward than to the aversive conditioned stimulus associated with monetary loss, which is in line with value-based learning. Conversely, goal-trackers made more fixations to the appetitive unconditioned stimulus more than the aversive unconditioned stimulus, but over time, they looked away from the conditioned stimuli and more at the unconditioned stimuli, which is in line with uncertainty-based learning. Pupil dilation in response to both conditioned and unconditioned stimuli also differed between sign-trackers and goal-trackers. Pupil size decreased over the course of learning in goal-trackers but did not change in response to the appetitive and aversive conditioned stimuli. Among sign-trackers, there was no change in pupil size over time, but the pupils dilated in response to appetitive conditioned stimuli compared to the aversive conditioned stimuli. Computational modeling indicated that the value-based model captured pupillary changes for sign-trackers, whereas the uncertainty-based model best explained the pupil dilation in goal-trackers. Overall, these results suggest that eye movement behavior tracks the value of the stimulus among sign-trackers and the upcoming expectation state among goal-trackers.

Distinct patterns of brain activity were associated with learning strategies used by sign-trackers and goal-trackers. In reward-processing brain regions such as the nucleus accumbens, ventromedial prefrontal cortex, and amygdala, the value-based model explained more variance in brain activity for the sign-trackers than for the goal-trackers. In contrast, the uncertainty-based model better explained activity in the intraparietal sulcus of the goal-trackers than the sign-trackers. In sum, the eye-tracking and neural data indicate that sign-trackers used value-based learning strategies while goal-trackers relied more on an uncertainty-based strategy.

What's the impact?

This study is the first to demonstrate the distinct behavioral and neural profiles of learning in sign-trackers, who primarily use value-based learning strategies, and goal-trackers, who rely on uncertainty-based learning. By providing a deeper understanding of the different learning systems in humans, these findings have important implications for the treatment of disorders that involve aberrant reward learning, like addiction.

Schad et al. Dissociating neural learning signals in human sign- and goal-trackers. Nature Human Behavior (2019). Access the original scientific publication here.

Two Neural Features Related to Information Encoding and Behavior

November 26, 2019 by Leigh Christopher

Post by Stephanie Williams

What's the science?

To understand how neurons encode information related to the external world, such as information from the environment, we need to understand which statistical features of individual neurons contain that information. In the past, research groups have suggested the mean firing rate of individual neurons or groups of neurons may represent information about stimuli. Other research has looked at the amount of noise that populations of neurons share (correlated noise), however, the specific statistical properties of neurons related to the information encoding are not well understood. This week in The Journal of Neuroscience, Noguiera and colleagues identify neural features that explain most of the variance in information encoding and behavior. The authors specifically address two questions 1) what features are related to information encoding 2) do those features affect behavioral performance?

How did they do it?

This study involved both experimental data collection and theoretical modelling. For the experimental arm of the study, four monkeys were trained to perform three different tasks. Two of the three tasks were direction discrimination tasks (one coarse discrimination and one fine discrimination), and the third was a spatial attention task, in which two of the monkeys had to detect a change in the orientation of some lines displayed in a circle (i.e. a Gabor patch). The authors recorded neural activity while the monkeys performed a given task from two brain regions: the middle temporal area (MT) and area 8a in the lateral prefrontal cortex. Performance on the tasks was quantified as the number of correct reports of motion for the direction tasks, and as the mean reaction time for the attention task. To test which neural features were related to 1) information encoding and 2) behavioral performance, the authors isolated features of interest by iterating through each extracted feature, and changing the values of one set of features while holding all of the other features constant. By using a statistical technique, called bootstrapping, they could select the bootstrap iterations that produced feature values that were in a narrow range around the median for that particular feature. They used these bootstrapped samples to generate the “fluctuations” in the feature that they were changing during their iterations (they call this method “conditioned bootstrapping”). The authors then trained a binary classifier to predict which task the monkeys were performing and the specific behavior performed in the task (i.e. which Gabor patches were attended to or left vs. right motion stimulus).

For the theoretical arm of the study, the authors defined decoding performance, mathematically simulated experimental data, and tested decoding performance on the simulated data. To define decoding performance, the authors first derived a mathematical expression that described the theoretical optimal performance of a linear classifier. They defined two terms representing the two categories of features of interest 1) the population signal feature, which is a measure of how the overall modulation of activity of the measured neuronal population changes as a function of the stimulus condition, and 2) the projected precision feature which is related to the trial-by-trial variability. To simulate population activity, the authors built a neural population activity model with a large ensemble than was recorded experimentally (N = 1000 model neurons), with each neuron’s activity modeled as a function of stimuli (2 stimuli) of different strengths (3 different strengths). They also incorporated a mathematical term that represented noise correlations (corresponding changes in trial-to-trial variability) between neurons. To model behavioral performance, the authors used an optimal linear classifier to make predictions from simulated neural activity. They then compared the performance of their theoretical decoder with the performance of the decoder trained on their experimental data.

What did they find?

The authors found that two of the features they examined were important features that affected how much information was encoded, and were the strongest predictors of behavioral performance. They found that changing both the 1) population signal feature and the 2) projected precision feature (one at a time, while holding all other features constant), significantly affected the amount of encoded information and also predicted changes in behavioral performance.

The first feature related to population tuning was specifically a length metric that joined the mean population responses across different experimental conditions. The second feature, related to the amount of trial-by-trial variability was calculated as the inverse of the population covariability projected onto the direction of the population signal. Importantly, the authors did not find that other features (such as global activity and mean pairwise correlations), which had previously been suggested by other research, were related to the amount of information encoding when they controlled for the two features they identified. However, it is worth noting that changing the global activity and correlation features did change the amount of information encoded in population activity when the two features that authors identified were not controlled for.

What's the impact?

The authors show for the first time that two features, population signal and projected precision, modulate the amount of information encoded by finite neuronal populations and predict changes in behavior. They also show that two other features did not modulate the amount of encoded information or behavioral performance. These findings shed light on the specific properties of neurons involved in encoding information.

Nogueira et al. The effects of population tuning and trial-by-trial availability on information encoding and behavior. J. Neurosci (2019). Access the original scientific publication here.