TY - GEN
T1 - A Reinforcement Learning Model of Temporal Difference Variations for Action-Selection and Action-Execution in the Human Brain
AU - Natsheh, Ashar Y.
AU - Natsheh, Joman Y.
AU - Mousa, Aya H.
AU - Al-Saheb, Mahmoud H.
AU - Moustafa, Ahmed A.
AU - Herzallah, Mohammad M.
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023/8
Y1 - 2023/8
N2 - Temporal difference (TD) prediction error signal models are instrumental in simulating brain function during reinforcement learning (RL). Recent evidence suggests a significant role of TD prediction error signals in the action-selection and action-execution brain networks. We introduce a novel neuro-computational model that addresses the effects of temporal difference error signal variations on reinforcement learning for action-selection and action-execution networks. These networks represent the basal ganglia and prefrontal cortex brain regions, while the TD prediction error signal represents the dopamine neurotransmitter. The model incorporates dopamine genetic parameters in the two networks (COMT gene for action-selection; DAT1 gene for action-execution) to generate four different parameter combinations. The model simulation showed that TD signaling in both networks plays a significant role in RL under optimal conditions of medium, not high, TD signals. Moreover, each parameter combination showed a unique pattern of RL, corresponding with experimental data obtained using a computer-based RL task.
AB - Temporal difference (TD) prediction error signal models are instrumental in simulating brain function during reinforcement learning (RL). Recent evidence suggests a significant role of TD prediction error signals in the action-selection and action-execution brain networks. We introduce a novel neuro-computational model that addresses the effects of temporal difference error signal variations on reinforcement learning for action-selection and action-execution networks. These networks represent the basal ganglia and prefrontal cortex brain regions, while the TD prediction error signal represents the dopamine neurotransmitter. The model incorporates dopamine genetic parameters in the two networks (COMT gene for action-selection; DAT1 gene for action-execution) to generate four different parameter combinations. The model simulation showed that TD signaling in both networks plays a significant role in RL under optimal conditions of medium, not high, TD signals. Moreover, each parameter combination showed a unique pattern of RL, corresponding with experimental data obtained using a computer-based RL task.
UR - http://www.scopus.com/inward/record.url?scp=85171741452&partnerID=8YFLogxK
U2 - 10.1109/ICIT58056.2023.10226147
DO - 10.1109/ICIT58056.2023.10226147
M3 - Conference contribution
AN - SCOPUS:85171741452
SN - 9798350320077
T3 - 2023 International Conference on Information Technology: Cybersecurity Challenges for Sustainable Cities, ICIT 2023 - Proceeding
SP - 236
EP - 243
BT - 2023 International Conference on Information Technology: Cybersecurity Challenges for Sustainable Cities, ICIT 2023 - Proceeding
A2 - Jaber, Khalid Mohammad
PB - IEEE, Institute of Electrical and Electronics Engineers
T2 - 11th International Conference on Information Technology
Y2 - 9 August 2023 through 10 August 2023
ER -