Results for ' reinforcement learning'

986 found
Order:
See also
  1.  58
    Reinforcement Learning and Counterfactual Reasoning Explain Adaptive Behavior in a Changing Environment.Yunfeng Zhang, Jaehyon Paik & Peter Pirolli - 2015 - Topics in Cognitive Science 7 (2):368-381.
    Animals routinely adapt to changes in the environment in order to survive. Though reinforcement learning may play a role in such adaptation, it is not clear that it is the only mechanism involved, as it is not well suited to producing rapid, relatively immediate changes in strategies in response to environmental changes. This research proposes that counterfactual reasoning might be an additional mechanism that facilitates change detection. An experiment is conducted in which a task state changes over time (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark  
  2.  80
    Reinforcement learning and artificial agency.Patrick Butlin - 2024 - Mind and Language 39 (1):22-38.
    There is an apparent connection between reinforcement learning and agency. Artificial entities controlled by reinforcement learning algorithms are standardly referred to as agents, and the mainstream view in the psychology and neuroscience of agency is that humans and other animals are reinforcement learners. This article examines this connection, focusing on artificial reinforcement learning systems and assuming that there are various forms of agency. Artificial reinforcement learning systems satisfy plausible conditions for minimal (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  3.  14
    Reinforcement Learning for Production‐Based Cognitive Models.Adrian Brasoveanu & Jakub Dotlačil - 2021 - Topics in Cognitive Science 13 (3):467-487.
    We investigate how Reinforcement Learning methods can be used to solve the production selection and production ordering problem in ACT‐R. We focus on four algorithms from the Q learning family, tabular Q and three versions of Deep Q Networks, as well as the ACT‐R utility learning algorithm, which provides a baseline for the Q algorithms. We compare the performance of these five algorithms in a range of lexical decision tasks framed as sequential decision problems.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  4.  24
    Evolutionary Reinforcement Learning for Adaptively Detecting Database Intrusions.Seul-Gi Choi & Sung-Bae Cho - 2020 - Logic Journal of the IGPL 28 (4):449-460.
    Relational database management system is the most popular database system. It is important to maintain data security from information leakage and data corruption. RDBMS can be attacked by an outsider or an insider. It is difficult to detect an insider attack because its patterns are constantly changing and evolving. In this paper, we propose an adaptive database intrusion detection system that can be resistant to potential insider misuse using evolutionary reinforcement learning, which combines reinforcement learning and (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  5.  23
    Predictive Movements and Human Reinforcement Learning of Sequential Action.Roy Kleijn, George Kachergis & Bernhard Hommel - 2018 - Cognitive Science 42 (S3):783-808.
    Sequential action makes up the bulk of human daily activity, and yet much remains unknown about how people learn such actions. In one motor learning paradigm, the serial reaction time (SRT) task, people are taught a consistent sequence of button presses by cueing them with the next target response. However, the SRT task only records keypress response times to a cued target, and thus it cannot reveal the full time‐course of motion, including predictive movements. This paper describes a mouse (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  6.  15
    Reinforcement Learning-Based Collision Avoidance Guidance Algorithm for Fixed-Wing UAVs.Yu Zhao, Jifeng Guo, Chengchao Bai & Hongxing Zheng - 2021 - Complexity 2021:1-12.
    A deep reinforcement learning-based computational guidance method is presented, which is used to identify and resolve the problem of collision avoidance for a variable number of fixed-wing UAVs in limited airspace. The cooperative guidance process is first analyzed for multiple aircraft by formulating flight scenarios using multiagent Markov game theory and solving it by machine learning algorithm. Furthermore, a self-learning framework is established by using the actor-critic model, which is proposed to train collision avoidance decision-making neural (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  7.  12
    Reinforcement Learning with Probabilistic Boolean Network Models of Smart Grid Devices.Pedro Juan Rivera Torres, Carlos Gershenson García, María Fernanda Sánchez Puig & Samir Kanaan Izquierdo - 2022 - Complexity 2022:1-15.
    The area of smart power grids needs to constantly improve its efficiency and resilience, to provide high quality electrical power in a resilient grid, while managing faults and avoiding failures. Achieving this requires high component reliability, adequate maintenance, and a studied failure occurrence. Correct system operation involves those activities and novel methodologies to detect, classify, and isolate faults and failures and model and simulate processes with predictive algorithms and analytics. In this paper, we showcase the application of a complex-adaptive, self-organizing (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  8. Using Reinforcement Learning to Examine Dynamic Attention Allocation During Reading.Yanping Liu, Erik D. Reichle & Ding-Guo Gao - 2013 - Cognitive Science 37 (8):1507-1540.
    A fundamental question in reading research concerns whether attention is allocated strictly serially, supporting lexical processing of one word at a time, or in parallel, supporting concurrent lexical processing of two or more words (Reichle, Liversedge, Pollatsek, & Rayner, 2009). The origins of this debate are reviewed. We then report three simulations to address this question using artificial reading agents (Liu & Reichle, 2010; Reichle & Laurent, 2006) that learn to dynamically allocate attention to 1–4 words to “read” as efficiently (...)
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  9. Integrating reinforcement learning, bidding and genetic algorithms.Ron Sun - unknown
    This paper presents a GA-based multi-agent reinforce- ment learning bidding approach (GMARLB) for perform- ing multi-agent reinforcement learning. GMARLB inte- grates reinforcement learning, bidding and genetic algo- rithms. The general idea of our multi-agent systems is as follows: There are a number of individual agents in a team, each agent of the team has two modules: Q module and CQ module. Each agent can select actions to be performed at each step, which are done by (...)
     
    Export citation  
     
    Bookmark  
  10.  35
    Deep Reinforcement Learning for Vectored Thruster Autonomous Underwater Vehicle Control.Tao Liu, Yuli Hu & Hui Xu - 2021 - Complexity 2021:1-25.
    Autonomous underwater vehicles are widely used to accomplish various missions in the complex marine environment; the design of a control system for AUVs is particularly difficult due to the high nonlinearity, variations in hydrodynamic coefficients, and external force from ocean currents. In this paper, we propose a controller based on deep reinforcement learning in a simulation environment for studying the control performance of the vectored thruster AUV. RL is an important method of artificial intelligence that can learn behavior (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  11. Reinforcement learning: A brief guide for philosophers of mind.Julia Haas - 2022 - Philosophy Compass 17 (9):e12865.
    In this opinionated review, I draw attention to some of the contributions reinforcement learning can make to questions in the philosophy of mind. In particular, I highlight reinforcement learning's foundational emphasis on the role of reward in agent learning, and canvass two ways in which the framework may advance our understanding of perception and motivation.
    Direct download (5 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  12.  52
    Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling.A. David Redish, Steve Jensen, Adam Johnson & Zeb Kurth-Nelson - 2007 - Psychological Review 114 (3):784-805.
    No categories
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   22 citations  
  13.  33
    Predictive Movements and Human Reinforcement Learning of Sequential Action.Roy de Kleijn, George Kachergis & Bernhard Hommel - 2018 - Cognitive Science 42 (S3):783-808.
    Sequential action makes up the bulk of human daily activity, and yet much remains unknown about how people learn such actions. In one motor learning paradigm, the serial reaction time (SRT) task, people are taught a consistent sequence of button presses by cueing them with the next target response. However, the SRT task only records keypress response times to a cued target, and thus it cannot reveal the full time‐course of motion, including predictive movements. This paper describes a mouse (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  14.  34
    Network formation by reinforcement learning: The long and medium run.Brian Skyrms - unknown
    We investigate a simple stochastic model of social network formation by the process of reinforcement learning with discounting of the past. In the limit, for any value of the discounting parameter, small, stable cliques are formed. However, the time it takes to reach the limiting state in which cliques have formed is very sensitive to the discounting parameter. Depending on this value, the limiting result may or may not be a good predictor for realistic observation times.
    Direct download  
     
    Export citation  
     
    Bookmark   21 citations  
  15.  11
    Reinforcement learning of non-Markov decision processes.Steven D. Whitehead & Long-Ji Lin - 1995 - Artificial Intelligence 73 (1-2):271-306.
  16.  28
    Reinforcement learning for Golog programs with first-order state-abstraction.D. Beck & G. Lakemeyer - 2012 - Logic Journal of the IGPL 20 (5):909-942.
  17.  32
    Reinforcement learning and higher level cognition: Introduction to special issue.Nathaniel D. Daw & Michael J. Frank - 2009 - Cognition 113 (3):259-261.
  18.  43
    Reinforcement learning of self-regulated β-oscillations for motor restoration in chronic stroke.Georgios Naros & Alireza Gharabaghi - 2015 - Frontiers in Human Neuroscience 9.
  19. When, What, and How Much to Reward in Reinforcement Learning-Based Models of Cognition.Christian P. Janssen & Wayne D. Gray - 2012 - Cognitive Science 36 (2):333-358.
    Reinforcement learning approaches to cognitive modeling represent task acquisition as learning to choose the sequence of steps that accomplishes the task while maximizing a reward. However, an apparently unrecognized problem for modelers is choosing when, what, and how much to reward; that is, when (the moment: end of trial, subtask, or some other interval of task performance), what (the objective function: e.g., performance time or performance accuracy), and how much (the magnitude: with binary, categorical, or continuous values). (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   6 citations  
  20. Multi-Agent Reinforcement Learning: Weighting and Partitioning.Ron Sun & Todd Peterson - unknown
    This paper addresses weighting and partitioning in complex reinforcement learning tasks, with the aim of facilitating learning. The paper presents some ideas regarding weighting of multiple agents and extends them into partitioning an input/state space into multiple regions with di erential weighting in these regions, to exploit di erential characteristics of regions and di erential characteristics of agents to reduce the learning complexity of agents (and their function approximators) and thus to facilitate the learning overall. (...)
     
    Export citation  
     
    Bookmark   6 citations  
  21.  72
    Novelty and Inductive Generalization in Human Reinforcement Learning.Samuel J. Gershman & Yael Niv - 2015 - Topics in Cognitive Science 7 (3):391-415.
    In reinforcement learning, a decision maker searching for the most rewarding option is often faced with the question: What is the value of an option that has never been tried before? One way to frame this question is as an inductive problem: How can I generalize my previous experience with one set of options to a novel option? We show how hierarchical Bayesian inference can be used to solve this problem, and we describe an equivalence between the Bayesian (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  22.  8
    Reinforcement learning in factories: the auton project.Andrew W. Moore - 1996 - In Garrison W. Cottrell (ed.), Proceedings of the Eighteenth Annual Conference of The Cognitive Science Society. Lawrence Erlbaum. pp. 18--12.
  23.  16
    Relational reinforcement learning with guided demonstrations.David Martínez, Guillem Alenyà & Carme Torras - 2017 - Artificial Intelligence 247 (C):295-312.
  24.  23
    Reinforcement Learning in Autism Spectrum Disorder.Manuela Schuetze, Christiane S. Rohr, Deborah Dewey, Adam McCrimmon & Signe Bray - 2017 - Frontiers in Psychology 8.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  25.  19
    Deep Reinforcement Learning for UAV Intelligent Mission Planning.Longfei Yue, Rennong Yang, Ying Zhang, Lixin Yu & Zhuangzhuang Wang - 2022 - Complexity 2022:1-13.
    Rapid and precise air operation mission planning is a key technology in unmanned aerial vehicles autonomous combat in battles. In this paper, an end-to-end UAV intelligent mission planning method based on deep reinforcement learning is proposed to solve the shortcomings of the traditional intelligent optimization algorithm, such as relying on simple, static, low-dimensional scenarios, and poor scalability. Specifically, the suppression of enemy air defense mission planning is described as a sequential decision-making problem and formalized as a Markov decision (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  26.  23
    Integrating reinforcement learning, equilibrium points, and minimum variance to understand the development of reaching: A computational model.Daniele Caligiore, Domenico Parisi & Gianluca Baldassarre - 2014 - Psychological Review 121 (3):389-421.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  27.  17
    Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance.W. Bradley Knox & Peter Stone - 2015 - Artificial Intelligence 225 (C):24-50.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  28.  15
    Reinforcement Learning.Oliver Kramer - 2009 - In L. Magnani (ed.), computational intelligence. pp. 101--117.
  29.  57
    Can reinforcement learning explain variation in early infant crying?Arnon Lotem & David W. Winkler - 2004 - Behavioral and Brain Sciences 27 (4):468-468.
    We welcome Soltis' use of evolutionary signaling theory, but question his interpretations of colic as a signal of vigor and his explanation of abnormal high-pitched crying as a signal of poor infant quality. Instead, we suggest that these phenomena may be suboptimal by-products of a generally adaptive learning process by which infants adjust their crying levels in relation to parental responsiveness.
    Direct download (7 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  30.  12
    Decentralized Reinforcement Learning of Robot Behaviors.David L. Leottau, Javier Ruiz-del-Solar & Robert Babuška - 2018 - Artificial Intelligence 256 (C):130-159.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  31.  15
    Certified reinforcement learning with logic guidance.Hosein Hasanbeig, Daniel Kroening & Alessandro Abate - 2023 - Artificial Intelligence 322 (C):103949.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  32.  14
    Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs.Finale Doshi-Velez, Joelle Pineau & Nicholas Roy - 2012 - Artificial Intelligence 187-188 (C):115-132.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  33. Reinforcement learning with raw image pixels as state input.D. Ernst, R. Marée & L. Wehenkel - 2006 - In O. Stock & M. Schaerf (eds.), Lecture Notes In Computer Science. Springer Verlag. pp. 4153.
  34.  21
    Enforcing ethical goals over reinforcement-learning policies.Guido Governatori, Agata Ciabattoni, Ezio Bartocci & Emery A. Neufeld - 2022 - Ethics and Information Technology 24 (4):1-19.
    Recent years have yielded many discussions on how to endow autonomous agents with the ability to make ethical decisions, and the need for explicit ethical reasoning and transparency is a persistent theme in this literature. We present a modular and transparent approach to equip autonomous agents with the ability to comply with ethical prescriptions, while still enacting pre-learned optimal behaviour. Our approach relies on a normative supervisor module, that integrates a theorem prover for defeasible deontic logic within the control loop (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  35.  60
    Deep Reinforcement Learning as Foundation for Artificial General Intelligence.Itamar Arel - 2012 - In Pei Wang & Ben Goertzel (eds.), Theoretical Foundations of Artificial General Intelligence. Springer. pp. 89--102.
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  36. Supplementing neural reinforcement learning with symbolic methods possibilities and challenges.Ron Sun - unknown
    methods to improve reinforcement learning are identi ed and discussed in some detail Each demonstrates to some extent the advantages of combining RL and symbolic meth ods These methods point to the potentials and the chal lenges of this line of research..
     
    Export citation  
     
    Bookmark  
  37.  11
    Expanding horizons in reinforcement learning for curious exploration and creative planning.Dale Zhou & Aaron M. Bornstein - 2024 - Behavioral and Brain Sciences 47:e118.
    Curiosity and creativity are expressions of the trade-off between leveraging that with which we are familiar or seeking out novelty. Through the computational lens of reinforcement learning, we describe how formulating the value of information seeking and generation via their complementary effects on planning horizons formally captures a range of solutions to striking this balance.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  38. Reinforcement learning.Chris Jch Watkins & Peter Dayan - 2003 - In L. Nadel (ed.), Encyclopedia of Cognitive Science. Nature Publishing Group.
  39. Bidding in Reinforcement Learning: A Paradigm for Multi-Agent Systems.Chad Sessions - unknown
    The paper presents an approach for developing multi-agent reinforcement learning systems that are made up of a coalition of modular agents. We focus on learning to segment sequences (sequential decision tasks) to create modular structures, through a bidding process that is based on reinforcements received during task execution. The approach segments sequences (and divides them up among agents) to facilitate the learning of the overall task. Notably, our approach does not rely on a priori knowledge or (...)
     
    Export citation  
     
    Bookmark   1 citation  
  40.  50
    Integrating reinforcement learning with models of representation learning.Matt Jones & Fabián Canas - 2010 - In S. Ohlsson & R. Catrambone (eds.), Proceedings of the 32nd Annual Conference of the Cognitive Science Society. Cognitive Science Society. pp. 1258--1263.
  41. Cheap talk, reinforcement learning, and the emergence of cooperation.J. McKenzie Alexander - 2015 - Philosophy of Science 82 (5):969-982.
    Cheap talk has often been thought incapable of supporting the emergence of cooperation because costless signals, easily faked, are unlikely to be reliable (Zahavi and Zahavi, 1997). I show how, in a social network model of cheap talk with reinforcement learning, cheap talk does enable the emergence of cooperation, provided that individuals also temporally discount the past. This establishes one mechanism that suffices for moving a population of initially uncooperative individuals to a state of mutually beneficial cooperation even (...)
    Direct download (7 more)  
     
    Export citation  
     
    Bookmark  
  42.  18
    “Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling”: Correction.David A. Redish, Steve Jensen, Adam Johnson & Zeb Kurth-Nelson - 2009 - Psychological Review 116 (3):518-518.
  43.  18
    Reinforcement Learning With Parsimonious Computation and a Forgetting Process.Asako Toyama, Kentaro Katahira & Hideki Ohira - 2019 - Frontiers in Human Neuroscience 13.
  44. The Archimedean trap: Why traditional reinforcement learning will probably not yield AGI.Samuel Allen Alexander - 2020 - Journal of Artificial General Intelligence 11 (1):70-85.
    After generalizing the Archimedean property of real numbers in such a way as to make it adaptable to non-numeric structures, we demonstrate that the real numbers cannot be used to accurately measure non-Archimedean structures. We argue that, since an agent with Artificial General Intelligence (AGI) should have no problem engaging in tasks that inherently involve non-Archimedean rewards, and since traditional reinforcement learning rewards are real numbers, therefore traditional reinforcement learning probably will not lead to AGI. We (...)
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  45.  39
    On Adaptation, Maximization, and Reinforcement Learning Among Cognitive Strategies.Ido Erev & Greg Barron - 2005 - Psychological Review 112 (4):912-931.
  46.  26
    Counterfactual state explanations for reinforcement learning agents via generative deep learning.Matthew L. Olson, Roli Khanna, Lawrence Neal, Fuxin Li & Weng-Keen Wong - 2021 - Artificial Intelligence 295 (C):103455.
  47. Knowledge extraction from reinforcement learning.Ron Sun - unknown
    traction from reinforcement learners It addresses two ap proaches towards knowledge extraction the extraction of ex plicit symbolic rules from neural reinforcement learners and the extraction of complete plans from such learners The advantages of such knowledge extraction include the improvement of learning especially with the rule extraction approach and the improvement of the usability of re sults of learning..
     
    Export citation  
     
    Bookmark  
  48.  11
    Interestingness elements for explainable reinforcement learning: Understanding agents' capabilities and limitations.Pedro Sequeira & Melinda Gervasio - 2020 - Artificial Intelligence 288 (C):103367.
  49.  30
    Using reinforcement learning to understand the emergence of "intelligent" eye-movement behavior during reading.Erik D. Reichle & Patryk A. Laurent - 2006 - Psychological Review 113 (2):390-408.
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   6 citations  
  50. Emotion-driven reinforcement learning.R. P. Marinier & John E. Laird - unknown
     
    Export citation  
     
    Bookmark   3 citations  
1 — 50 / 986