  1.
    Beyond Preferences in AI Alignment. Tan Zhi-Xuan, Micah Carroll, Matija Franklin & Hal Ashton - forthcoming - Philosophical Studies:1-51.
    The dominant practice of AI alignment assumes (1) that preferences are an adequate representation of human values, (2) that human rationality can be understood in terms of maximizing the satisfaction of preferences, and (3) that AI systems should be aligned with the preferences of one or more humans to ensure that they behave safely and in accordance with our values. Whether implicitly followed or explicitly endorsed, these commitments constitute what we term a preferentist approach to AI alignment. In this paper, we characterize (...)
  2.
    Probabilistic programming versus meta-learning as models of cognition. Desmond C. Ong, Tan Zhi-Xuan, Joshua B. Tenenbaum & Noah D. Goodman - 2024 - Behavioral and Brain Sciences 47:e158.
    We summarize the recent progress made by probabilistic programming as a unifying formalism for the probabilistic, symbolic, and data-driven aspects of human cognition. We highlight differences with meta-learning in flexibility, statistical assumptions and inferences about cognition. We suggest that the meta-learning approach could be further strengthened by considering Connectionist and Bayesian approaches, rather than exclusively one or the other.
  3.
    Resource‐Rational Virtual Bargaining for Moral Judgment: Toward a Probabilistic Cognitive Model. Diego Trujillo, Mindy Zhang, Tan Zhi-Xuan, Joshua B. Tenenbaum & Sydney Levine - forthcoming - Topics in Cognitive Science.
    Recent theoretical work has argued that moral psychology can be understood through the lens of “resource rational contractualism.” The view posits that the best way of making a decision that affects other people is to get everyone together to negotiate under idealized conditions. The outcome of that negotiation is an arrangement (or “contract”) that would lead to mutual benefit. However, this ideal is seldom (if ever) practical given the resource demands (time, information, computational processing power) that are required. Instead, the (...)