Abstract
Within psychology, philosophy, and cognitive science, theory of mind refers to the
cognitive ability to reason about the mental states of other people, thus recognizing
them as having beliefs, knowledge, intentions and emotions of their own. In
this project, we construct a natural language inference (NLI) dataset that tests
the ability of a state-of-the-art language model, RoBERTa-large finetuned on
the MNLI dataset, to make theory of mind inferences related to knowledge and
belief. Experimental results suggest that the model struggles with such inferences,
even after attempts at further finetuning.