Abstract
Statistical topic models provide a general data-driven framework for the automated discovery of high-level knowledge from large collections of text documents. Although topic models can potentially discover a broad range of themes in a data set, the interpretability of the learned topics is not always ideal. Human-defined concepts, by contrast, tend to be semantically richer because the words that define each concept are carefully chosen, but they may not span the themes in a data set exhaustively. In this study, we review a new probabilistic framework for combining a hierarchy of human-defined semantic concepts with a statistical topic model to seek the best of both worlds. Results indicate that this combination leads to systematic improvements in generalization performance and enables new techniques for inferring and visualizing the content of a document.