Unsupervised Learning from Dyadic Data
Title | Unsupervised Learning from Dyadic Data |
Publication Type | Technical Report |
Year of Publication | 1998 |
Authors | Hofmann, T., & Puzicha J. |
Other Numbers | 1157 |
Abstract | Dyadic data refers to a domain with two finite sets of objects in which observations are made for dyads, i.e., pairs with one element from either set. This includes event co-occurrences, histogram data, and single stimulus preference data as special cases. Dyadic data arises naturally in many applications ranging from computational linguistics and information retrieval to preference analysis and computer vision. In this paper, we present a systematic, domain-independent framework for unsupervised learning from dyadic data by statistical mixture models. Our approach covers different models with flat and hierarchical latent class structures and unifies probabilistic modeling and structure discovery. Mixture models provide both, a parsimonious yet flexible parameterization of probability distributions with good generalization performance on sparse data, as well as structural information about data-inherent grouping structure. We propose an annealed version of the standard Expectation Maximization algorithm for model fitting which is empirically evaluated on a variety of data sets from different domains. |
URL | http://www.icsi.berkeley.edu/ftp/global/pub/techreports/1998/tr-98-042.pdf |
Bibliographic Notes | ICSI Technical Report TR-98-042 |
Abbreviated Authors | T. Hofmann and J. Puzicha |
ICSI Publication Type | Technical Report |