Discovering Concepts from Word Co-occurrences with a Relational Model
スポンサーリンク
概要
- 論文の詳細を見る
Clustering word co-occurrences has been studied to discover clusters as latentconcepts. Previous work has applied the semantic aggregate model (SAM), and reports that discovered clusters seem semantically significant. The SAM assumes a co-occurrence arises from one latent concept. This assumption seems moderately natural. However, to analyze latent concepts more deeply, the assumption may be too restrictive. We propose to make clusters for each part of speech from co-occurrence data. For example, we make adjective clusters and noun clusters from adjective—noun co-occurrences while the SAM builds clusters of "co-occurrences." The proposed approach allows us to analyze adjectives and nouns independently.To take this approach, we propose a frequency-based infinite relational model (FIRM) for word co-occurrences. The FIRM is a stochastic block model that takes into account the frequency of observations although traditional stochastic blockmodels ignore it. The FIRM also utilizes the Dirichlet process so that the number of clusters is inferred. We derive a variational inference algorithm for the model to apply to a large dataset. Experimental results show that the FIRM is more helpful to analyze adjectives and nouns independently, and the FIRM clusters capture the SAM clusters better than a stochastic blockmodel.
- Information and Media Technologies 編集運営会議の論文
著者
-
SATO Taisuke
Department of Applied Chemistry, Faculty of Engineering, Osaka University
-
Kameya Yoshitaka
Department Of Computer Science Graduate School Of Information Science And Engineering Tokyo Institut
-
Kurihara Kenichi
Department Of Computer Science Graduate School Of Information Science And Engineering Tokyo Institut
関連論文
- Catalytic Addition of Olefinic C-H Bonds to Olefins
- The Ruthenium-Catalyzed Addition of βC-H Bonds in Aldehydes to Olefins
- Ruthenium-Catalyzed Coupling of Aromatic Carbon-Hydrogen Bonds in Aromatic Imidates with Olefins
- New Protocol for the Siteselective Alkylation and Vinylation of Aromatic Compounds. Catalyst-Specific Reactions
- Ruthenium-Catalyzed Reactions of Acyclic α,β-Enones with Olefins and Their Reaction Mechanisms
- Discovering concepts from word co-occurrences with a relational model (論文特集:データマイニングと統計数理)
- Discovering Concepts from Word Co-occurrences with a Relational Model