Zipf's Law and Heaps' Law Can Predict the Size of Potential Words (New Perspectives, Proceedings of the YITP Workshop on Econophysics, Econophysics 2011-The Hitchhiker's Guide to the Economy-)
Abstract
We confirm Zipf's law and Heaps' law using various types of documents such as literary works, blogs, and computer programs. Independent of the document type, the exponents of Zipf's law are estimated to be approximately 1, whereas Heaps' exponents appear to depend on the observation size, and the estimated values are scattered around 0.5. By definition, randomly shuffled documents reproduce Zipf's law and Heaps' law. However, artificially generated documents using the empirically observed Zipf's law and number of distinct words do not reproduce Heaps' law. We demonstrate that Heaps' law holds for artificial documents in which a certain number of distinct words are added to the empirically observed distinct words. This suggests that the number of potential distinct words considered in the creation of a given document can be predicted.
- 2012-06-14
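The experiment described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: it assumes an artificial document is built by drawing tokens independently from a Zipf distribution over a fixed pool of distinct words, and that the Heaps exponent is estimated as a least-squares slope on a log-log plot of vocabulary size against document length. The function names (`generate_document`, `vocabulary_growth`, `heaps_exponent`) and all parameter values are hypothetical.

```python
import math
import random


def zipf_weights(n_words, exponent=1.0):
    """Unnormalized Zipf weights: the word of rank r gets weight r ** -exponent."""
    return [r ** -exponent for r in range(1, n_words + 1)]


def generate_document(n_tokens, n_words, exponent=1.0, seed=0):
    """Draw n_tokens word ranks independently from a Zipf distribution.

    Assumption: independent sampling from a fixed pool of n_words words;
    the paper may construct its artificial documents differently.
    """
    rng = random.Random(seed)
    ranks = list(range(1, n_words + 1))
    return rng.choices(ranks, weights=zipf_weights(n_words, exponent), k=n_tokens)


def vocabulary_growth(tokens):
    """Number of distinct words seen after each token (the Heaps curve)."""
    seen = set()
    growth = []
    for token in tokens:
        seen.add(token)
        growth.append(len(seen))
    return growth


def loglog_slope(xs, ys):
    """Least-squares slope of log(y) against log(x)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    mean_x = sum(lx) / len(lx)
    mean_y = sum(ly) / len(ly)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    var = sum((a - mean_x) ** 2 for a in lx)
    return cov / var


def heaps_exponent(tokens):
    """Estimate the Heaps exponent from the vocabulary growth curve.

    The curve is sampled at document sizes 1, 2, 4, 8, ... so that the
    points are evenly spaced on the logarithmic axis.
    """
    growth = vocabulary_growth(tokens)
    sizes = [2 ** k for k in range(int(math.log2(len(tokens))) + 1)]
    return loglog_slope(sizes, [growth[s - 1] for s in sizes])


if __name__ == "__main__":
    n_tokens = 20000
    observed_words = 2000   # hypothetical number of observed distinct words
    extra_words = 2000      # hypothetical number of added "potential" words

    for pool in (observed_words, observed_words + extra_words):
        doc = generate_document(n_tokens, pool)
        print(f"pool={pool}: distinct={len(set(doc))}, "
              f"Heaps slope={heaps_exponent(doc):.3f}")
```

Comparing the fitted slopes for the two pool sizes, and how straight the log-log curve looks at large document sizes, gives a rough feel for why the size of the word pool matters for Heaps' law in artificial documents.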
Authors
- Takayasu Hideki, Sony Computer Science Laboratories
- Takayasu Misako, Department Of Computational Intelligence & Systems Science Interdisciplinary Graduate School Of
- Sano Yukie, College of Science and Technology, Nihon University
Related papers
- Hubs and Authorities on Japanese Inter-Firm Network : Characterization of Nodes in Very Large Directed Networks(Complex Networks,Econophysics-Physical Approach to Social and Economic Phenomena-)
- A New Simple Method for Evaluating Cardiac Vagal Activity : Standard Deviation of Ratios for Consecutive RR Intervals
- Estimation of Parameters from Discrete Random Nonstationary Time Series(New Perspectives,Econophysics-Physical Approach to Social and Economic Phenomena-)
- TCP Optimization for Eliminating Duplicate Segments in Congested Networks(Network Protocols)
- Continuum Limit and Renormalization of Market Price Dynamics Based on PUCK Model(Financial and Consumer Markets,Econophysics-Physical Approach to Social and Economic Phenomena-)
- Spatial asymmetry and temporal delay of inhibitory amacrine cells produce directional selectivity in retina
- Observation of Two Types of Behaviors of Financial Bubbles and the Related Higher-Order Potential Forces(Financial and Consumer Markets,Econophysics-Physical Approach to Social and Economic Phenomena-)
- The Statistical Relationship between Product Life Cycle and Repeat Purchase Behavior in Convenience Stores(Financial and Consumer Markets,Econophysics-Physical Approach to Social and Economic Phenomena-)