Access the full text.
Sign up today, get DeepDyve free for 14 days.
V.A. Yatsko (2014)
The method of zonal correlation text analysisAutom. Doc. Math. Linguist., 48
J. Novoviĉová, A. Malik (2005)
Proc. Int. Joint Conf. on Neural Networks
V.A. Yatsko (2014)
Computational linguistics or linguistic informatics?Autom. Doc. Math. Linguist., 48
A.I. Mikhailov, A.I. Chernyi, R.S. Gilyarevskii (1966)
Informatics is the new name of the theory of scientific informationNauchn.-Tekhn. Inform., 12
C.D. Manning, P. Raghavan, H. Schutze (2009)
An Introduction to Information Retrieval
V.A. Yatsko (2013)
The method of zonal text analysisV Mire Nauchn. Otkryt., 6.1
R. Köhler, B.B. Rieger (1993)
Proc. 1st Int. Conf. on Quantitative Linguistics
G. Altmann, I.-I. Popescu, D. Zotta (2013)
Stratification in textsGlottometrics, 25
I.-I. Popescu, J. Mautek, G. Altmann (2009)
Aspects of Word Frequencies
X. Gabaix (1999)
Zipf’s law for cities: An explanationQ. J. Econ., 114
This paper describes a method for automatic text classification based on analysing the deviation of the word distribution from Zipf’s law, combined with the zonal data-processing approach. Deviation is understood as the difference between the actual numerical score of a word and its score according to Zipf’s law. The proposed method involves the division of input and reference texts into J 0, J 1, and J 2 zones, and the creation of a numerical series using the words that are contained in the J 0 zone. The constructed numerical series shows the difference between the real scores of words and the scores calculated according to Zipf’s law. The proposed method can significantly reduce text dimensionality and thus improve the running speed of automatic text classification.
Automatic Documentation and Mathematical Linguistics – Springer Journals
Published: Aug 1, 2015
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.