Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Text mining

Text mining Purpose– The purpose of this paper was to analyse text mining (TM) literature indexed in the Web of Science (WoS) under the “Information Science Library Science” subcategory. More specifically, it analyses the chronological growth of TM literature, and the major countries, institutions, departments and individuals contributing to TM literature. Collaboration in TM research is also analysed. Design/methodology/approach– Bibliographic and citation data required for this research were retrieved from the WoS database. TM being a multidisciplinary field, the search was restricted to “Information Science Library Science” subcategory in the WoS. A comprehensive query statement covering all synonyms of “text mining” was prepared using the Boolean operator “OR”. Microsoft Excel and HistCite software were used for data analysis. Pajek and VoSviewer were used for data visualization. Findings– It was found that USA is the major producer of TM research literature, and the highest number of papers were published in the Journal of The American Medical Informatics. Columbia University ranked first both in number of articles and citations received in the top ten institutes publishing TM literature. It was also observed that six of the top ten subdivisions of institutions are either from medicine or medical informatics or biomedical information. H.C. Chen and C. Friedman were seen to be the most prolific authors. Research limitations/implications– The paper analyses articles on TM published during 1999-2013 in WoS under the subcategory Information Science Library Science’. Originality/value– The paper is based on empirical data exclusively gathered for this research. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Library Review Emerald Publishing

Loading next page...
 
/lp/emerald-publishing/text-mining-in67WPW82l

References (27)

Publisher
Emerald Publishing
Copyright
Copyright © Emerald Group Publishing Limited
ISSN
0024-2535
DOI
10.1108/LR-08-2014-0091
Publisher site
See Article on Publisher Site

Abstract

Purpose– The purpose of this paper was to analyse text mining (TM) literature indexed in the Web of Science (WoS) under the “Information Science Library Science” subcategory. More specifically, it analyses the chronological growth of TM literature, and the major countries, institutions, departments and individuals contributing to TM literature. Collaboration in TM research is also analysed. Design/methodology/approach– Bibliographic and citation data required for this research were retrieved from the WoS database. TM being a multidisciplinary field, the search was restricted to “Information Science Library Science” subcategory in the WoS. A comprehensive query statement covering all synonyms of “text mining” was prepared using the Boolean operator “OR”. Microsoft Excel and HistCite software were used for data analysis. Pajek and VoSviewer were used for data visualization. Findings– It was found that USA is the major producer of TM research literature, and the highest number of papers were published in the Journal of The American Medical Informatics. Columbia University ranked first both in number of articles and citations received in the top ten institutes publishing TM literature. It was also observed that six of the top ten subdivisions of institutions are either from medicine or medical informatics or biomedical information. H.C. Chen and C. Friedman were seen to be the most prolific authors. Research limitations/implications– The paper analyses articles on TM published during 1999-2013 in WoS under the subcategory Information Science Library Science’. Originality/value– The paper is based on empirical data exclusively gathered for this research.

Journal

Library ReviewEmerald Publishing

Published: Apr 13, 2015

There are no references for this article.