The sustainability and stabilization of tag vocabulary in CiteULike: An empirical study of collaborative tagging


Online Information Review, Volume 36 (5): 20 – Sep 21, 2012



Publisher: Emerald Publishing
Copyright: © 2012 Emerald Group Publishing Limited. All rights reserved.
ISSN: 1468-4527
DOI: 10.1108/14684521211275966

Abstract

Purpose – This study examines the growth patterns of tag vocabulary in collaborative tagging systems to verify the sustainability and stabilization of tag distributions. Both sustainability and stabilization are essential to the mining and categorization of information driven by tagging behaviors.

Design/methodology/approach – The study was based on time series data of CiteULike from November 2004 to April 2010. Power law distributions were detected to reveal statistical regularities and tagging patterns. Logistic regression analysis with time-dependent covariates was conducted to identify the factors affecting the growth of distinct tags for articles. The significance of the effects and the time taken for a given article to reach its tagging maturity were also explored.

Findings – Time series plots and trend analysis illustrated the continuous growth of the tagging system. Exploratory analysis of power law distribution fittings indicated a sign of system stability known as scale invariance. Logistic regression results demonstrated that, for a given article, the number of users who tagged it, the date on which it was first tagged, and its life span are significantly associated with the ratio of the number of distinct tags to the total number of tags. These results confirmed that the distinct tag ratio of an article gives rise to a stable pattern.

Originality/value – Though extensive work has been done on the patterns of tag vocabulary, it is not clear how the growth of distinct tags behaves in relation to the total number of tag applications when time-dependent covariates such as the number of users and the longevity of an article are considered. This paper sets out to complement the existing methodology in the literature and to investigate this property in detail.
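The central quantity in the abstract, the ratio of distinct tags to total tag applications for an article, can be illustrated with a minimal Python sketch. The `(article_id, user_id, tag)` record layout, the function name `distinct_tag_ratio`, and the sample data are assumptions made for illustration only; they are not taken from the paper or from the CiteULike data set.

```python
from collections import defaultdict


def distinct_tag_ratio(tag_applications):
    """Return {article_id: distinct tags / total tag applications}.

    `tag_applications` is an iterable of (article_id, user_id, tag)
    tuples. This layout is a hypothetical one chosen for the example.
    """
    totals = defaultdict(int)    # total tag applications per article
    distinct = defaultdict(set)  # set of distinct tags per article
    for article_id, _user_id, tag in tag_applications:
        totals[article_id] += 1
        distinct[article_id].add(tag)
    return {a: len(distinct[a]) / totals[a] for a in totals}


# Made-up sample: article "a1" receives 4 tag applications using 2 distinct tags.
sample = [
    ("a1", "u1", "tagging"),
    ("a1", "u2", "tagging"),
    ("a1", "u2", "folksonomy"),
    ("a1", "u3", "tagging"),
    ("a2", "u1", "power-law"),
]
ratios = distinct_tag_ratio(sample)
print(ratios)
```

Under these assumptions, a ratio that falls as more users tag an article means that new tag applications increasingly reuse existing tags, which is one way to read the stable pattern the abstract reports.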

Journal: Online Information Review (Emerald Publishing)

Published: Sep 21, 2012

Keywords: Tag vocabulary; Collaborative tagging; CiteULike; Statistical analysis; Journals
