Blog tags are labels that classify blog documents into categories. Most tags are user-generated, which creates problems such as inconsistent tags across users, untagged blogs, insufficiently descriptive tags, and a lack of semantic distinction between tags. In this paper, we use dimensionality reduction techniques to reduce the inherent noise in blog tags. A tag-topic model is combined with dimensionality reduction and then evaluated on real-world blog data. Reducing the document-tag space with these techniques yielded better classification results. This indicates that the noise in tags can be effectively reduced by representing the original set of tags with a smaller number of latent tags, which can lead to more accurate real-time categorisation of blog documents.
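As a rough illustration of representing the original tags with a smaller number of latent tags, the sketch below applies a truncated SVD to a toy document-tag matrix. The abstract does not name the specific dimensionality reduction technique or describe the tag-topic model, so the choice of SVD, the tag names, the counts, and the helper `reduce_tags` are all assumptions made for this example and are not the authors' method.

```python
import numpy as np

# Toy document-tag matrix: rows are blog posts, columns are user-supplied tags.
# Tags and values are invented for illustration only.
tags = ["python", "py", "coding", "recipes", "cooking", "food"]
doc_tag = np.array([
    [1, 0, 1, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
], dtype=float)


def reduce_tags(matrix, k):
    """Project documents onto k latent tags using a truncated SVD.

    Hypothetical helper: SVD is an assumed stand-in for whichever
    dimensionality reduction technique the paper actually uses.
    """
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    doc_latent = u[:, :k] * s[:k]  # each document expressed in k latent tags
    tag_loadings = vt[:k]          # weight of each original tag on each latent tag
    return doc_latent, tag_loadings


doc_latent, tag_loadings = reduce_tags(doc_tag, k=2)

# Posts tagged inconsistently ("python" vs "py") end up close together in the
# latent space, while posts about a different subject stay far apart.
same_topic = np.linalg.norm(doc_latent[0] - doc_latent[1])
other_topic = np.linalg.norm(doc_latent[0] - doc_latent[3])
print(f"same topic: {same_topic:.3f}, other topic: {other_topic:.3f}")
```

The reduced rows in `doc_latent` could then be passed to any classifier in place of the raw tag vectors; which classifier and how many latent tags the paper uses is not stated in the abstract.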
International Journal of Web Engineering and Technology – Inderscience Publishers
Published: Jan 1, 2011