Access the full text.
Sign up today, get DeepDyve free for 14 days.
D. Isa, Lam Lee, V. Kallimani, R. Rajkumar (2008)
Text Document Preprocessing with the Bayes Formula for Classification Using the Support Vector MachineIEEE Transactions on Knowledge and Data Engineering, 20
D. Karaboğa, C. Ozturk (2010)
Fuzzy clustering with artificial bee colony algorithm
F. Sebastiani (2001)
Machine learning in automated text categorizationArXiv, cs.IR/0110053
Huan Liu, Lei Yu (2005)
Toward integrating feature selection algorithms for classification and clusteringIEEE Transactions on Knowledge and Data Engineering, 17
A. Mesleh, G. Kanaan (2008)
Support vector machine text classification system: Using Ant Colony Optimization based feature subset selection2008 International Conference on Computer Engineering & Systems
T. Ho, Kaname Funakoshi (1998)
Information Retrieval Using Rough Sets, 13
K. Porter, Y. Feig (1980)
The use of DAPI for identifying and counting aquatic microflora1Limnology and Oceanography, 25
Jian-xiong Dong, A. Krzyżak, C. Suen (2005)
Fast SVM training algorithm with decomposition on very large data setsIEEE Transactions on Pattern Analysis and Machine Intelligence, 27
Mohamed Bennasar, Y. Hicks, R. Setchi (2015)
Feature selection using Joint Mutual Information MaximisationExpert Syst. Appl., 42
T. Joachims (1998)
Text Categorization with Support Vector Machines: Learning with Many Relevant Features
Edda Leopold, J. Kindermann (2002)
Text Categorization with Support Vector Machines. How to Represent Texts in Input Space?Machine Learning, 46
Linli Xu, Dale Schuurmans (2005)
Unsupervised and Semi-Supervised Multi-Class Support Vector Machines
G. Chandrashekar, F. Sahin (2014)
A survey on feature selection methodsComput. Electr. Eng., 40
(2006)
kNN Arabic text categorization using IG feature selection
G. Salton (1989)
Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer
Wen Zhang, Taketoshi Yoshida, Xijin Tang (2008)
Text classification based on multi-word with support vector machineKnowl. Based Syst., 21
S. al-Harbi, Abdulrahman Almuhareb, A. Al-Thubaity, M. Khorsheed, A. Al-Rajeh (2008)
Automatic Arabic Text Classification
C. Apté, Fred Damerau, S. Weiss (1994)
Automated learning of decision rules for text categorizationACM Trans. Inf. Syst., 12
A. Uysal, Serkan Günal (2014)
The impact of preprocessing on text classificationInf. Process. Manag., 50
Intelligent Data Analysis, 1
Li Guo, S. Boukir (2015)
Fast data selection for SVM training using ensemble marginPattern Recognit. Lett., 51
Timothy O'Keefe (2009)
Feature Selection and Weighting Methods in Sentiment Analysis
Yiming Yang, Jan Pedersen (1997)
A Comparative Study on Feature Selection in Text Categorization
Zeyad Younus, D. Mohamad, T. Saba, M. Alkawaz, A. Rehman, Mznah Al-Rodhaan, A. Al-Dhelaan (2015)
Content-based image retrieval using PSO and k-means clustering algorithmArabian Journal of Geosciences, 8
B. Subanya, R. Rajalaxmi (2014)
Feature selection using Artificial Bee Colony for cardiovascular disease classification2014 International Conference on Electronics and Communication Systems (ICECS)
Chih-Ming Chen, Hahn-Ming Lee, Yu-Jung Chang (2009)
Two novel feature selection approaches for web page classificationExpert Syst. Appl., 36
H. Jung, Gahyun Kim (2014)
Support Vector Number Reduction: Survey and Experimental EvaluationsIEEE Transactions on Intelligent Transportation Systems, 15
Ahmet Ozkis, A. Babalık (2014)
Performance Comparison of ABC and A-ABC Algorithms on Clustering Problems
Mayy Al-Tahrawi, S. Al-Khatib (2015)
Arabic text classification using Polynomial NetworksJ. King Saud Univ. Comput. Inf. Sci., 27
M. Dash, Huan Liu (1997)
Feature Selection for ClassificationIntell. Data Anal., 1
Mauricio Schiezaro, H. Pedrini (2013)
Data feature selection based on Artificial Bee Colony algorithmEURASIP Journal on Image and Video Processing, 2013
Mehdi Aghdam, N. Ghasem-Aghaee, Mohammad Basiri (2009)
Text feature selection using ant colony optimizationExpert Syst. Appl., 36
Owing to the huge volume of documents available on the internet, text classification becomes a necessary task to handle these documents. To achieve optimal text classification results, feature selection, an important stage, is used to curtail the dimensionality of text documents by choosing suitable features. The main purpose of this research work is to classify the personal computer documents based on their content.Design/methodology/approachThis paper proposes a new algorithm for feature selection based on artificial bee colony (ABCFS) to enhance the text classification accuracy. The proposed algorithm (ABCFS) is scrutinized with the real and benchmark data sets, which is contrary to the other existing feature selection approaches such as information gain and χ2 statistic. To justify the efficiency of the proposed algorithm, the support vector machine (SVM) and improved SVM classifier are used in this paper.FindingsThe experiment was conducted on real and benchmark data sets. The real data set was collected in the form of documents that were stored in the personal computer, and the benchmark data set was collected from Reuters and 20 Newsgroups corpus. The results prove the performance of the proposed feature selection algorithm by enhancing the text document classification accuracy.Originality/valueThis paper proposes a new ABCFS algorithm for feature selection, evaluates the efficiency of the ABCFS algorithm and improves the support vector machine. In this paper, the ABCFS algorithm is used to select the features from text (unstructured) documents. Although, there is no text feature selection algorithm in the existing work, the ABCFS algorithm is used to select the data (structured) features. The proposed algorithm will classify the documents automatically based on their content.
Information Discovery and Delivery – Emerald Publishing
Published: Sep 6, 2019
Keywords: Information technology; Information science; Information retrieval; Information management; Information systems; Document management; Text classification; Feature selection; Information gain; χ2 statistic; Artificial bee colony; Support vector machine; Improved SVM
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.