Purpose: This paper presents an empirical study of the effect of two synthetic attributes on popular classification algorithms applied to data originating from student transcripts. The attributes represent past performance in a course and are defined as global performance (GP) and local performance (LP). The GP of a course is the aggregated performance achieved by all students who have taken that course; the LP of a course is the aggregated performance that the student taking the course achieved in its prerequisite courses.

Design/methodology/approach: The paper uses Educational Data Mining techniques to predict student performance in courses. It identifies the attributes that most strongly influence the prediction of the final grade (performance) and reports the effect of the two proposed attributes on the classification algorithms. As a research paradigm, the paper follows the Cross-Industry Standard Process for Data Mining, using the RapidMiner Studio software tool. Six classification algorithms are evaluated: C4.5 and CART decision trees, Naive Bayes, k-nearest neighbors, rule-based induction and support vector machines.

Findings: The results show that the synthetic attributes improved the performance of the classification algorithms and were ranked highly according to their influence on the target variable.

Originality/value: This paper proposes two synthetic attributes that are integrated into a real data set. The key motivation is to improve the quality of the data so that classification algorithms perform better. The paper also presents empirical results showing the effect of these attributes on the selected classification algorithms.
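The GP and LP definitions in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract says only that performance is "aggregated", so the use of the arithmetic mean is an assumption, as are the record layout, the grade-point scale, the course codes, and the function names `global_performance` and `local_performance`.

```python
from statistics import mean

# Hypothetical transcript records: (student_id, course_id, grade_points).
records = [
    ("s1", "MATH101", 3.0),
    ("s2", "MATH101", 4.0),
    ("s1", "CS101", 2.0),
    ("s2", "CS101", 3.0),
    ("s1", "CS201", 3.5),
]

# Hypothetical prerequisite map: course -> list of prerequisite courses.
prerequisites = {"CS201": ["MATH101", "CS101"]}


def global_performance(records, course_id):
    """GP: aggregate (assumed: mean) grade of all students who took the course."""
    grades = [g for _, c, g in records if c == course_id]
    return mean(grades) if grades else None


def local_performance(records, prerequisites, student_id, course_id):
    """LP: aggregate (assumed: mean) grade of one student in the course's prerequisites."""
    prereqs = set(prerequisites.get(course_id, []))
    grades = [g for s, c, g in records if s == student_id and c in prereqs]
    return mean(grades) if grades else None


gp_math = global_performance(records, "MATH101")
lp_s1_cs201 = local_performance(records, prerequisites, "s1", "CS201")
print(gp_math, lp_s1_cs201)
```

Both values would then be appended as extra columns to each (student, course) row before training a classifier. A course with no recorded prerequisites yields `None` here; how such cases are handled in the paper is not stated in the abstract.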
International Journal of Intelligent Computing and Cybernetics – Emerald Publishing
Published: Jun 12, 2017