Access the full text.
Sign up today, get DeepDyve free for 14 days.
David Croft, Gavin O'Kelly, Guanming Wu, R. Haw, M. Gillespie, L. Matthews, M. Caudy, P. Garapati, Gopal Gopinath, B. Jassal, S. Jupe, Irina Kalatskaya, S. Mahajan, Bruce May, N. Ndegwa, E. Schmidt, V. Shamovsky, C. Yung, E. Birney, H. Hermjakob, P. D’Eustachio, L. Stein (2010)
Reactome: a database of reactions, pathways and biological processesNucleic Acids Research, 39
H. Hoos (2005)
2 – SLS METHODS
M. Kuhn, P. Yates, C. Hyde (2016)
Statistical Methods for Drug Discovery
Silvia Yusta (2009)
Different metaheuristic strategies to solve the feature selection problemPattern Recognit. Lett., 30
L. Breiman (2001)
Random ForestsMachine Learning, 45
I. Witten, E. Frank, M. Hall, Chris Pal (2016)
Data Mining, Fourth Edition: Practical Machine Learning Tools and Techniques
David Noren, B. Long, R. Norel, K. Rhrissorrakrai, K. Hess, Chenyue Hu, Alex Bisberg, A. Schultz, E. Engquist, Li Liu, Xihui Lin, Gregory Chen, Honglei Xie, Geoffrey Hunter, P. Boutros, O. Stepanov, D. Consortium, Thea Norman, S. Friend, G. Stolovitzky, S. Kornblau, A. Qutub (2016)
A Crowdsourcing Approach to Developing and Assessing Prediction Algorithms for AML PrognosisPLoS Computational Biology, 12
M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, I. Witten (2009)
The WEKA data mining software: an updateSIGKDD Explor., 11
D. Boughaci (2013)
Metaheuristic Approaches for the Winner Determination Problem in Combinatorial Auction
H. Hoos, T. Stützle (2015)
Stochastic Local Search Algorithms: An Overview
M. Kanehisa, S. Goto (2000)
KEGG: Kyoto Encyclopedia of Genes and GenomesNucleic acids research, 28 1
R. Murphy (2011)
An active role for machine learning in drug development.Nature chemical biology, 7 6
E. Cerami, Benjamin Gross, E. Demir, I. Rodchenkov, O. Babur, Nadia Anwar, N. Schultz, Gary Bader, C. Sander (2010)
Pathway Commons, a web resource for biological pathway dataNucleic Acids Research, 39
Dexter Pratt, J. Chen, David Welker, Ricardo Rivas, R. Pillich, Vladimir Rynkov, K. Ono, Carol Miello, L. Hicks, S. Szalma, A. Stojmirović, R. Dobrin, Michael Braxenthaler, Jan Kuentzer, Barry Demchak, T. Ideker (2015)
NDEx, the Network Data Exchange.Cell systems, 1 4
Li Liu, Yung Chang, Tao Yang, David Noren, B. Long, S. Kornblau, A. Qutub, Jieping Ye (2016)
Evolution‐informed modeling improves outcome prediction for cancersEvolutionary Applications, 10
Yuanyuan Wang (2005)
Statistical Methods for High Throughput Screening Drug Discovery Data
A. Fabregat, S. Jupe, L. Matthews, K. Sidiropoulos, M. Gillespie, Maulik Kamdar, P. Garapati, R. Haw, B. Jassal, Florian Korninger, Bruce May, Marija Milacic, Corina Duenas, K. Rothfels, Cristoffer Sevilla, V. Shamovsky, Solomon Shorser, Thawfeek Varusai, Guilherme Viteri, Joel Weiser, Guanming Wu, L. Stein, H. Hermjakob, P. D’Eustachio (2013)
The Reactome pathway knowledgebaseNucleic Acids Research, 42
M. Hall (2003)
Correlation-based Feature Selection for Machine Learning
Bertrand Miannay, S. Minvielle, O. Roux, Pierre Drouin, H. Avet-Loiseau, C. Guérin-Charbonnel, W. Gouraud, M. Attal, T. Facon, N. Munshi, P. Moreau, L. Campion, F. Magrangeas, Carito Guziolowski (2017)
Logic programming reveals alteration of key transcription factors in multiple myelomaScientific Reports, 7
D. Türei, T. Korcsmáros, J. Saez-Rodriguez (2016)
moFF: a robust and automated approach to extract peptide ion intensitiesNature Methods, 13
Clarisse Dhaenens, L. Jourdan (2016)
On the Use of Metaheuristics for Feature Selection in Classification
A. Lima, E. Philot, G. Trossini, L. Scott, V. Maltarollo, K. Honório (2016)
Use of machine learning approaches for novel drug discoveryExpert Opinion on Drug Discovery, 11
Erik Gawehn, J. Hiss, G. Schneider (2016)
Deep Learning in Drug DiscoveryMolecular Informatics, 35
Andy Liaw, M. Wiener (2007)
Classification and Regression by randomForest
The use of data issued from high throughput technologies in drug target problems is widely widespread during the last decades. This study proposes a meta-heuristic framework using stochastic local search (SLS) combined with random forest (RF) where the aim is to specify the most important genes and proteins leading to the best classification of Acute Myeloid Leukemia (AML) patients. First we use a stochastic local search meta-heuristic as a feature selection technique to select the most significant proteins to be used in the classification task step. Then we apply RF to classify new patients into their corresponding classes. The evaluation technique is to run the RF classifier on the training data to get a model. Then, we apply this model on the test data to find the appropriate class. We use as metrics the balanced accuracy (BAC) and the area under the receiver operating characteristic curve (AUROC) to measure the performance of our model. The proposed method is evaluated on the dataset issued from DREAM 9 challenge. The comparison is done with a pure random forest (without feature selection), and with the two best ranked results of the DREAM 9 challenge. We used three types of data: only clinical data, only proteomics data, and finally clinical and proteomics data combined. The numerical results show that the highest scores are obtained when using clinical data alone, and the lowest is obtained when using proteomics data alone. Further, our method succeeds in finding promising results compared to the methods presented in the DREAM challenge.
Journal of Medical Systems – Springer Journals
Published: Jun 4, 2018
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.