Access the full text.
Sign up today, get DeepDyve free for 14 days.
Tomas Mikolov, Ilya Sutskever, Kai Chen, G. Corrado, J. Dean (2013)
Distributed Representations of Words and Phrases and their Compositionality
Daniela Steidl, Nils Göde (2013)
Feature-based detection of bugs in clones2013 7th International Workshop on Software Clones (IWSC)
Michel Chilowicz, É. Duris, G. Roussel (2009)
Syntax tree fingerprinting for source code similarity detection2009 IEEE 17th International Conference on Program Comprehension
R. Socher, Jeffrey Pennington, E. Huang, A. Ng, Christopher Manning (2011)
Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions
K. Hermann, Phil Blunsom (2014)
Multilingual Models for Compositional Distributed Semantics
Hao Peng, Lili Mou, Ge Li, Yuxuan Liu, Lu Zhang, Zhi Jin (2014)
Building Program Vector Representations for Deep LearningArXiv, abs/1409.3358
R. Socher, Alex Perelygin, Jean Wu, Jason Chuang, Christopher Manning, A. Ng, Christopher Potts (2013)
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
Yoshua Bengio, Aaron Courville, Pascal Vincent (2012)
Representation Learning: A Review and New PerspectivesIEEE Transactions on Pattern Analysis and Machine Intelligence, 35
Nal Kalchbrenner, Edward Grefenstette, Phil Blunsom (2014)
A Convolutional Neural Network for Modelling Sentences
Lili Mou, Ge Li, Lu Zhang, Tao Wang, Zhi Jin (2014)
Convolutional Neural Networks over Tree Structures for Programming Language Processing
John Pane, C. Ratanamahatana, B. Myers (2001)
Studying the language and structure in non-programmers' solutions to programming problemsInt. J. Hum. Comput. Stud., 54
Abram Hindle, Earl Barr, Z. Su, M. Gabel, Premkumar Devanbu (2016)
On the naturalness of softwareCommunications of the ACM, 59
S. Pinker (1994)
The Language Instinct
S. Pinker (1994)
The language instinct : the new science of language and mind
Lili Mou, Hao Peng, Ge Li, Yan Xu, Lu Zhang, Zhi Jin (2015)
Discriminative Neural Sentence Modeling by Tree-Based Convolution
Ronan Collobert, J. Weston (2008)
A unified architecture for natural language processing: deep neural networks with multitask learning
Achraf Ghabi, Alexander Egyed (2012)
Code patterns for automatically validating requirements-to-code traces2012 Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering
Laura Dietz, Valentin Dallmeier, A. Zeller, T. Scheffer (2009)
Localizing Bugs in Program Executions with Graphical Models
R. Socher, A. Karpathy, Quoc Le, Christopher Manning, A. Ng (2014)
Grounded Compositional Semantics for Finding and Describing Images with SentencesTransactions of the Association for Computational Linguistics, 2
Nicolas Bettenburg, Andrew Begel (2013)
Deciphering the story of software development through frequent pattern mining2013 35th International Conference on Software Engineering (ICSE)
Fabian Yamaguchi, Markus Lottmann, Konrad Rieck (2012)
Generalized vulnerability extrapolation using abstract syntax trees
I. Baxter, A. Yahin, L. Moura, Marcelo Sant'Anna, Lorraine Bier (1998)
Clone detection using abstract syntax treesProceedings. International Conference on Software Maintenance (Cat. No. 98CB36272)
George Dahl, Marc'Aurelio Ranzato, Abdel-rahman Mohamed, Geoffrey Hinton (2010)
Phone Recognition with the Mean-Covariance Restricted Boltzmann Machine
Dan Hao, T. Lan, Hongyu Zhang, C. Guo, Lu Zhang (2013)
Is This a Bug or an Obsolete Test?
A. Krizhevsky, Ilya Sutskever, Geoffrey Hinton (2012)
ImageNet classification with deep convolutional neural networksCommunications of the ACM, 60
R. Socher, E. Huang, Jeffrey Pennington, A. Ng, Christopher Manning (2011)
Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection
Ronan Collobert, J. Weston, L. Bottou, Michael Karlen, K. Kavukcuoglu, P. Kuksa (2011)
Natural Language Processing (Almost) from ScratchArXiv, abs/1103.0398
[In this chapter, we will apply the tree-based convolutional neural network (TBCNN) to the source code of programming languages, which we call programming language processing. In fact, programming language processing is a hot research topic in the field of software engineering; it has also aroused growing interest in the artificial intelligence community. A distinct characteristic of a program is that it contains rich, explicit, and complicated structural information, necessitating more intensive modeling of structures. In this chapter, we propose a TBCNN variant for programming language processing, where a convolution kernel is designed for programs’ abstract syntax trees. We show the effectiveness of TBCNN in two different program analysis tasks: classifying programs according to functionality, and detecting code snippets of certain patterns. TBCNN outperforms baseline methods, including several neural models for NLP.]
Published: Oct 2, 2018
Keywords: Tree-based convolution; Representation learning; Programming language processing; Program analysis
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.