We present EVE, an unsupervised explainable vector embedding technique built upon the structure of Wikipedia. The proposed model labels each dimension of a concept's semantic vector with a human-readable label, which makes the vector readily interpretable. Specifically, each vector is constructed using the Wikipedia category graph structure together with the Wikipedia article link structure. To test the effectiveness of the proposed model, we consider its usefulness in three fundamental tasks: 1) intruder detection, which evaluates its ability to identify a non-coherent vector in a list of otherwise coherent vectors; 2) ability to cluster, which evaluates its tendency to group related vectors together while keeping unrelated vectors in separate clusters; and 3) sorting relevant items first, which evaluates its ability to rank the vectors (items) relevant to a query at the top of the result list. For each task, we also propose a strategy to generate a task-specific human-interpretable explanation from the model. The three tasks and their explanations together demonstrate the overall effectiveness of the explainable embeddings generated by EVE. Finally, we compare EVE with the Word2Vec, FastText, and GloVe embedding techniques across the three tasks, and report improvements over the state of the art.
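To make the idea of labelled dimensions concrete, the following is a minimal sketch of the intruder detection task on hand-made vectors. It is not the EVE construction itself: the labels, the weights, and the helper names `cosine`, `find_intruder`, and `explain` are all assumptions invented for illustration, and the "lowest mean cosine similarity" rule is one simple stand-in for picking out a non-coherent vector.

```python
import math

# Hypothetical toy vectors. Every dimension is a human-readable label;
# the labels and weights are invented for illustration, not taken from EVE.
VECTORS = {
    "lion": {"Category:Felines": 0.9, "Category:Mammals": 0.6, "Article:Savanna": 0.4},
    "tiger": {"Category:Felines": 0.9, "Category:Mammals": 0.6, "Article:Jungle": 0.3},
    "cat": {"Category:Felines": 0.8, "Category:Mammals": 0.5, "Category:Pets": 0.7},
    "guitar": {"Category:String instruments": 0.9, "Article:Music": 0.6},
}


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as label -> weight."""
    dot = sum(weight * v[label] for label, weight in u.items() if label in v)
    norm_u = math.sqrt(sum(weight * weight for weight in u.values()))
    norm_v = math.sqrt(sum(weight * weight for weight in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def find_intruder(names, vectors):
    """Return the item with the lowest mean similarity to all the others."""
    def mean_similarity(name):
        others = [other for other in names if other != name]
        return sum(cosine(vectors[name], vectors[o]) for o in others) / len(others)

    return min(names, key=mean_similarity)


def explain(names, intruder, vectors, top_k=2):
    """List the labels shared by every non-intruder item, strongest first."""
    coherent = [name for name in names if name != intruder]
    shared = set(vectors[coherent[0]])
    for name in coherent[1:]:
        shared &= set(vectors[name])
    # Rank shared labels by total weight; break ties alphabetically.
    ranked = sorted(shared, key=lambda d: (-sum(vectors[n][d] for n in coherent), d))
    return ranked[:top_k]


names = list(VECTORS)
intruder = find_intruder(names, VECTORS)
print(intruder, explain(names, intruder, VECTORS))
```

Because the dimensions carry readable labels, the explanation falls out of the vectors directly: the coherent items share labels that the intruder lacks. A dense embedding with anonymous dimensions would not support this kind of readout without extra machinery.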
Journal of Intelligent Information Systems – Springer Journals
Published: Jun 4, 2018