Access the full text.
Sign up today, get DeepDyve free for 14 days.
Tomoharu Iwata, Shinji Watanabe, Takeshi Yamada, N. Ueda (2009)
Topic Tracking Model for Analyzing Consumer Purchase Behavior
Gael Poux-Medard, Julien Velcin, Sabine Loudcher (2021)
Powered Dirichlet Process for Controlling the Importance of "Rich-Get-Richer" Prior Assumptions in Bayesian ClusteringArXiv, abs/2104.12485
Nan Du, Mehrdad Farajtabar, Amr Ahmed, Alex Smola, Le Song (2015)
Dirichlet-Hawkes Processes with Applications to Clustering Continuous-Time Document StreamsProceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
M. Karsai, Hang-Hyun Jo, K. Kaski (2017)
Bursty Human DynamicsArXiv, abs/1803.02580
Jianhua Yin, Daren Chao, Zhongkun Liu, Wei Zhang, Xiaohui Yu, Jianyong Wang (2018)
Model-based Clustering of Short Text StreamsProceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
Gael Poux-Medard, Julien Velcin, Sabine Loudcher (2020)
Information Interactions in Outcome Prediction: Quantification and Interpretation using Stochastic Block ModelsProceedings of the 15th ACM Conference on Recommender Systems
Gael Poux-Medard, Julien Velcin, Sabine Loudcher (2021)
Information Interaction Profile of Choice AdoptionArXiv, abs/2104.13695
Xi Tan, Vinayak Rao, Jennifer Neville (2018)
The Indian Buffet Hawkes Process to Model Evolving Latent Influences
Xuerui Wang, A. McCallum (2006)
Topics over time: a non-Markov continuous-time model of topical trends
Hanna Wallach, S. Jensen, Lee Dicker, K. Heller (2008)
An Alternative Prior Process for Nonparametric Bayesian Clustering
Giannis Haralabopoulos, Ioannis Anagnostopoulos, S. Zeadally (2014)
Lifespan and propagation of information in On-line Social Networks: A case study based on RedditArXiv, abs/1403.1486
D. Blei, A. Ng, Michael Jordan (2009)
Latent Dirichlet Allocation
D. Blei, P. Frazier (2009)
Distance dependent Chinese restaurant processes
M. Welling (2006)
Flexible Priors for Infinite Mixture Models
E. Bacry, Martin Bompaire, P. Deegan, Stéphane Gaïffas, Søren Poulsen (2017)
tick: a Python Library for Statistical Learning, with an emphasis on Hawkes Processes and Time-Dependent ModelsJ. Mach. Learn. Res., 18
Gael Poux-Medard, Julien Velcin, Sabine Loudcher (2021)
Powered Hawkes-Dirichlet Process: Challenging Textual Clustering using a Flexible Temporal Prior2021 IEEE International Conference on Data Mining (ICDM)
(2006)
Dynamic topic models
Nan Du, Le Song, Alex Smola, M. Yuan (2012)
Learning Networks of Heterogeneous Influence
Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio (2014)
Neural Machine Translation by Jointly Learning to Align and TranslateCoRR, abs/1409.0473
Ming Yu, Varun Gupta, M. Kolar (2017)
An Influence-Receptivity Model for Topic Based Information Cascades2017 IEEE International Conference on Data Mining (ICDM)
Seth Myers, J. Leskovec (2012)
Clash of the Contagions: Cooperation and Competition in Information Diffusion2012 IEEE 12th International Conference on Data Mining
Mohit Rathore, D. Gupta, Dinabandhu Bhandari (2018)
Complaint Classification using Word2Vec ModelInternational Journal of Engineering & Technology
Charalampos Mavroforakis, Isabel Valera, M. Gomez-Rodriguez (2017)
Modeling the Dynamics of Learning Activity on the WebProceedings of the 26th International Conference on World Wide Web
Junyu Cao, Wei Sun, Z. Shen (2019)
Sequential Choice Bandits: Learning with Marketing FatigueOperations Research eJournal
Hongteng Xu, H. Zha (2017)
A Dirichlet Mixture Model of Hawkes Processes for Event Sequence Clustering
Amr Ahmed, E. Xing (2008)
Dynamic Non-Parametric Mixture Models and the Recurrent Chinese Restaurant Process: with Applications to Evolutionary Clustering
The textual content of a document and its publication date are intertwined. For example, the publication of a news article on a topic is influenced by previous publications on similar issues, according to underlying temporal dynamics. However, it can be challenging to retrieve meaningful information when textual information conveys little information or when temporal dynamics are hard to unveil. Furthermore, the textual content of a document is not always linked to its temporal dynamics. We develop a flexible method to create clusters of textual documents according to both their content and publication time, the powered Dirichlet–Hawkes process (PDHP). We show PDHP yields significantly better results than state-of-the-art models when temporal information or textual content is weakly informative. The PDHP also alleviates the hypothesis that textual content and temporal dynamics are always perfectly correlated. PDHP retrieves textual clusters, temporal clusters, or a mixture of both with high accuracy. We demonstrate that PDHP generalizes previous work –the Dirichlet–Hawkes process (DHP) and uniform process (UP). Finally, we illustrate the changes induced by PDHP over DHP and UP with a real-world application using Reddit data. We detail how PDHP recovers bursty dynamics and show that its limit case accounts for daily and weekly publication cycles.
Knowledge and Information Systems – Springer Journals
Published: Nov 1, 2022
Keywords: Clustering; Temporal Bayesian prior; Powered Dirichlet process; Hawkes process
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.