TY - JOUR AU - Niyato, Dusit AB - Abstract:In this paper, we introduce token communications (TokCom), a large model-driven framework to leverage cross-modal context information in generative semantic communications (GenSC). TokCom is a new paradigm, motivated by the recent success of generative foundation models and multimodal large language models (GFM/MLLMs), where the communication units are tokens, enabling efficient transformer-based token processing at the transmitter and receiver. In this paper, we introduce the potential opportunities and challenges of leveraging context in GenSC, explore how to integrate GFM/MLLMs-based token processing into semantic communication systems to leverage cross-modal context effectively at affordable complexity, present the key principles for efficient TokCom at various layers in future wireless networks. In a typical image semantic communication setup, we demonstrate a significant improvement of the bandwidth efficiency, achieved by TokCom by leveraging the context information among tokens. Finally, the potential research directions are identified to facilitate adoption of TokCom in future wireless networks. TI - Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications JF - Computing Research Repository DO - 10.48550/arxiv.2502.12096 DA - 2025-06-06 UR - https://www.deepdyve.com/lp/arxiv-cornell-university/token-communications-a-unified-framework-for-cross-modal-context-aware-L3QwMA0Jkx VL - 2025 IS - 2502 DP - DeepDyve ER -