Generative Semantic Communication: Architectures, Technologies, and Applications

Jinke Ren , Yaping Sun , Hongyang Du , Weiwen Yuan , Chongjie Wang , Xianda Wang , Yingbin Zhou , Ziwei Zhu , Fangxin Wang , Shuguang Cui

Engineering ››

PDF (3320KB)
Engineering ›› DOI: 10.1016/j.eng.2025.07.022
review-article

Generative Semantic Communication: Architectures, Technologies, and Applications

Author information +
History +
PDF (3320KB)

Abstract

Semantic communication (SemCom) has emerged as a transformative paradigm for future wireless networks, aiming to improve communication efficiency by transmitting only the semantic meaning (or its encoded version) of the source data rather than the complete set of bits (symbols). However, traditional deep learning-based SemCom systems present challenges such as limited generalization, low robustness, and inadequate reasoning capabilities, primarily due to the inherently discriminative nature of deep neural networks. To address these limitations, generative artificial intelligence (GAI) is seen as a promising solution, offering notable advantages in learning complex data distributions, transforming data between high- and low-dimensional spaces, and generating high-quality content.

Keywords

Semantic communication / Generative artificial intelligence / Large language model / Variational autoencoder / Generative adversarial network / Diffusion model

Cite this article

Download citation ▾
Jinke Ren, Yaping Sun, Hongyang Du, Weiwen Yuan, Chongjie Wang, Xianda Wang, Yingbin Zhou, Ziwei Zhu, Fangxin Wang, Shuguang Cui. Generative Semantic Communication: Architectures, Technologies, and Applications. Engineering DOI:10.1016/j.eng.2025.07.022

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Saad W, Bennis M, Chen M.A vision of 6G wireless systems: applications, trends, technologies, and open research problems.IEEE Netw 2019; 34(3):134-142.

[2]

Shannon CE.A mathematical theory of communication.Bell Syst Tech J 1948; 27(3):379-423.

[3]

Weaver W.Recent contributions to the mathematical theory of communication.ETC 1953; 1:261-281.

[4]

Gündüz D, Qin Z, Aguerri IE, Dhillon HS, Yang Z, Yener A, et al.Beyond transmitting bits: context, semantics, and task-oriented communications.IEEE J Sel Areas Commun 2022; 41(1):5-41.

[5]

Wen D, Liu P, Zhu G, Shi Y, Xu J, Eldar YC, et al.Task-oriented sensing, computation, and communication integration for multi-device edge AI.IEEE Trans Wirel Commun 2023; 23(3):2486-2502.

[6]

Yang W, Du H, Liew ZQ, Lim WY, Xiong Z, Niyato D, et al.Semantic communications for future Internet: fundamentals, applications, and challenges.IEEE Commun Surv Tutor 2022; 25(1):213-250.

[7]

Xie H, Qin Z, Li GY, Juang BH.Deep learning enabled semantic communication systems.IEEE Trans Signal Process 2021; 69:2663-2675.

[8]

Huang D, Gao F, Tao X, Du Q, Lu J.Toward semantic communications: deep learning-based image semantic coding.IEEE J Sel Areas Commun 2022; 41(1):55-71.

[9]

Weng Z, Qin Z, Tao X, Pan C, Liu G, Li GY.Deep learning enabled semantic communications with speech recognition and synthesis.IEEE Trans Wirel Commun 2023; 22(9):6227-6240.

[10]

Yang W, Xiong Z, Quek TQ, Shen X.Streamlined transmission: a semantic-aware XR deployment framework enhanced by generative AI.IEEE Netw 2024; 38(6):29-38.

[11]

Xia L, Sun Y, Liang C, Zhang L, Imran MA, Niyato D.Generative AI for semantic communication: architecture, challenges. and outlook., arXiv:2308.15483 (2023)

[12]

Liang C, Du H, Sun Y, Niyato D, Kang J, Zhao D, et al.Generative AI-driven semantic communication networks: architecture, technologies and applications.IEEE Trans Cognit Commun Netw 2024; 11(1):27-47.

[13]

Grassucci E, Park J, Barbarossa S, Kim SL, Choi J, Comminiello D. Generative AI meets semantic communication: evolution and revolution of communication tasks., arXiv:2401.06803 (2024)

[14]

Ren J, Zhang Z, Xu J, Chen G, Sun Y, Zhang P, et al.Knowledge base enabled semantic communication: a generative perspective.IEEE Wirel Commun 2024; 31(4):14-22.

[15]

Kingma DP, Welling M.Auto-encoding variational bayes.2013. arXiv: 1312.6114.

[16]

Razavi A, Van Aden Oord, Vinyals O. Generating diverse high-fidelity images with VQ-VAE-2, Curran Associates Inc., Vancouver, BC, Canada. New York City (2019), pp. 14866-14876

[17]

Li T, Zhao XM, Li L.Co-VAE: drug-target binding affinity prediction by co-regularized variational autoencoders.IEEE Trans Pattern Anal Mach Intell 2021; 44(12):8861-8873.

[18]

Saidutta YM, Abdi A, Fekri F. VAE for joint source–channel coding of distributed Gaussian sources over AWGN MAC, IEEE, Atlanta, GA, USA. New York City (2020), pp. 1-5

[19]

Erdemir E, Dragotti PL, Gündüz D. Privacy-aware communication over a wiretap channel with generative networks, IEEE, Singapore. New York City (2022), pp. 2989-2993

[20]

Alawad MA, Hamdan MQ, Hamdi KA.Innovative variational autoencoder for an end-to-end communication system.IEEE Access 2022; 11:86834-86847.

[21]

Xi Z, Yiqian Z, Congduan L, Xiao M.Variational neural inference enhanced text semantic communication system.China Commun 2024; 21(7):50-64.

[22]

Yao S, Xiao Z, Wang S, Dai J, Niu K, Zhang P. Variational speech waveform compression to catalyze semantic communications, IEEE, Glasgow, UK. New York City (2023), pp. 1-6

[23]

Bo Y, Duan Y, Shao S, Tao M.Joint coding-modulation for digital semantic communications via variational autoencoder.IEEE Trans Commun 2024; 72(9):5626-5640.

[24]

Seon J, Lee S, Kim J, Hyun SKim, Ghyu YSun, Seo H, et al.Deep reinforced segment selection and equalization for task-oriented semantic communication.IEEE Commun Lett 2024; 28(8):1865-1869.

[25]

Zhang H, Tao M, Sun Y, Letaief KB.Improving learning-based semantic coding efficiency for image transmission via shared semantic-aware codebook.IEEE Trans Commun 2024; 73(2):1217-1232.

[26]

Chen D, Hua W.Hierarchical VAE based semantic communications for POMDP tasks.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2024 Apr 14–19; Seoul, Republic of Korea. New York City: IEE E; 2024. p. 5540–4.

[27]

Zhang G, Li H, Cai Y, Hu Q, Yu G, Zhang R. Learned image transmission with hierarchical variational autoencoder., arXiv:2408.16340 (2024)

[28]

Li S, Sun Y, Zhang J, Cai K, Cui S, Xu X. End-to-end generative semantic communication powered by shared semantic knowledge base., arXiv:2405.05738 (2024)

[29]

Xie B, Wu Y, Shi Y, Zhang W, Cui S, Debbah M. Robust image semantic coding with learnable CSI fusion masking over MIMO fading channels., arXiv:2406.07389 (2024)

[30]

Ma S, Qiao W, Wu Y, Li H, Shi G, Gao D, et al.Task-oriented explainable semantic communications.IEEE Trans Wirel Commun 2023; 22(12):9248-9262.

[31]

Nemati M, Park J, Choi J.VQ-VAE empowered wireless communication for joint source–channel coding and beyond.In: Proceedings of IEEEGlobal Communications Conference; 2023 Dec 48; Kuala Lumpur, Malaysia. New York City: IEE E; 2023. p. 3155–60.

[32]

Hu Q, Zhang G, Qin Z, Cai Y, Yu G, Li GY.Robust semantic communications with masked VQ-VAE enabled codebook.IEEE Trans Wirel Commun 2023; 22(12):8707-8722.

[33]

Talli P, Pase F, Chiariotti F, Zanella A, Zorzi M.Effective communication with dynamic feature compression.IEEE Trans Commun 2024; 72(9):5595-5610.

[34]

Choi J, Park J, Ko SW, Choi J, Bennis M, Kim SL.Semantics alignment via split learning for resilient multi-user semantic communication.IEEE Trans Vehicular Technol 2024; 73(10):15815-15819.

[35]

Si P, Liu R, Qian L, Zhao J, Lam KY.Post-deployment fine-tunable semantic communication.IEEE Trans Wirel Commun 2024; 240(1):35-50.

[36]

Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, et al. Generative adversarial nets, Curran Associates Inc., Montreal, QC, Canada. New York City (2014), pp. 2672-2680

[37]

Han T, Tang J, Yang Q, Duan Y, Zhang Z, Shi Z.Generative model based highly efficient semantic communication approach for image transmission.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2023 Jun 4–10; Rhodes Island, Greece. New York City: IEE E; 2023. p. 1–5.

[38]

Huang D, Tao X, Gao F, Lu J. Deep learning-based image semantic coding for semantic communications, IEEE, Madrid, Spain. New York City (2021), pp. 1-6

[39]

Tang S, Yang Q, Gündüz D, Zhang Z. Evolving semantic communication with generative model., arXiv:2403.20237 (2024)

[40]

Zhang H, Shao S, Tao M, Bi X, Letaief KB.Deep learning-enabled semantic communication systems with task-unaware transmitter and dynamic data.IEEE J Sel Areas Commun 2022; 41(1):170-185.

[41]

He Q, Yuan H, Feng D, Che B, Chen Z, Xia XG.Robust semantic transmission of images with generative adversarial networks.Rio de Janeiro, IEEE, Brazil. New York City 2022; 3953-3958.

[42]

Wang J, Wang S, Dai J, Si Z, Zhou D, Niu K.Perceptual learned source–channel coding for high-fidelity image semantic transmission.Rio de Janeiro, IEEE, Brazil. New York City 2022; 3959-3964.

[43]

Xin G, Fan P, Letaief KB, Peng C. Deep conditional generative semantic communication for image transmission, IEEE, Denver, CO, USA. New York City (2024), pp. 1073-1078

[44]

Tan K, Dai J, Liu Z, Wang S, Qin X, Xu W, et al.Rate-distortion-perception controllable joint source–channel coding for high-fidelity generative semantic communications.IEEE Trans Cognit Commun Netw 2024; 11(2):672-689.

[45]

Erdemir E, Tung TY, Dragotti PL, Gündüz D.Generative joint source–channel coding for semantic image transmission.IEEE J Sel Areas Commun 2023; 41(8):2645-2657.

[46]

Bourtsoulatze E, Burth DKurka, Gündüz D.Deep joint source–channel coding for wireless image transmission.IEEE Trans Cogn Commun Netw 2019; 5(3):567-579.

[47]

Yu K, He Q, Wu G.Two-way semantic communications without feedback.IEEE Trans Veh Technol 2024; 73(6):9077-9082.

[48]

Dong C, Liang H, Xu X, Han S, Wang B, Zhang P.Semantic communication system based on semantic slice models propagation.IEEE J Sel Areas Commun 2022; 41(1):202-213.

[49]

Lokumarambage MU, Gowrisetty VS, Rezaei H, Sivalingam T, Rajatheva N, Fernando A.Wireless end-to-end image transmission system using semantic communications.IEEE Access 2023; 11:37149-37163.

[50]

Miao Y, Yan J, Wang Y, Li Z, Hu D. A semantic communication system based on vector quantization and generative model, IEEE, Guangzhou, China. New York City (2024), pp. 46-50

[51]

Zhou Y, Sun Y, Chen G, Xu X, Chen H, Huang B, et al. MOC-RVQ: multilevel codebook-assisted digital generative semantic communication., arXiv:2401.01272 (2024)

[52]

Fu Q, Xie H, Qin Z, Slabaugh G, Tao X.Vector quantized semantic communication system.IEEE Wirel Commun Lett 2023; 12(6):982-986.

[53]

Mao J, Xiong K, Liu M, Qin Z, Chen W, Fan P, et al.A GAN-based semantic communication for text without CSI.IEEE Trans Wirel Commun 2024; 23(10):14498-14514.

[54]

Yang L, Zhang Z, Song Y, Hong S, Xu R, Zhao Y, et al.Diffusion models: a comprehensive survey of methods and applications.ACM Comput Surv 2023; 56(4):1-39.

[55]

Ho J, Jain A, Abbeel P. Denoising diffusion probabilistic models, Curran Associates Inc., Vancouver, BC, Canada. New York City (2020), pp. 6840-6851

[56]

Song Y, Ermon S. Generative modeling by estimating gradients of the data distribution, Curran Associates Inc., Vancouver, BC, Canada. New York City (2019), pp. 11918-11930

[57]

Song Y, Sohl-Dickstein J, Kingma DP, Kumar A, Ermon S, Poole B. Score-based generative modeling through stochastic different equations, Curran Associates Inc., Vienna, Austria. New York City (2021), pp. 37799-37812

[58]

Rombach R, Blattmann A, Lorenz D, Esser P, Ommer B. High-resolution image synthesis with latent diffusion models, IEEE, New Orleans, LA, USA. New York City (2022), pp. 10684-10695

[59]

Kollovieh M, Ansari AF, Bohlke-Schneider M, Zschiegner J, Wang H, Predict WYB, et al. synthesize: self-guiding diffusion models for probabilistic time series forecasting, Curran Associates Inc., Vancouver, BC, Canada. New York City (2023), pp. 28341-28364

[60]

Jiang Z, Liu X, Yang G, Li W, Li A, Wang G.DIFFSC: semantic communication framework with enhanced denoising through diffusion probabilistic models.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2024 Apr 14–19; Seoul, Republic of Korea. New York City: IEE E; 2024. p. 13071–5.

[61]

Guo L, Chen W, Sun Y, Ai B, Pappas N, Quek T. Diffusion-driven semantic communication for generative models with bandwidth constraints., arXiv:2407.18468 (2024)

[62]

Chen J, You D, Gündüz D, Dragotti PL.CommIN: semantic image communications as an inverse problem with INN-guided diffusion models.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2024 Apr 14–19; Seoul, Republic of Korea. New York City: IEE E; 2024. p. 6675–9.

[63]

Yuan W, Ren J, Wang C, Zhang R, Wei J, Kim DI, et al.Generative semantic communication for joint image transmission and segmentation.2024. arXiv: 2411.18005.

[64]

Qiang C, Li H, Ni H, Qu H, Fu R, Wang T, et al.Minimally-supervised speech synthesis with conditional diffusion model and language model: a comparative study of semantic coding.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2024 Apr 14–19; Seoul, Republic of Korea. New York City: IEE E; 2024. p. 10186–90.

[65]

Grassucci E, Barbarossa S, Comminiello D. Generative semantic communication: diffusion models beyond bit recovery., arXiv:2306.04321 (2023)

[66]

Grassucci E, Choi J, Park J, Gramaccioni RF, Cicchetti G, Comminiello D. Rethinking multi-user semantic communications with deep generative models., arXiv:2405.09866 (2024)

[67]

Pignata G, Grassucci E, Cicchetti G, Comminiello D. Lightweight diffusion models for resource-constrained semantic communication., arXiv:2410.02491 (2024)

[68]

Zhang K, Li L, Lin W, Yan Y, Cheng W, Han Z. SC-CDM: enhancing quality of image semantic communication with a compact diffusion model., arXiv:2410.02121 (2024)

[69]

Fan S, Bao Z, Dong C, Liang H, Xu X, Zhang P. Semantic feature decomposition based semantic communication system of images with large-scale visual generation models., arXiv:2410.20126 (2024)

[70]

Qiao L, Mashhadi MB, Gao Z, Foh CH, Xiao P, Bennis M.Latency-aware generative semantic communications with pre-trained diffusion models.IEEE Wirel Commun Lett 2024; 13(10):2652-2656.

[71]

Wang Y, Yang W, Xiong Z, Zhao Y, Mao S, Quek TQ, et al.FAST-GSC: fast and adaptive semantic transmission for generative semantic communication.2024. arXiv: 2407.15395.

[72]

Pezone F, Musa O, Caire G, Barbarossa S.Semantic-preserving image coding based on conditional diffusion models.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2024 Apr 14–19; Seoul, Republic of Korea. New York City: IEE E; 2024. p. 13501–5.

[73]

Fu W, Xu L, Wu X, Wei H, Wang L. Multimodal generative semantic communication based on latent diffusion model, IEEE, London, UK. New York City (2024), pp. 1-6

[74]

Mingkai C, Minghao L, Zhe Z, Zhiping X, Lei W.Task-oriented semantic communication with foundation models.China Commun 2024; 21(7):65-77.

[75]

Jiang P, Wen CK, Li X, Jin S. Adaptive semantic image transmission using generative foundation model, IEEE, London, UK. New York City (2024), pp. 1-6

[76]

Grassucci E, Marinoni C, Rodriguez A, Comminiello D.Diffusion models for audio semantic communication.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2024 Apr 14–19; Seoul, Republic of Korea. New York City: IEE E; 2024. p. 13136–40.

[77]

Chen M, Liu M, Wang C, Song X, Zhang Z, Xie Y, et al.Cross-modal graph semantic communication assisted by generative AI in the metaverse for 6G.Research 2024; 7:0342.

[78]

Yang M, Gao D, Xie F, Li J, Song X, Shi G.SG2SC: a generative semantic communication framework for scene understanding-oriented image transmission.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2024 Apr 14–19; Seoul, Republic of Korea. New York City: IEE E; 2024. p. 13486–90.

[79]

Zhang H, Bao Z, Liang H, Liu Y, Dong C, Li L. Diffusion-based wireless semantic communication for VR image, IEEE, Hangzhou, China. New York City (2024), pp. 639-644

[80]

Xu B, Meng R, Chen Y, Xu X, Dong C, Sun H.Latent semantic diffusion-based channel adaptive de-noising SemCom for future 6G systems.In: Proceedings of IEEEGlobal Communications Conference; 2023 Dec 4–8; Kuala Lumpur, Malaysia. New York City: IEE E; 2023. p. 1229–34.

[81]

Wu T, Chen Z, He D, Qian L, Xu Y, Tao M, et al.CDDM: channel denoising diffusion models for wireless semantic communications.IEEE Trans Wirel Commun 2024; 23(9):11168-11183.

[82]

Li N, Deng Y. Goal-oriented semantic communication for wireless image transmission via stable diffusion., arXiv:2408.00428 (2024)

[83]

Pei J, Feng C, Wang P, Tabassum H, Shi D. Latent diffusion model-enabled real-time semantic communication considering semantic ambiguities and channel noises., arXiv:2406.06644 (2024)

[84]

Zeng Y, He X, Chen X, Tong H, Yang Z, Guo Y, et al. DMCE: diffusion model channel enhancer for multi-user semantic communication systems, IEEE, Denver, CO, USA. New York City (2024), pp. 855-860

[85]

Jiang F, Peng Y, Dong L, Wang K, Yang K, Pan C, et al. Large generative model assisted 3D semantic communication., arXiv:2403.05783 (2024)

[86]

Duan Y, Wu T, Chen Z, Tao M. DM-MIMO: diffusion models for robust semantic communications over MIMO channels, IEEE, Hangzhou, China. New York City (2024), pp. 1609-1614

[87]

Yang P, Zhang G, Cai Y. Rate-adaptive generative semantic communication using conditional diffusion models., arXiv:2409.02597 (2024)

[88]

Xie H, Qin Z, Han Z, Letaief KB.Hybrid digital-analog semantic communications.2024. arXiv: 2405.12580.

[89]

Ren X, Wu J, Xu H, Chen X. Diffusion model based secure semantic communications with adversarial purification, IEEE, New York City, NY, USA. New York City (2024), pp. 130-134

[90]

Du H, Wang J, Niyato D, Kang J, Xiong Z, Kim DI.AI-generated incentive mechanism and full-duplex semantic communications for information sharing.IEEE J Sel Areas Commun 2023; 41(9):2981-2997.

[91]

Zheng J, Du B, Du H, Kang J, Niyato D, Zhang H.Energy-efficient resource allocation in generative AI-aided secure semantic mobile networks.IEEE Trans Mobile Comput 2024; 23(12):11422-11435.

[92]

Liu J, Xiao M, Wen J, Kang J, Zhang R, Zhang T, et al. Optimizing resource allocation for multi-modal semantic communication in mobile AIGC networks: a diffusion-based game approach., arXiv:2409.17506 (2024)

[93]

Du B, Du H, Liu H, Niyato D, Xin P, Yu J, et al.YOLO-based semantic communication with generative AI-aided resource allocation for digital twins construction.IEEE Internet Things J 2023; 11(5):7664-7678.

[94]

Xu C, Mashhadi MB, Ma Y, Tafazolli R. Semantic-aware power allocation for generative semantic communications with foundation models., arXiv:2407.03050 (2024)

[95]

Zhao WX, Zhou K, Li J, Tang T, Wang X, Hou Y, et al. A survey of large language models., arXiv:2303.18223 (2023)

[96]

Wan Z, Wang X, Liu C, Alam S, Zheng Y, Liu J, et al. Efficient large language models: a survey., arXiv:2312.03863 (2023)

[97]

Kenton JD, Toutanova LKBERT. pre-training of deep bidirectional transformers for language understanding, Association for Computing Machiner, Minneapolis, MN, USA. New York City (2019), pp. 4171-4186

[98]

Open AI, Achiam J, Adler S, Agarwal S, Ahmad L, Akkaya I, et al. GPT-4 technical report., arXiv:2303.08774 (2023)

[99]

Raffel C, Shazeer N, Roberts A, Lee K, Narang S, Matena M, et al.Exploring the limits of transfer learning with a unified text-to-text transformer.J Mach Learn Res 2020; 21(140):1-67.

[100]

Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need, Curran Associates Inc., Long Beach, CA, USA. New York City (2017), pp. 5998-6008

[101]

Wang Y, Sun Z, Fan J, Ma H. On the uses of large language models to design end-to-end learning semantic communication, IEEE, Dubai, United Arab Emirate. New York City (2024), pp. 1-6

[102]

Chang MK, Hsu CT, Yang GC.Gen SC: generative semantic communication systems using BART-like model.IEEE Commun Lett 2024; 28(10):2298-2302.

[103]

Wang Z, Zou L, Wei S, Liao F, Zhuo J, Mi H, et al. Large language model enabled semantic communication systems., arXiv:2407.14112 (2024)

[104]

Wei X, Tong H, Yang N, Yin C. Language-oriented semantic communication for image transmission with fine-tuned diffusion model., arXiv:2409.17104 (2024)

[105]

Nam H, Park J, Choi J, Bennis M, Kim SL.Language-oriented communication with semantic coding and knowledge distillation for text-to-image generation.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2024 Apr 14–19; Seoul, Republic of Korea. New York City: IEE E; 2024. p. 13506–10.

[106]

Zhao Y, Yue Y, Hou S, Cheng B, Huang Y.LaMoSC: large language model-driven semantic communication system for visual transmission.IEEE Trans Cognit Commun Netw 2024; 10(6):2005-2018.

[107]

Du H, Liu G, Niyato D, Zhang J, Kang J, Xiong Z, et al.Generative Al-aided joint training-free secure semantic communications via multi-modal prompts.In: Proceedings of IEEEInternational Conference on Acoustics, Speech and Signal Processing; 2024 Apr 14–19; Seoul, Republic of Korea. New York City: IEE E; 2024. p. 12896–900.

[108]

Wang X, Ye D, Feng C, Yang HH, Chen X, Quek TQ.Trustworthy image semantic communication with GenAI: explainablity, controllability, and efficiency.2024. ar Xiv:2408.03806 .

[109]

Ren M, Qiao L, Yang L, Gao Z, Chen J, Mashhadi MB, et al.Generative semantic communication via textual prompts: latency performance tradeoffs.2024. arXiv: 2409.09715.

[110]

Cao D, Wu J, Bashir AK. Multimodal large language models driven privacy-preserving wireless semantic communication in 6G, IEEE, Denver, CO, USA. New York City (2024), pp. 171-176

[111]

Zhao F, Sun Y, Feng L, Zhang L, Zhao D.Enhancing reasoning ability in semantic communication through generative AI-assisted knowledge construction.IEEE Commun Lett 2024; 28(4):832-836.

[112]

Jiang F, Peng Y, Dong L, Wang K, Yang K, Pan C, et al.Large AI model-based semantic communications.IEEE Wirel Commun 2024; 31(3):68-75.

[113]

Guo S, Wang Y, Ye J, Zhang A, Xu K. Semantic importance-aware communications with semantic correction using large language models., arXiv:2405.16011 (2024)

[114]

Jiang F, Tang C, Dong L, Wang K, Yang K, Pan C. Visual language model based cross-modal semantic communication systems., arXiv:2407.00020 (2024)

[115]

Zheng J, Ren J, Xu P, Yuan Z, Xu J, Wang F, et al. Generative semantic communication for text-to-speech synthesis., arXiv:2410.03459 (2024)

[116]

Jiang F, Dong L, Peng Y, Wang K, Yang K, Pan C, et al.Large AI model empowered multimodal semantic communications.IEEE Commun Mag 2025; 63(1):76-82.

[117]

Yang W, Xiong Z, Mao S, Quek TQ, Zhang P, Debbah M, et al.Rethinking generative semantic communication for multi-user systems with multi-modal LLM.2024. arXiv: 2408.08765.

[118]

Xie H, Qin Z, Tao X, Han Z.Towards intelligent communications: large model empowered semantic communications.2024. arXiv: 2402.13073.

[119]

Jiang P, Wen CK, Yi X, Li X, Jin S, Zhang J.Semantic communications using foundation models: design approaches and open issues.IEEE Wirel Commun 2024; 31(3):76-84.

[120]

Zhang F, Du Y, Chen K, Shao Y, Liew SC.Addressing out-of-distribution challenges in image semantic communication systems with multi-modal large language models.2024. arXiv: 2407.15335.

[121]

Jiang P, Wen CK, Li X, Jin S, Li GY.Semantic satellite communications based on generative foundation model.2024. arXiv: 2404.11941.

[122]

Chen W, Xu W, Chen H, Zhang X, Qin Z, Zhang Y, et al. Semantic communication based on large language model for underwater image transmission., arXiv:2408.12616 (2024)

[123]

Kalita A. Large language models (LLMs) for semantic communication in edge-based IoT networks., arXiv:2407.20970 (2024)

[124]

Durante Z, Huang Q, Wake N, Gong R, Sung JPark, Sarkar B, et al. Agent AI: surveying the horizons of multimodal interaction., arXiv:2401.03568 (2024)

[125]

Girdhar R, El-Nouby A, Liu Z, Singh M, Alwala KV, Joulin A, et al. ImageBind: one embedding space to bind them all, IEEE, Vancouver, BC, Canada. New York City (2023), pp. 15180-15190

[126]

Wang X, Zhang X, Luo Z, Sun Q, Cui Y, Wang J, et al. Emu3: next-token prediction is all you need., arXiv:2409.18869 (2024)

[127]

Touvron H, Martin L, Stone K, Albert P, Almahairi A, Babaei Y, et al. Llama 2: open foundation and fine-tuned chat models., arXiv:2307.09288 (2023)

[128]

Han Z, Gao C, Liu J, Zhang J, Zhang SQ. Parameter-efficient fine-tuning for large models: a comprehensive survey., arXiv:2403.14608 (2024)

[129]

Zhang D, Yu Y, Dong J, Li C, Su D, Chu C, et al. MM-LLMs: recent advances in multimodal large language models., arXiv:2401.13601 (2024)

[130]

Faisal MM.Yolov8-deepsort-object-tracking 2023 [Internet].San Francisco: GetHub; [cited 2025 Jun 30]. Available from: https://github.com/MuhammadMoinFaisal/YOLOv8-DeepSORT-Object-Tracking.

[131]

Chen Z, Wu J, Wang W, Su W, Chen G, Xing S, et al. InternVL: scaling up vision foundation models and aligning for generic visual-linguistic tasks., arXiv:2312.14238 (2023)

[132]

Yang A, Yang B, Hui B, Zheng B, Yu B, Zhou C, et al. Qwen2 technical report., arXiv:2407.10671 (2024)

[133]

Tanner H.All-MiniLm-L6-v2 2024 [Internet].San Francisco: GetHub; [cited 2025 Jun 30]. Available from: https://github.com/henrytanner52/all-MiniLM-L6-v2.

[134]

Liu ZS, Siu WC, Wang LW, Li CT, Cani MP. Unsupervised real image super-resolution via generative variational autoencoder, IEEE, Seattle, WA, USA. New York City (2020), pp. 442-443

[135]

Wang X, Xie L, Dong C, Real-ESRGAN SY. training real-world blind super-resolution with pure synthetic data, IEEE, Nashville, TN, USA. New York City (2021), pp. 1905-1914

[136]

Lin X, He J, Chen Z, Lyu Z, Dai B, Yu F, et al.DiffBIR: towards blind image restoration with generative diffusion prior.2023. arXiv: 2308.15070.

[137]

Lin Y, Zheng L, Zheng Z, Wu Y, Hu Z, Yan C, et al.Improving person re-identification by attribute and identity learning.Pattern Recognit 2019; 95:151-161.

[138]

Sisinni E, Saifullah A, Han S, Jennehag U, Gidlund M.Industrial Internet of Things: challenges, opportunities, and directions.IEEE Trans Industr Inform 2018; 14(11):4724-4734.

[139]

Lu J, Yang W, Xiong Z, Xing C, Tafazolli R, Quek TQ, et al.Generative AI-enhanced multi-modal semantic communication in Internet of Vehicles: system design and methodologies.2024. arXiv: 2409.15642.

[140]

Lin Y, Gao Z, Du H, Niyato D, Kang J, Jamalipour A, et al.A unified framework for integrating semantic communication and AI-generated content in metaverse.IEEE Netw 2023; 38(4):174-181.

[141]

Huang D, Ge M, Xiang K, Zhang X, Yang H.Privacy preservation of large language models in the metaverse era: research frontiers, categorical comparisons, and future directions.Int J Netw Manag 2024; 35(1):e2292.

[142]

Kurai R, Hiraki T, Hiroi Y, Hirao Y, Perusquia-Hernandez M, Uchiyama H, et al. MagicItem: dynamic behavior design of virtual objects with large language models in a consumer metaverse platform., arXiv:2406.13242 (2024)

[143]

Xiao G, Lin J, Seznec M, Wu H, Demouth J, Han S. SmoothQuant: accurate and efficient post-training quantization for large language models, Association for Computing Machinery, Honolulu, HI, USA: New York City (2023), pp. 38087-38099

[144]

Ma X, Fang G, Wang X. On the structural pruning of large language models, Curran Associates Inc., New Orleans, LA, USA. New York City (2023), pp. 21702-21720

[145]

Gu Y, Dong L, Wei F, Huang M.MiniLLM: knowledge distillation of large language models. dblp: computer science bibliography, Vienna, Austria. Trier (2024)

[146]

DeepSeek AI, Liu A, Feng B, Xue B, Wang B, Wu B, et al.Deepseek-v3 technical report.2024. arXiv: 2412.19437.

[147]

Saha R, Sagan N, Srivastava V, Goldsmith A, Pilanci M. Compressing large language models using low rank and low precision decomposition, Curran Associates Inc., Vancouver, BC, Canada. New York City (2024), pp. 88981-89018

[148]

Xue N, Sun Y, Chen Z, Tao M, Xu X, Qian L, et al. WDMoE: wireless distributed large language models with mixture of experts., arXiv:2405.03131 (2024)

[149]

Wu D, Wang X, Qiao Y, Wang Z, Jiang J, Cui S, et al. NetLLM: adapting large language models for networking, Association for Computing Machinery, Sydney, NSW, Australia. New York City (2024), pp. 661-678

[150]

Wang L, Zhang X, Su H, Zhu J.A comprehensive survey of continual learning: theory, method and application.IEEE Trans Pattern Anal Mach Intell 2024; 46(8):5362-5383.

[151]

Li S, Sun Y, Zhang J, Cai K, Chen H, Cui S, et al.Cooperative semantic knowledge base update policy for multiple semantic communication pairs.2024. arXiv: 2410.02405.

[152]

Wang X, Peng J, Xu K, Yao H, Chen T. Reinforcement learning-driven LLM agent for automated attacks on LLMs, Association for Computing Machinery, Bangkok, Thailand. New York City (2024), pp. 170-177

[153]

Jiang F, Xu Z, Niu L, Wang B, Jia J, Li B, et al. POSTER: identifying and mitigating vulnerabilities in LLM-integrated applications, Association for Computing Machinery, Singapore. New York City (2024), pp. 1949-1951

[154]

Yu L, Do V, Hambardzumyan K, Cancedda N.Robust LLM safeguarding via refusal feature adversarial training.2024. arXiv: 2409.20089.

[155]

Xu R, Li G, Yang Z, Chen M, Liu Y, Li J. Covert and reliable semantic communication against cross-layer privacy inference over wireless edge networks, IEEE, Dubai, United Arab Emirates. New York City (2024), pp. 1-6

[156]

Lin Y, Gao Z, Du H, Niyato D, Kang J, Xiong Z, et al.Blockchain-based efficient and trustworthy AIGC services in metaverse.IEEE Trans Serv Comput 2024; 17(5):2067-2079.

AI Summary AI Mindmap
PDF (3320KB)

159

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/