Search scope:
排序: Display mode:
Prospects for multi-agent collaboration and gaming: challenge, technology, and application Perspective
Yu LIU, Zhi LI, Zhizhuo JIANG, You HE
Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 7, Pages 1002-1009 doi: 10.1631/FITEE.2200055
Keywords: 多智能体;博弈论;集体智能;强化学习;智能控制
Decentralized multi-agent reinforcement learning with networked agents: recent advances Review Article
Kaiqing Zhang, Zhuoran Yang, Tamer Başar,kzhang66@illinois.edu,zy6@princeton.edu,basar1@illinois.edu
Frontiers of Information Technology & Electronic Engineering 2021, Volume 22, Issue 6, Pages 802-814 doi: 10.1631/FITEE.1900661
Keywords: 强化学习;多智能体系统;网络系统;一致性优化;分布式优化;博弈论
Multi-agent differential game based cooperative synchronization control using a data-driven method Research Article
Yu SHI, Yongzhao HUA, Jianglong YU, Xiwang DONG, Zhang REN
Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 7, Pages 1043-1056 doi: 10.1631/FITEE.2200001
Keywords: Multi-agent system Differential game Synchronization control Data-driven Reinforcement learning
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents Research Article
Jian ZHAO, Youpeng ZHAO, Weixun WANG, Mingyu YANG, Xunhan HU, Wengang ZHOU, Jianye HAO, Houqiang LI
Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 7, Pages 1032-1042 doi: 10.1631/FITEE.2100594
Keywords: Multi-agent system Reinforcement learning Unexpected crashed agents
Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game Research Article
Hongyang LI, Qinglai WEI
Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 7, Pages 1010-1019 doi: 10.1631/FITEE.2200010
Keywords: Optimal synchronization control Multi-agent systems Nonzero-sum game Adaptive dynamic programming Input saturation Off-policy reinforcement learning Policy iteration
The Application of Multi-agent Based Distributed Intelligent Control in VAV Air Conditioning System
Zhang Hongwei,Wu Aiguo,Sheng Tao
Strategic Study of CAE 2006, Volume 8, Issue 7, Pages 58-62
A VAV system can be treated as a multi-agent system. In this paper, a multi-agent-based distributed intelligent control method is presented to solve the problem of concordance and decoupling in the VAV system. A simulation program of VAV system is set up for control analysis. Through a simulation, this control method has been proved to be satisfactory.
Keywords: VAV agent multi-agent system distributed intelligent control
Existence and practice of gaming: thoughts on the development of multi-agent system gaming Perspective
Qi DONG, Zhenyu WU, Jun LU, Fengsong SUN, Jinyu WANG, Yanyu YANG, Xiaozhou SHANG
Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 7, Pages 995-1001 doi: 10.1631/FITEE.2100593
Keywords: 博弈;多智能体系统;多智能体演化博弈;预警探测
Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrial wireless networks Research Article
Xiaoyu LIU, Chi XU, Haibin YU, Peng ZENG,liuxiaoyu1@sia.cn,xuchi@sia.cn,yhb@sia.cn,zp@sia.cn
Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 1, Pages 47-60 doi: 10.1631/FITEE.2100331
Keywords: Multi-agent deep reinforcement learning End–edge orchestrated Industrial wireless networks Delay Energy consumption
Cooperative channel assignment for VANETs based on multiagent reinforcement learning Research Articles
Yun-peng Wang, Kun-xian Zheng, Da-xin Tian, Xu-ting Duan, Jian-shan Zhou,ypwang@buaa.edu.cn,zhengkunxian@buaa.edu.cn,dtian@buaa.edu.cn,duanxuting@buaa.edu.cn
Frontiers of Information Technology & Electronic Engineering 2020, Volume 21, Issue 7, Pages 1047-1058 doi: 10.1631/FITEE.1900308
Keywords: Vehicular ad-hoc networks Reinforcement learning Dynamic channel assignment Multichannel
Artificial Intelligence for Retrosynthesis Prediction Review
Yinjie Jiang, Yemin Yu, Ming Kong, Yu Mei, Luotian Yuan, Zhengxing Huang, Kun Kuang, Zhihua Wang, Huaxiu Yao, James Zou, Connor W. Coley, Ying Wei
Engineering 2023, Volume 25, Issue 6, Pages 32-50 doi: 10.1016/j.eng.2022.04.021
In recent years, there has been a dramatic rise in interest in retrosynthesis prediction with artificial intelligence (AI) techniques. Unlike conventional retrosynthesis prediction performed by chemists and by rule-based expert systems, AI-driven retrosynthesis prediction automatically learns chemistry knowledge from off-the-shelf experimental datasets to predict reactions and retrosynthesis routes. This provides an opportunity to address many conventional challenges, including heavy reliance on extensive expertise, the sub-optimality of routes, and prohibitive computational cost. This review describes the current landscape of AI-driven retrosynthesis prediction. We first discuss formal definitions of the retrosynthesis problem and review the outstanding research challenges therein. We then review the related AI techniques and recent progress that enable retrosynthesis prediction. Moreover, we propose a novel landscape that provides a comprehensive categorization of different retrosynthesis prediction components and survey how AI reshapes each component. We conclude by discussing promising areas for future research.
Keywords: Retrosynthesis prediction Artificial intelligence Graph neural networks Deep reinforcement learning
Interactive medical image segmentation with self-adaptive confidence calibration
沈楚云,李文浩,徐琪森,胡斌,金博,蔡海滨,朱凤平,李郁欣,王祥丰
Frontiers of Information Technology & Electronic Engineering 2023, Volume 24, Issue 9, Pages 1332-1348 doi: 10.1631/FITEE.2200299
Keywords: Medical image segmentation Interactive segmentation Multi-agent reinforcement learning Confidence learning Semi-supervised learning
Intelligent Products and Equipment Led by New-Generation Artificial Intelligence
Tan Jianrong, Liu Zhenyu, Xu Jinghua
Strategic Study of CAE 2018, Volume 20, Issue 4, Pages 35-43 doi: 10.15302/J-SSCAE-2018.04.007
Intelligent products and equipment is the value carrier, technological prerequisite and material base of intelligent manufacturing and service. The intelligent products and equipment refers to two dialectical aspects: on the one hand, commercialization of intelligent technology, turning intelligence technology into products, which is mainly reflected in the comprehensive application of the Internet of Things, big data, cloud computing, edge computing, machine learning, deep learning, security monitoring, automation control, computer technology, precision sensing technology, and GPS positioning technology; On the other hand, the intelligent products and equipment refers to the intellectualization of traditional products. The new-generation artificial intelligence endows traditional products with higher intelligence and injects strong vitality and developmental motivation into traditional products in the aspect of intelligent manufacturing equipment, intelligent production, and intelligent management. Based on extensive scientific surveys and current researches, and combined with the ten major fields of Made in China 2025 and macro policies such as the Three-Year Action Plan for Artificial Intelligence, twelve major equipment fields of intelligent products and equipment are formulated. Researches show that the new-generation intelligent products and equipment focuses on knowledge engineering and is prominently characterized by self-sensing, self-adaptation, self-learning, and self-decision-making. Ten key technologies will be prioritized in future.
Keywords: intelligent products and equipment knowledge engineering intelligent state sensing intelligent variation adaptation intelligent knowledge learning intelligent control decision
Jingda Wu, Zhiyu Huang, Zhongxu Hu, Chen Lv
Engineering 2023, Volume 21, Issue 2, Pages 75-91 doi: 10.1016/j.eng.2022.05.017
Due to its limited intelligence and abilities, machine learning is currently unable to handle various situations thus cannot completely replace humans in real-world applications. Because humans exhibit robustness and adaptability in complex scenarios, it is crucial to introduce humans into the training loop of artificial intelligence (AI), leveraging human intelligence to further advance machine learning algorithms. In this study, a real-time human-guidance-based (Hug)-deep reinforcement learning (DRL) method is developed for policy training in an end-to-end autonomous driving case. With our newly designed mechanism for control transfer between humans and automation, humans are able to intervene and correct the agent's unreasonable actions in real time when necessary during the model training process. Based on this human-in-the-loop guidance mechanism, an improved actor-critic architecture with modified policy and value networks is developed. The fast convergence of the proposed Hug-DRL allows real-time human guidance actions to be fused into the agent's training loop, further improving the efficiency and performance of DRL. The developed method is validated by human-in-the-loop experiments with 40 subjects and compared with other state-of-the-art learning approaches. The results suggest that the proposed method can effectively enhance the training efficiency and performance of the DRL algorithm under human guidance without imposing specific requirements on participants' expertise or experience.
Keywords: Human-in-the-loop AI Deep reinforcement learning Human guidance Autonomous driving
Resource allocation for network profit maximization in NOMA-based F-RANs: a game-theoretic approach Research Article
Xueyan CAO, Shi YAN, Hongming ZHANG
Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 10, Pages 1546-1561 doi: 10.1631/FITEE.2100341
(NOMA) based s (F-RANs) offer high spectrum efficiency, ultra-low delay, and huge network throughput, and this is made possible by edge computing and communication functions of the fog access points (F-APs). Meanwhile, caching-enabled F-APs are responsible for edge caching and delivery of a large volume of multimedia files during the caching phase, which facilitates further reduction in the transmission energy and burden. The need of the prevailing situation in industry is that in NOMA-based F-RANs, energy-efficient , which consists of (CP) and radio (RRA), is crucial for network performance enhancement. To this end, in this paper, we first characterize an NOMA-based F-RAN in which F-APs of caching capabilities underlaid with the radio remote heads serve user equipments via the NOMA protocol. Then, we formulate a problem for maximizing the defined performance indicator, namely network profit, which takes caching cost, revenue, and energy efficiency into consideration. The NP-hard problem is decomposed into two sub-problems, namely the CP sub-problem and RRA sub-problem. Finally, we propose an iterative method and a Stackelberg game based method to solve them, and numerical results show that the proposed solution can significantly improve network profit compared to some existing schemes in NOMA-based F-RANs.
Keywords: Fog radio access network Non-orthogonal multiple access Game theory Cache placement Resource allocation
Intellectual control—A goal exceeding the century - speech on the Fourteenth Conference IFAC
Song Jian
Strategic Study of CAE 1999, Volume 1, Issue 1, Pages 1-5
Keywords: Intelligent control cybernetics automation technology population control
Title Author Date Type Operation
Prospects for multi-agent collaboration and gaming: challenge, technology, and application
Yu LIU, Zhi LI, Zhizhuo JIANG, You HE
Journal Article
Decentralized multi-agent reinforcement learning with networked agents: recent advances
Kaiqing Zhang, Zhuoran Yang, Tamer Başar,kzhang66@illinois.edu,zy6@princeton.edu,basar1@illinois.edu
Journal Article
Multi-agent differential game based cooperative synchronization control using a data-driven method
Yu SHI, Yongzhao HUA, Jianglong YU, Xiwang DONG, Zhang REN
Journal Article
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents
Jian ZHAO, Youpeng ZHAO, Weixun WANG, Mingyu YANG, Xunhan HU, Wengang ZHOU, Jianye HAO, Houqiang LI
Journal Article
Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game
Hongyang LI, Qinglai WEI
Journal Article
The Application of Multi-agent Based Distributed Intelligent Control in VAV Air Conditioning System
Zhang Hongwei,Wu Aiguo,Sheng Tao
Journal Article
Existence and practice of gaming: thoughts on the development of multi-agent system gaming
Qi DONG, Zhenyu WU, Jun LU, Fengsong SUN, Jinyu WANG, Yanyu YANG, Xiaozhou SHANG
Journal Article
Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrial wireless networks
Xiaoyu LIU, Chi XU, Haibin YU, Peng ZENG,liuxiaoyu1@sia.cn,xuchi@sia.cn,yhb@sia.cn,zp@sia.cn
Journal Article
Cooperative channel assignment for VANETs based on multiagent reinforcement learning
Yun-peng Wang, Kun-xian Zheng, Da-xin Tian, Xu-ting Duan, Jian-shan Zhou,ypwang@buaa.edu.cn,zhengkunxian@buaa.edu.cn,dtian@buaa.edu.cn,duanxuting@buaa.edu.cn
Journal Article
Artificial Intelligence for Retrosynthesis Prediction
Yinjie Jiang, Yemin Yu, Ming Kong, Yu Mei, Luotian Yuan, Zhengxing Huang, Kun Kuang, Zhihua Wang, Huaxiu Yao, James Zou, Connor W. Coley, Ying Wei
Journal Article
Interactive medical image segmentation with self-adaptive confidence calibration
沈楚云,李文浩,徐琪森,胡斌,金博,蔡海滨,朱凤平,李郁欣,王祥丰
Journal Article
Intelligent Products and Equipment Led by New-Generation Artificial Intelligence
Tan Jianrong, Liu Zhenyu, Xu Jinghua
Journal Article
Toward Human-in-the-loop AI: Enhancing Deep Reinforcement Learning Via Real-time Human Guidance for Autonomous Driving
Jingda Wu, Zhiyu Huang, Zhongxu Hu, Chen Lv
Journal Article
Resource allocation for network profit maximization in NOMA-based F-RANs: a game-theoretic approach
Xueyan CAO, Shi YAN, Hongming ZHANG
Journal Article