针对强人工智能安全风险的技术应对策略
刘宇擎 , 张玉槐 , 段沛奇 , 施柏鑫 , 余肇飞 , 黄铁军 , 高文
中国工程科学 ›› 2021, Vol. 23 ›› Issue (3) : 75 -81.
针对强人工智能安全风险的技术应对策略
Technical Countermeasures for Security Risks of Artificial General Intelligence
未来进入强人工智能(AGI)时代,人类可能面临重大安全风险。本文归纳了AGI 与传统人工智能的区别,从模型的不可解释性、算法及硬件的不可靠性、自主意识的不可控性三方面研判了AGI 安全风险的来源,从能力、动机、行为3 个维度提出了针对AGI 的安全风险评估体系。为应对安全风险,从理论及技术研究、应用两个层面分别探讨相应风险的防御策略:在理论技术研究阶段,完善理论基础验证,实现模型可解释性,严格限制AGI 底层价值取向,促进技术标准化;在应用阶段,预防人为造成的安全问题,对AGI 进行动机选择,为AGI 赋予人类价值观。此外,建议加强国际合作,培养强AI 研究人才,为迎接未知的强AI 时代做好充分准备。
Human beings might face significant security risks after entering into the artificial general intelligence (AGI) era. By summarizing the difference between AGI and traditional artificial intelligence, we analyze the sources of the security risks of AGI from the aspects of model uninterpretability, unreliability of algorithms and hardware, and uncontrollability over autonomous consciousness. Moreover, we propose a security risk assessment system for AGI from the aspects of ability, motivation, and behavior. Subsequently, we discuss the defense countermeasures in the research and application stages. In the research stage, theoretical verification should be improved to develop interpretable models, the basic values of AGI should be rigorously constrained, and technologies should be standardized. In the application stage, man-made risks should be prevented, motivations should be selected for AGI, and human values should be given to AGI. Furthermore, it is necessary to strengthen international cooperation and the education of AGI professionals, to well prepare for the unknown coming era of AGI.
强人工智能 / 安全风险 / 风险评估 / 应对策略 / artificial general intelligence (AGI) / security risk / risk assessment / coping strategy
/
〈 |
|
〉 |