The Tong Test: Evaluating Artificial General Intelligence Through Dynamic Embodied Physical and Social Interactions

Yujia Peng , Jiaheng Han , Zhenliang Zhang , Lifeng Fan , Tengyu Liu , Siyuan Qi , Xue Feng , Yuxi Ma , Yizhou Wang , Song-Chun Zhu

Engineering ›› 2024, Vol. 34 ›› Issue (3) : 12 -23.

PDF (2038KB)
Engineering ›› 2024, Vol. 34 ›› Issue (3) : 12 -23. DOI: 10.1016/j.eng.2023.07.006
Research
Perspective

The Tong Test: Evaluating Artificial General Intelligence Through Dynamic Embodied Physical and Social Interactions

Author information +
History +
PDF (2038KB)

Abstract

The release of the generative pre-trained transformer (GPT) series has brought artificial general intelligence (AGI) to the forefront of the artificial intelligence (AI) field once again. However, the questions of how to define and evaluate AGI remain unclear. This perspective article proposes that the evaluation of AGI should be rooted in dynamic embodied physical and social interactions (DEPSI). More specifically, we propose five critical characteristics to be considered as AGI benchmarks and suggest the Tong test as an AGI evaluation system. The Tong test describes a value- and ability-oriented testing system that delineates five levels of AGI milestones through a virtual environment with DEPSI, allowing for infinite task generation. We contrast the Tong test with classical AI testing systems in terms of various aspects and propose a systematic evaluation system to promote standardized, quantitative, and objective benchmarks and evaluation of AGI.

Graphical abstract

Keywords

Artificial general intelligence / Artificial intelligence benchmark / Artificial intelligence evaluation / Embodied artificial intelligence / Value alignment / Turing test / Causality

Cite this article

Download citation ▾
Yujia Peng, Jiaheng Han, Zhenliang Zhang, Lifeng Fan, Tengyu Liu, Siyuan Qi, Xue Feng, Yuxi Ma, Yizhou Wang, Song-Chun Zhu, , , , , , , , , , . The Tong Test: Evaluating Artificial General Intelligence Through Dynamic Embodied Physical and Social Interactions. Engineering, 2024, 34(3): 12-23 DOI:10.1016/j.eng.2023.07.006

登录浏览全文

4963

注册一个新账户 忘记密码

References

RIGHTS & PERMISSIONS

THE AUTHOR

AI Summary AI Mindmap
PDF (2038KB)

4683

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/