Search | Engineering

Subscribe Submit

Home Journals Focus Achievement Fronts About Us 中文版

Resource Type

Journal Article 73

Year

2023 9

2022 11

2021 10

2020 8

2019 5

2018 2

2017 9

2016 2

2015 1

2013 2

2011 1

2007 4

2006 1

2005 1

2003 3

2002 1

2000 1

1999 2

open ︾

Keywords

Deep learning 8

Artificial intelligence 6

Computer vision 6

Mobile robot 2

Optical flow 2

Simultaneous localization and mapping (SLAM) 2

Visual tracking 2

3D parametric model 1

3D visual knowledge 1

Abstract thinking 1

Abstraction 1

Active vision 1

Adaptive network fuzzy inference system 1

Additive white Gaussian noise 1

Aerial manipulation 1

Aquatic environment 1

Artificial general intelligence 1

Attention deficit hyperactivity disorder (ADHD) 1

Auditory perception 1

open ︾

Search scope:

排序： Display mode:

Visual commonsense reasoning with directional visual connections Research Articles

Yahong Han, Aming Wu, Linchao Zhu, Yi Yang,yahong@tju.edu.cn

Frontiers of Information Technology & Electronic Engineering 2021, Volume 22, Issue 5, Pages 615-766 doi: 10.1631/FITEE.2000722

Abstract: To boost research into cognition-level visual understanding, i.e., making an accurate inference based on a thorough understanding of visual details, (VCR) has been proposed. Compared with traditional visual question answering which requires models to select correct answers, VCR requires models to select not only the correct answers, but also the correct rationales. Recent research into human cognition has indicated that brain function or cognition can be considered as a global and dynamic integration of local neuron connectivity, which is helpful in solving specific cognition tasks. Inspired by this idea, we propose a to achieve VCR by dynamically reorganizing the that is contextualized using the meaning of questions and answers and leveraging the directional information to enhance the reasoning ability. Specifically, we first develop a GraphVLAD module to capture to fully model visual content correlations. Then, a contextualization process is proposed to fuse sentence representations with visual neuron representations. Finally, based on the output of , we propose to infer answers and rationales, which includes a ReasonVLAD module. Experimental results on the VCR dataset and visualization analysis demonstrate the effectiveness of our method.

Keywords：视觉常识推理；有向连接网络；视觉神经元连接；情景化连接；有向连接

HTML PDF Collect

Visual knowledge: an attempt to explore machine creativity Perspectives

Yueting Zhuang, Siliang Tang,yzhuang@zju.edu.cn,siliang@zju.edu.cn

Frontiers of Information Technology & Electronic Engineering 2021, Volume 22, Issue 5, Pages 615-766 doi: 10.1631/FITEE.2100116

Abstract: 长期以来困扰人工智能领域的一个问题是：人工智能是否具有创造力，或者说，算法的推理过程是否可以具有创造性。本文从思维科学的角度探讨人工智能创造力的问题。首先，列举形象思维推理的相关研究；然后，重点介绍一种特殊的视觉知识表示形式，即视觉场景图；最后，详细介绍视觉场景图构造问题与潜在应用。所有证据表明，视觉知识和视觉思维不仅可以改善当前人工智能任务的性能，而且可以用于机器创造力的实践。

Keywords：思维科学；形象思维推理；视觉知识表达；视觉场景图

HTML PDF Collect

A survey of script learning Review

Yi Han, Linbo Qiao, Jianming Zheng, Hefeng Wu, Dongsheng Li, Xiangke Liao,hanyi12@nudt.edu.cn,qiao.linbo@nudt.edu.cn,zhengjianming12@nudt.edu.cn,wuhefeng@mail.sysu.edu.cn,dsli@nudt.edu.cn,xkliao@nudt.edu.cn

Frontiers of Information Technology & Electronic Engineering 2021, Volume 22, Issue 3, Pages 287-436 doi: 10.1631/FITEE.2000347

Abstract: Script is the structured knowledge representation of prototypical real-life event sequences. Learning the commonsense knowledge inside the script can be helpful for machines in understanding natural language and drawing commonsensible inferences. is an interesting and promising research direction, in which a trained system can process narrative texts to capture script knowledge and draw inferences. However, there are currently no survey articles on , so we are providing this comprehensive survey to deeply investigate the standard framework and the major research topics on . This research field contains three main topics: event representations, models, and evaluation approaches. For each topic, we systematically summarize and categorize the existing systems, and carefully analyze and compare the advantages and disadvantages of the representative systems. We also discuss the current state of the research and possible future directions.

Keywords：脚本学习；自然语言处理；常识知识建模；事件推理

HTML PDF Collect

Miniaturized five fundamental issues about visual knowledge Perspectives

Yun-he Pan,panyh@zju.edu.cn

Frontiers of Information Technology & Electronic Engineering 2021, Volume 22, Issue 5, Pages 615-766 doi: 10.1631/FITEE.2040000

Abstract: 认知心理学早已指出，人类知识记忆中的重要部分是视觉知识，被用来进行形象思维。因此，基于视觉的人工智能（AI）是AI绕不开的课题，且具有重要意义。本文继《论视觉知识》一文，讨论与之相关的5个基本问题：（1）视觉知识表达；（2）视觉识别；（3）视觉形象思维模拟；（4）视觉知识的学习；（5）多重知识表达。视觉知识的独特优点是具有形象的综合生成能力，时空演化能力和形象显示能力。这些正是字符知识和深度神经网络所缺乏的。AI与计算机辅助设计/图形学/视觉的技术联合将在创造、预测和人机融合等方面对AI新发展提供重要的基础动力。视觉知识和多重知识表达的研究是发展新的视觉智能的关键，也是促进AI 2.0取得重要突破的关键理论与技术。这是一块荒芜、寒湿而肥沃的“北大荒”，也是一块充满希望值得多学科合作勇探的“无人区”。

Keywords：视觉知识表达；视觉识别；视觉形象思维模拟；视觉知识学习；多重知识表达

HTML PDF Collect

Characters of topological relations and its applications in spatial reasoning

Li Chengming and Liu Xiaoli

Strategic Study of CAE 2013, Volume 15, Issue 5, Pages 14-19

Abstract:

In the paper,the formal representation of topological relationships are proposed according to our early research results, and the characters are also introduced. Based on these characters, the results for com-posite topological relations are introduced. At last, the applications in spatial reasoning are proposed.

Keywords： topological relationship composite of topological relations spatial reasoning algbra reasoning logical reasoning

HTML PDF Collect

Three-dimensional shape space learning for visual concept construction: challenges and research progress Perspective

Xin TONG

Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 9, Pages 1290-1297 doi: 10.1631/FITEE.2200318

Abstract: Human beings can easily categorize three-dimensional (3D) objects with similar shapes and functions into a set of “visual concepts” and learn “visual knowledge” of the surrounding 3D real world (). Developing efficient methods to learn the computational representation of the visual concept and the visual knowledge is a critical task in artificial intelligence (). A crucial step to this end is to learn the shape space spanned by all 3D objects that belong to one visual concept. In this paper, we present the key technical challenges and recent research progress in 3D shape space learning and discuss the open problems and research opportunities in this area.

Keywords：视觉概念；视觉知识；三维几何学习；三维形状空间；三维结构

HTML PDF Collect

混合-增强智能：协作与认知 Review

南宁郑,子熠刘,鹏举任,永强马,仕韬陈,思雨余,建儒薛,霸东陈,飞跃王

Frontiers of Information Technology & Electronic Engineering 2017, Volume 18, Issue 2, Pages 153-179 doi: 10.1631/FITEE.1700053

Abstract: 本文讨论人机协同的混合-增强智能的基本框架，以及基于认知计算的混合-增强智能的基本要素：直觉推理与因果模型、记忆和知识演化；特别论述了直觉推理在复杂问题求解中的作用和基本原理，以及基于记忆与推理的视觉场景理解的认知学习网络

Keywords：人-机协同；混合增强智能；认知计算；直觉推理；因果模型；认知映射；视觉场景理解；自主驾驶汽车

HTML PDF Collect

Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense Feature Article

Yixin Zhu, Tao Gao, Lifeng Fan, Siyuan Huang, Mark Edmonds, Hangxin Liu, Feng Gao, Chi Zhang, Siyuan Qi, Ying Nian Wu, Joshua B. Tenenbaum, Song-Chun Zhu

Engineering 2020, Volume 6, Issue 3, Pages 310-345 doi: 10.1016/j.eng.2020.01.011

Abstract:

Recent progress in deep learning is essentially based on a “big data for small tasks” paradigm, under which massive amounts of data are used to train a classifier for a single narrow task. In this paper, we call for a shift that flips this paradigm upside down. Specifically, we propose a “small data for big tasks” paradigm, wherein a single artificial intelligence (AI) system is challenged to develop “common sense,” enabling it to solve a wide range of tasks with little training data. We illustrate the potential power of this new paradigm by reviewing models of common sense that synthesize recent breakthroughs in both machine and human vision. We identify functionality, physics, intent, causality, and utility (FPICU) as the five core domains of cognitive AI with humanlike common sense. When taken as a unified concept, FPICU is concerned with the questions of “why” and “how,” beyond the dominant “what” and “where” framework for understanding vision. They are invisible in terms of pixels but nevertheless drive the creation, maintenance, and development of visual scenes. We therefore coin them the “dark matter” of vision. Just as our universe cannot be understood by merely studying observable matter, we argue that vision cannot be understood without studying FPICU. We demonstrate the power of this perspective to develop cognitive AI systems with humanlike common sense by showing how to observe and apply FPICU with little training data to solve a wide range of challenging tasks, including tool use, planning, utility inference, and social learning. In summary, we argue that the next generation of AI must embrace “dark” humanlike common sense for solving novel tasks.

Keywords： Computer vision Artificial intelligence Causality Intuitive physics Functionality Perceived intent Utility

HTML PDF Collect

A quantitative attribute-based benchmark methodology for single-target visual tracking Article

Wen-jing KANG, Chang LIU, Gong-liang LIU

Frontiers of Information Technology & Electronic Engineering 2020, Volume 21, Issue 3, Pages 405-421 doi: 10.1631/FITEE.1900245

Abstract: In the past several years, various visual object tracking benchmarks have been proposed, and some of them have been used widely in numerous recently proposed trackers. However, most of the discussions focus on the overall performance, and cannot describe the strengths and weaknesses of the trackers in detail. Meanwhile, several benchmark measures that are often used in tests lack convincing interpretation. In this paper, 12 frame-wise visual attributes that reflect different aspects of the characteristics of image sequences are collated, and a normalized quantitative formulaic definition has been given to each of them for the first time. Based on these definitions, we propose two novel test methodologies, a correlation-based test and a weight-based test, which can provide a more intuitive and easier demonstration of the trackers’ performance for each aspect. Then these methods have been applied to the raw results from one of the most famous tracking challenges, the Video Object Tracking (VOT) Challenge 2017. From the tests, most trackers did not perform well when the size of the target changed rapidly or intensely, and even the advanced deep learning based trackers did not perfectly solve the problem. The scale of the targets was not considered in the calculation of the center location error; however, in a practical test, the center location error is still sensitive to the targets’ changes in size.

Keywords： Visual tracking Performance evaluation Visual attributes Computer vision

HTML PDF Collect

Visual Inspection Technology and its Application

Ye Shenghua,Zhu Jigui,Wang Zhong,Yang Xueyou

Strategic Study of CAE 1999, Volume 1, Issue 1, Pages 49-52

Abstract:

Visual inspection, especially, the active visual inspection and passive visual inspection based on triangulation method has advantages of non-contact, rapid speed, flexibility, etc. Visual inspection is a advanced inspection technology, satisfies modern manufacturing demands. This paper discusses the principle of visual inspection, studies several developed applied visual inspection systems, these systems demostrate wide application foreground of visual inspection from different points of view.

Keywords： active visual inspection passive visual inspection inspection system modern manufacturing

HTML PDF Collect

Application of Uncertainty Reasoning Theory to Satellite Fault Detection and Diagnosis

Yang Tianshe,Li Huaizu,Cao Yuping

Strategic Study of CAE 2003, Volume 5, Issue 2, Pages 68-74

Abstract:

Generally, reasoning theory can be divided into certainty reasoning theory and uncertainty reasoning theory. Traditionally, certainty reasoning theory is used to detect and diagnose satellite faults. However, in practice, it is difficult to detect and diagnose some satellite faults automatically only by use of certainty reasoning theory. The reason is that detection and diagnosis of these faults requires reasonable reasoning and fault-tolerant capability, but certainty reasoning theory can not realize the capability. Fortunately, uncertainty reasoning theory can meet this requirement. Now, it is attracting attention of many researchers and practitioners in the space field all over the world that uncertainty reasoning theory is applied to detect and diagnose the satellite faults which can not be handled properly by certainty reasoning theory. Uncertainty reasoning theory includes several kinds of theories, such as inclusion degree theory, rough set theory, evidence reasoning theory, probabilistic reasoning theory, fuzzy reasoning theory, and so on. This paper introduces three new methods to detect and diagnose the satellite faults, in which inclusion degree theory, rough set theory and evidence reasoning theory of the uncertainty reasoning theory are used respectively.

Keywords： satellite fault detection diagnosis uncertainty reasoning theory

HTML PDF Collect

Causal Inference Review

Kun Kuang, Lian Li, Zhi Geng, Lei Xu, Kun Zhang, Beishui Liao, Huaxin Huang, Peng Ding, Wang Miao, Zhichao Jiang

Engineering 2020, Volume 6, Issue 3, Pages 253-263 doi: 10.1016/j.eng.2019.08.016

Abstract:

Causal inference is a powerful modeling tool for explanatory analysis, which might enable current machine learning to become explainable. How to marry causal inference with machine learning to develop eXplainable Artificial Intelligence (XAI) algorithms is one of key steps towards to the artificial intelligence 2.0. With the aim of bringing knowledge of causal inference to scholars of machine learning
and artificial intelligence, we invited researchers working on causal inference to write this survey from different aspects of causal inference. This survey includes the following sections: "Estimating average treatment effect: A brief review and beyond" from Dr. Kun Kuang, "Attribution problems in counterfactual inference" from Prof. Lian Li, "The Yule-Simpson paradox and the surrogate paradox" from Prof. Zhi Geng, "Causal potential theory" from Prof. Lei Xu, "Discovering causal information from observational data" from Prof. Kun Zhang, "Formal argumentation in causal reasoning and explanation" from Profs. Beishui Liao and Huaxin Huang, "Causal inference with complex experiments" from Prof. Peng Ding, "Instrumental variables and negative controls for observational studies" from Prof. Wang Miao, and "Causal inference with interference" from Dr. Zhichao Jiang.

Keywords： Causal inference Instructive variables Negative control Causal reasoning and explanation Causal discovery Counter factual inference Treatment effect estimation

HTML PDF Collect

Progress in Neural NLP: Modeling, Learning, and Reasoning Review

Ming Zhou, Nan Duan, Shujie Liu, Heung-Yeung Shum

Engineering 2020, Volume 6, Issue 3, Pages 275-290 doi: 10.1016/j.eng.2019.12.014

Abstract:

Natural language processing (NLP) is a subfield of artificial intelligence (AI) that focuses on enabling computers to understand and process human languages. In the last five years, we have witnessed the rapid development of NLP in tasks such as machine translation, question-answering, and machine reading comprehension based on deep learning and an enormous volume of annotated and unannotated data. In this paper, we will review the latest progress in the neural network-based NLP framework (neural NLP) from three perspectives: modeling, learning, and reasoning. In the modeling section, we will describe several fundamental neural network-based modeling paradigms, such as word embedding, sentence embedding, and sequence-to-sequence modeling, which are widely used in modern NLP engines. In the learning section, we will introduce widely used learning methods for NLP models, including supervised, semi-supervised, and unsupervised learning; multitask learning; transfer learning; and active learning. We view reasoning as a new and exciting direction for neural NLP, but it has yet to be well addressed. In the reasoning section, we will review reasoning mechanisms, including the knowledge, existing non-neural inference methods, and new neural inference methods. We emphasize the importance of reasoning in this paper because it is important for building interpretable and knowledge-driven neural NLP models to handle complex tasks. At the end of this paper, we will briefly outline our thoughts on the future directions of neural NLP.

Keywords： Natural language processing Deep learning Modeling learning and Reasoning

HTML PDF Collect

Mathematical Reasoning Challenges Artificial Intelligence

Sean O’Neill

Engineering 2019, Volume 5, Issue 5, Pages 817-818 doi: 10.1016/j.eng.2019.08.009

HTML PDF Collect

An ANFIS-based Approach for Predicting MiningInduced Surface Subsidence

Ding Dexin,Zhang Zhijun,Bi Zhongwei

Strategic Study of CAE 2007, Volume 9, Issue 1, Pages 33-39

Abstract:

Current approaches for predicting mining induced surface subsidence have a drawback in common that they predict the subsidence only on the basis of a physical or mechanical approach irrespective of the practical examples in engineering practice in mining induced surface subsidence.However,these experiences created in engineering practice are of great value and full use should be made of them to establish an approach for predicting mining induced surface subsidence.Therefore,this paper accumulated a lot of practical examples of mining induced surface subsidence,integrated these examples by using adaptive neuro-fuzzy inference system (ANFIS)and established an ANFIS-based approach for predicting mining induced surface subsidence.The approach was further tested by using practical examples of mining induced surface subsidence.The results show that the approach can converge quickly,fit the data in very good agreement and make generalization prediction with high accuracy.

Keywords： underground mining mining induced surface subsidence adaptive neuro唱fuzzy inferencesystem

HTML PDF Collect

Title Author Date Type Operation

Visual commonsense reasoning with directional visual connections

Yahong Han, Aming Wu, Linchao Zhu, Yi Yang,yahong@tju.edu.cn

Journal Article

Visual knowledge: an attempt to explore machine creativity

Yueting Zhuang, Siliang Tang,yzhuang@zju.edu.cn,siliang@zju.edu.cn

Journal Article

A survey of script learning

Journal Article

Miniaturized five fundamental issues about visual knowledge

Yun-he Pan,panyh@zju.edu.cn

Journal Article

Characters of topological relations and its applications in spatial reasoning

Li Chengming and Liu Xiaoli

Journal Article

Three-dimensional shape space learning for visual concept construction: challenges and research progress

Xin TONG

Journal Article

混合-增强智能：协作与认知

南宁郑,子熠刘,鹏举任,永强马,仕韬陈,思雨余,建儒薛,霸东陈,飞跃王

Journal Article

Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense

Yixin Zhu, Tao Gao, Lifeng Fan, Siyuan Huang, Mark Edmonds, Hangxin Liu, Feng Gao, Chi Zhang, Siyuan Qi, Ying Nian Wu, Joshua B. Tenenbaum, Song-Chun Zhu

Journal Article

A quantitative attribute-based benchmark methodology for single-target visual tracking

Wen-jing KANG, Chang LIU, Gong-liang LIU

Journal Article

Visual Inspection Technology and its Application

Ye Shenghua,Zhu Jigui,Wang Zhong,Yang Xueyou

Journal Article

Application of Uncertainty Reasoning Theory to Satellite Fault Detection and Diagnosis

Yang Tianshe,Li Huaizu,Cao Yuping

Journal Article

Causal Inference

Kun Kuang, Lian Li, Zhi Geng, Lei Xu, Kun Zhang, Beishui Liao, Huaxin Huang, Peng Ding, Wang Miao, Zhichao Jiang