Search scope:
排序: Display mode:
Jing-jing Chen, Qi-rong Mao, You-cai Qin, Shuang-qing Qian, Zhi-shen Zheng,2221808071@stmail.ujs.edu.cn,mao_qr@ujs.edu.cn,2211908026@stmail.ujs.edu.cn,2211908025@stmail.ujs.edu.cn,3160602062@stmail.ujs.edu.cn
Frontiers of Information Technology & Electronic Engineering 2020, Volume 21, Issue 11, Pages 1535-1670 doi: 10.1631/FITEE.2000019
Keywords: 语音分离;生成因子;自动编码器;深度学习
Representation learning via a semi-supervised stacked distance autoencoder for image classification Research Articles
Liang Hou, Xiao-yi Luo, Zi-yang Wang, Jun Liang,jliang@zju.edu.cn
Frontiers of Information Technology & Electronic Engineering 2020, Volume 21, Issue 7, Pages 963-1118 doi: 10.1631/FITEE.1900116
Keywords: 自动编码器;图像分类;半监督学习;神经网络
Deep 3D reconstruction: methods, data, and challenges Review Article
Caixia Liu, Dehui Kong, Shaofan Wang, Zhiyong Wang, Jinghua Li, Baocai Yin,lcxxib@emails.bjut.edu.cn,wangshaofan@bjut.edu.cn
Frontiers of Information Technology & Electronic Engineering 2021, Volume 22, Issue 5, Pages 615-766 doi: 10.1631/FITEE.2000068
Keywords: 深度学习模型;三维重建;循环神经网络;深度自编码器;生成对抗网络;卷积神经网络
Brain Encoding and Decoding in fMRI with Bidirectional Deep Generative Models Review
Changde Du, Jinpeng Li, Lijie Huang, Huiguang He
Engineering 2019, Volume 5, Issue 5, Pages 948-953 doi: 10.1016/j.eng.2019.03.010
Brain encoding and decoding via functional magnetic resonance imaging (fMRI) are two important aspects of visual perception neuroscience. Although previous researchers have made significant advances in brain encoding and decoding models, existing methods still require improvement using advanced machine learning techniques. For example, traditional methods usually build the encoding and decoding models separately, and are prone to overfitting on a small dataset. In fact, effectively unifying the encoding and decoding procedures may allow for more accurate predictions. In this paper, we first review the existing encoding and decoding methods and discuss
the potential advantages of a "bidirectional" modeling strategy. Next, we show that there are correspondences between deep neural networks and human visual streams in terms of the architecture and computational rules Furthermore, deep generative models (e.g., variational autoencoders (VAEs) and generative adversarial networks (GANs)) have produced promising results in studies on brain encoding and decoding. Finally, we propose that the dual learning method, which was originally designed for machine translation tasks, could help to improve the performance of encoding and decoding models by leveraging large-scale unpaired data.
Keywords: Brain encoding and decoding Functional magnetic resonance imaging Deep neural networks Deep generative models Dual learning
Battle damage assessment based on an improved Kullback-Leibler divergence sparse autoencoder Article
Zong-feng QI, Qiao-qiao LIU, Jun WANG, Jian-xun LI
Frontiers of Information Technology & Electronic Engineering 2017, Volume 18, Issue 12, Pages 1991-2000 doi: 10.1631/FITEE.1601395
Keywords: Battle damage assessment Improved Kullback-Leibler divergence sparse autoencoder Structural optimization Feature selection
Efficient normalization for quantitative evaluation of the driving behavior using a gated auto-encoder Research Articles
Xin HE, Zhe ZHANG, Li XU, Jiapei YU,xinhe_ee@zju.edu.cn,xupower@zju.edu.cn
Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 3, Pages 452-462 doi: 10.1631/FITEE.2000667
Keywords: Driving behavior Normalization Gated auto-encoder Quantitative evaluation
Jingda Wu, Zhiyu Huang, Zhongxu Hu, Chen Lv
Engineering 2023, Volume 21, Issue 2, Pages 75-91 doi: 10.1016/j.eng.2022.05.017
Due to its limited intelligence and abilities, machine learning is currently unable to handle various situations thus cannot completely replace humans in real-world applications. Because humans exhibit robustness and adaptability in complex scenarios, it is crucial to introduce humans into the training loop of artificial intelligence (AI), leveraging human intelligence to further advance machine learning algorithms. In this study, a real-time human-guidance-based (Hug)-deep reinforcement learning (DRL) method is developed for policy training in an end-to-end autonomous driving case. With our newly designed mechanism for control transfer between humans and automation, humans are able to intervene and correct the agent's unreasonable actions in real time when necessary during the model training process. Based on this human-in-the-loop guidance mechanism, an improved actor-critic architecture with modified policy and value networks is developed. The fast convergence of the proposed Hug-DRL allows real-time human guidance actions to be fused into the agent's training loop, further improving the efficiency and performance of DRL. The developed method is validated by human-in-the-loop experiments with 40 subjects and compared with other state-of-the-art learning approaches. The results suggest that the proposed method can effectively enhance the training efficiency and performance of the DRL algorithm under human guidance without imposing specific requirements on participants' expertise or experience.
Keywords: Human-in-the-loop AI Deep reinforcement learning Human guidance Autonomous driving
Automatic traceability link recovery via active learning Research Articles
Tian-bao Du, Guo-hua Shen, Zhi-qiu Huang, Yao-shen Yu, De-xiang Wu,tbdu_312@outlook.com,ghshen@nuaa.edu.cn,zqhuang@nuaa.edu.cn
Frontiers of Information Technology & Electronic Engineering 2020, Volume 21, Issue 8, Pages 1217-1225 doi: 10.1631/FITEE.1900222
Keywords: Automatic Traceability link recovery Manpower Active learning
A Geometric Understanding of Deep Learning Article
Na Lei, Dongsheng An, Yang Guo, Kehua Su, Shixia Liu, Zhongxuan Luo, Shing-Tung Yau, Xianfeng Gu
Engineering 2020, Volume 6, Issue 3, Pages 361-374 doi: 10.1016/j.eng.2019.09.010
This work introduces an optimal transportation (OT) view of generative adversarial networks (GANs). Natural datasets have intrinsic patterns, which can be summarized as the manifold distribution principle: the distribution of a class of data is close to a low-dimensional manifold. GANs mainly accomplish two tasks: manifold learning and probability distribution transformation. The latter can be carried out using the classical OT method. From the OT perspective, the generator computes the OT map, while the discriminator computes the Wasserstein distance between the generated data distribution and the real data distribution; both can be reduced to a convex geometric optimization process. Furthermore, OT theory discovers the intrinsic collaborative—instead of competitive—relation between the generator and the discriminator, and the fundamental reason for mode collapse. We also propose a novel generative model, which uses an autoencoder (AE) for manifold learning and OT map for probability distribution transformation. This AE–OT model improves the theoretical rigor and transparency, as well as the computational stability and efficiency; in particular, it eliminates the mode collapse. The experimental results validate our hypothesis, and demonstrate the advantages of our proposed model.
Keywords: Generative Adversarial Deep learning Optimal transportation Mode collapse
SmartPaint: a co-creative drawing system based on generative adversarial networks Special Feature on Intelligent Design
Lingyun SUN, Pei CHEN, Wei XIANG, Peng CHEN, Wei-yue GAO, Ke-jun ZHANG
Frontiers of Information Technology & Electronic Engineering 2019, Volume 20, Issue 12, Pages 1644-1656 doi: 10.1631/FITEE.1900386
Keywords: Co-creative drawing Deep learning Image generation
Deep learning compact binary codes for fingerprint indexing None
Chao-chao BAI, Wei-qiang WANG, Tong ZHAO, Ru-xin WANG, Ming-qiang LI
Frontiers of Information Technology & Electronic Engineering 2018, Volume 19, Issue 9, Pages 1112-1123 doi: 10.1631/FITEE.1700420
With the rapid growth in fingerprint databases, it has become necessary to develop excellent fingerprint indexing to achieve efficiency and accuracy. Fingerprint indexing has been widely studied with real-valued features, but few studies focus on binary feature representation, which is more suitable to identify fingerprints efficiently in large-scale fingerprint databases. In this study, we propose a deep compact binary minutia cylinder code (DCBMCC) as an effective and discriminative feature representation for fingerprint indexing. Specifically, the minutia cylinder code (MCC), as the state-of-the-art fingerprint representation, is analyzed and its shortcomings are revealed. Accordingly, we propose a novel fingerprint indexing method based on deep neural networks to learn DCBMCC. Our novel network restricts the penultimate layer to directly output binary codes. Moreover, we incorporate independence, balance, quantization-loss-minimum, and similarity-preservation properties in this learning process. Eventually, a multi-index hashing (MIH) based fingerprint indexing scheme further speeds up the exact search in the Hamming space by building multiple hash tables on binary code substrings. Furthermore, numerous experiments on public databases show that the proposed approach is an outstanding fingerprint indexing method since it has an extremely small error rate with a very low penetration rate.
Keywords: Fingerprint indexing Minutia cylinder code Deep neural network Multi-index hashing
Attention-based efficient robot grasp detection network Research Article
Xiaofei QIN, Wenkai HU, Chen XIAO, Changxiang HE, Songwen PEI, Xuedian ZHANG,xiaofei.qin@usst.edu.cn,obmmd_zxd@163.com
Frontiers of Information Technology & Electronic Engineering 2023, Volume 24, Issue 10, Pages 1430-1444 doi: 10.1631/FITEE.2200502
Keywords: Robot grasp detection Attention mechanism Encoder– decoder Neural network
Embedding expert demonstrations into clustering buffer for effective deep reinforcement learning Research Article
Shihmin WANG, Binqi ZHAO, Zhengfeng ZHANG, Junping ZHANG, Jian PU
Frontiers of Information Technology & Electronic Engineering 2023, Volume 24, Issue 11, Pages 1541-1556 doi: 10.1631/FITEE.2300084
Keywords: Reinforcement learning Sample efficiency Sampling process Clustering methods Autonomous driving
Visual knowledge guided intelligent generation of Chinese seal carving Research Article
Kejun ZHANG, Rui ZHANG, Yehang YIN, Yifei LI, Wenqi WU, Lingyun SUN, Fei WU, Huanghuang DENG, Yunhe PAN
Frontiers of Information Technology & Electronic Engineering 2022, Volume 23, Issue 10, Pages 1479-1493 doi: 10.1631/FITEE.2100094
We digitally reproduce the process of resource collaboration, design creation, and visual presentation of Chinese art. We develop an intelligent art-generation system (Zhejiang University Intelligent System, http://www.next.zju.edu.cn/seal/; the website of the search and layout system is http://www.next.zju.edu.cn/seal/search_app/) to deal with the difficulty in using a visual knowledge guided approach. The knowledge base in this study is the Qiushi Database, which consists of open datasets of images of seal characters and seal stamps. We propose a seal character generation method based on visual knowledge, guided by the database and expertise. Furthermore, to create the layout of the seal, we propose a deformation algorithm to adjust the seal characters and calculate layout parameters from the database and knowledge to achieve an intelligent structure. Experimental results show that this method and system can effectively deal with the difficulties in the generation of seal carving. Our work provides theoretical and applied references for the rebirth and innovation of art.
Keywords: Seal-carving Intelligent generation Deep learning Parametric modeling Computational art
A subband excitation substitute based scheme for narrowband speech watermarking Article
Wei LIU, Ai-qun HU
Frontiers of Information Technology & Electronic Engineering 2017, Volume 18, Issue 5, Pages 627-643 doi: 10.1631/FITEE.1601503
Keywords: Analysis filter Linear prediction Narrowband speech watermarking Passband excitation replacement Power normalization Spectral envelope shaping Synthesis filter
Title Author Date Type Operation
Latent source-specific generative factor learning for monaural speech separation using weighted-factor autoencoder
Jing-jing Chen, Qi-rong Mao, You-cai Qin, Shuang-qing Qian, Zhi-shen Zheng,2221808071@stmail.ujs.edu.cn,mao_qr@ujs.edu.cn,2211908026@stmail.ujs.edu.cn,2211908025@stmail.ujs.edu.cn,3160602062@stmail.ujs.edu.cn
Journal Article
Representation learning via a semi-supervised stacked distance autoencoder for image classification
Liang Hou, Xiao-yi Luo, Zi-yang Wang, Jun Liang,jliang@zju.edu.cn
Journal Article
Deep 3D reconstruction: methods, data, and challenges
Caixia Liu, Dehui Kong, Shaofan Wang, Zhiyong Wang, Jinghua Li, Baocai Yin,lcxxib@emails.bjut.edu.cn,wangshaofan@bjut.edu.cn
Journal Article
Brain Encoding and Decoding in fMRI with Bidirectional Deep Generative Models
Changde Du, Jinpeng Li, Lijie Huang, Huiguang He
Journal Article
Battle damage assessment based on an improved Kullback-Leibler divergence sparse autoencoder
Zong-feng QI, Qiao-qiao LIU, Jun WANG, Jian-xun LI
Journal Article
Efficient normalization for quantitative evaluation of the driving behavior using a gated auto-encoder
Xin HE, Zhe ZHANG, Li XU, Jiapei YU,xinhe_ee@zju.edu.cn,xupower@zju.edu.cn
Journal Article
Toward Human-in-the-loop AI: Enhancing Deep Reinforcement Learning Via Real-time Human Guidance for Autonomous Driving
Jingda Wu, Zhiyu Huang, Zhongxu Hu, Chen Lv
Journal Article
Automatic traceability link recovery via active learning
Tian-bao Du, Guo-hua Shen, Zhi-qiu Huang, Yao-shen Yu, De-xiang Wu,tbdu_312@outlook.com,ghshen@nuaa.edu.cn,zqhuang@nuaa.edu.cn
Journal Article
A Geometric Understanding of Deep Learning
Na Lei, Dongsheng An, Yang Guo, Kehua Su, Shixia Liu, Zhongxuan Luo, Shing-Tung Yau, Xianfeng Gu
Journal Article
SmartPaint: a co-creative drawing system based on generative adversarial networks
Lingyun SUN, Pei CHEN, Wei XIANG, Peng CHEN, Wei-yue GAO, Ke-jun ZHANG
Journal Article
Deep learning compact binary codes for fingerprint indexing
Chao-chao BAI, Wei-qiang WANG, Tong ZHAO, Ru-xin WANG, Ming-qiang LI
Journal Article
Attention-based efficient robot grasp detection network
Xiaofei QIN, Wenkai HU, Chen XIAO, Changxiang HE, Songwen PEI, Xuedian ZHANG,xiaofei.qin@usst.edu.cn,obmmd_zxd@163.com
Journal Article
Embedding expert demonstrations into clustering buffer for effective deep reinforcement learning
Shihmin WANG, Binqi ZHAO, Zhengfeng ZHANG, Junping ZHANG, Jian PU
Journal Article
Visual knowledge guided intelligent generation of Chinese seal carving
Kejun ZHANG, Rui ZHANG, Yehang YIN, Yifei LI, Wenqi WU, Lingyun SUN, Fei WU, Huanghuang DENG, Yunhe PAN
Journal Article