[1] LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE 1998; 86(11):2278-2324.
[2] Zeiler MD, Fergus R. Visualizing and understanding convolutional networks. In: Fleet D, Pajdla T, Schiele B, Tuytelaars T, editors. Computer vision – ECCV 2014. Cham: Springer; 2014. p. 818-833.
[3] Li J. Exploring the logic and landscape of the knowledge system: multilevel structures, each multiscaled with complexity at the mesoscale. Engineering 2016; 2(3):276-285.
[4] Li J. The principle of compromise-in-competition: understanding mesoscale complexity of different levels. Proc R Soc Lond A 2024; 480(2301):20240031.
[5] Devlin J, Chang MW, Lee K, Toutanova K. BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies; 2019 Jun 2–7; Minneapolis, MN, USA; 2019.
[6] Radford A, Narasimhan K, Salimans T, Sutskever I. Improving language understanding by generative pre-training. San Francisco: OpenAI; 2018.
[7] Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need. Adv Neural Inf Process Syst 2017; 30:5998-6008.
[8] Guo L, Wu J, Li J. Complexity at mesoscales: a common challenge in developing artificial intelligence. Engineering 2019; 5(5):924-929.