Revolutionizing Crop Breeding: Next-Generation Artificial Intelligence and Big Data-Driven Intelligent Design

Ying Zhang, Guanmin Huang, Yanxin Zhao, Xianju Lu, Yanru Wang, Chuanyu Wang, Xinyu Guo, Chunjiang Zhao

Engineering ›› 2025, Vol. 44 ›› Issue (1) : 245-255.

PDF(2208 KB)
PDF(2208 KB)
Engineering ›› 2025, Vol. 44 ›› Issue (1) : 245-255. DOI: 10.1016/j.eng.2024.11.034
Research
Review

Revolutionizing Crop Breeding: Next-Generation Artificial Intelligence and Big Data-Driven Intelligent Design

Author information +
History +

Abstract

The security of the seed industry is crucial for ensuring national food security. Currently, developed countries in Europe and America, along with international seed industry giants, have entered the Breeding 4.0 era. This era integrates biotechnology, artificial intelligence (AI), and big data information technology. In contrast, China is still in a transition period between stages 2.0 and 3.0, which primarily relies on conventional selection and molecular breeding. In the context of increasingly complex international situations, accurately identifying core issues in China’s seed industry innovation and seizing the frontier of international seed technology are strategically important. These efforts are essential for ensuring food security and revitalizing the seed industry. This paper systematically analyzes the characteristics of crop breeding data from artificial selection to intelligent design breeding. It explores the applications and development trends of AI and big data in modern crop breeding from several key perspectives. These include high-throughput phenotype acquisition and analysis, multiomics big data database and management system construction, AI-based multiomics integrated analysis, and the development of intelligent breeding software tools based on biological big data and AI technology. Based on an in-depth analysis of the current status and challenges of China’s seed industry technology development, we propose strategic goals and key tasks for China’s new generation of AI and big data-driven intelligent design breeding. These suggestions aim to accelerate the development of an intelligent-driven crop breeding engineering system that features large-scale gene mining, efficient gene manipulation, engineered variety design, and systematized biobreeding. This study provides a theoretical basis and practical guidance for the development of China’s seed industry technology.

Graphical abstract

Keywords

Crop breeding / Next-generation artificial intelligence / Multiomics big data / Intelligent design breeding

Cite this article

Download citation ▾
Ying Zhang, Guanmin Huang, Yanxin Zhao, Xianju Lu, Yanru Wang, Chuanyu Wang, Xinyu Guo, Chunjiang Zhao. Revolutionizing Crop Breeding: Next-Generation Artificial Intelligence and Big Data-Driven Intelligent Design. Engineering, 2025, 44(1): 245‒255 https://doi.org/10.1016/j.eng.2024.11.034

References

[1]
Wallace JG, Rodgers-Melnick E, Buckler ES. On the road to breeding 4.0: unraveling the good, the bad, and the boring of crop quantitative genomics. Annu Rev Genet 2018; 52:421-444.
[2]
Zhang Y, Liao S, Wang J, Guo X, Yang X, Zhao C. Information technology and intelligent equipment facilitating smart breeding. J Jilin Agric Univ 2021; 43(2):119-129.
[3]
Moose SP, Mumm RH. Molecular plant breeding as the foundation for 21st century crop improvement. Plant Physiol 2008; 147(3):969-977.
[4]
Fisher RA. The correlation between relatives on the supposition of mendelian inheritance. Trans R Soc Edinb 1919; 52(2):399-433.
[5]
Wright S. Systems of mating. I. The biometric relations between parent and offspring. Genetics 1921; 6(3):111-123.
[6]
Risch N, Merikangas K. The future of genetic studies of complex human diseases. Science 1996; 273(5281):1516-1517.
[7]
Meuwissen THE, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics 2001; 157(4):1819-1829.
[8]
Zhao C. Big data of plant phenomics and its research progress. J Agric Big Data 2019; 1(2):5-14.
[9]
Zhao C, Zhang Y, Du J, Guo X, Wen W, Gu S, et al. Crop phenomics: current status and perspectives. Front Plant Sci 2019; 10:714.
[10]
Wang X, Cai Z. Era of maize breeding 4.0. J Maize Sci 2019; 27(1):1-9.
[11]
Jing H, Tian Z, Chong K, Li J. Progress and perspective of molecular design breeding. Sci Sin Vitae 2021; 51(10):1356-1365.
[12]
Farooq MA, Gao S, Hassan MA, Huang Z, Rasheed A, Hearne S, et al. Artificial intelligence in plant breeding. Trends Genet 2024; 40(10):891-908.
[13]
Hassan MA, Yang M, Rasheed A, Tian X, Reynolds M, Xia X, et al. Quantifying senescence in bread wheat using multispectral imaging from an unmanned aerial vechicle and QTL mapping. Plant Physiol 2021; 187(4):2623-2636.
[14]
Bhandari M, Chang A, Jung J, Ibrahim AMH, Rudd JC, Baker S, et al. Unmanned aerial system-based high-throughput phenotyping for plant breeding. Plant Phenome J 2023; 6(1):e20058.
[15]
Lyu M, Lu X, Shen Y, Tan Y, Wan L, Shu Q, et al. UAV time-series imagery with novel machine learning to estimate heading dates of rice accessions for breeding. Agric Meteorol 2023; 34:109646.
[16]
Gill T, Gill SK, Saini DK, Chopra Y, de JP Koff, Sandhu KS. A comprehensive review of high throughput phenotyping and machine learning for plant stress phenotyping. Phenomics 2022; 2(3):156-183.
[17]
Wu D, Wu D, Feng H, Duan L, Dai G, Liu X, et al. A deep learning-integrated micro-CT image analysis pipeline for quantifying rice lodging resistance-related traits. Plant Commun 2021; 2(2):100165.
[18]
Guo L, Wang X, Tian F. Multi-omics integrative network map, a key to accurately decoding the maize functional genomics. Chin Bull Bot 2023; 58(1):1-5.
[19]
Zhang Y, Gu S, Du J, Huang G, Shi J, Lu X, et al. Plant microphenotype: from innovative imaging to computational analysis. Plant Biotechnol J 2024; 22(4):802-818.
[20]
Barrieu F, Chaumont F, Chrispeels MJ. High expression of the tonoplast aquaporin ZmTIP1 in epidermal and conducting tissues of maize. Plant Physiol 1998; 117(4):1153-1163.
[21]
Liu M, Zhang M, Yu S, Li X, Zhang A, Cui Z, et al. A Genome-wide association study dissects the genetic architecture of the metaxylem vessel number in maize brace roots. Front Plant Sci 2022; 13:847234.
[22]
Schneider HM, Lor VS, Zhang X, Saengwilai P, Hanlon MT, Klein SP, et al. Transcription factor bHLH121 regulates root cortical aerenchyma formation in maize. Proc Natl Acad Sci USA 2023; 120(12):e2219668120.
[23]
Wen W, Gu S, Zhang Y, Yang W, Guo X. Standard framework construction of technology and equipment for big data in crop phenomics. Engineering. 2024;42:175–84.
[24]
Gui S, Yang L, Li J, Luo J, Xu X, Yuan J, et al. ZEAMAP, a comprehensive database adapted to the maize multi-omics era. iScience 2020; 23(6):101241.
[25]
Han L, Zhong W, Qian J, Jin M, Tian P, Zhu W, et al. A multi-omics integrative network map of maize. Nat Genet 2023; 55(1):144-153.
[26]
Guo Z, Li B, Du J, Shen F, Zhao Y, Deng Y, et al. LettuceGDB: the community database for lettuce genetics and omics. Plant Commun 2022; 4(1):100425.
[27]
Yang Z, Luo C, Pei X, Wang S, Huang Y, Li J, et al. SoyMD: a platform combining multi-omics data with various tools for soybean research and breeding. Nucleic Acids Res 2024; 52(D1):D1639-D1650.
[28]
Guan J, Miao H, Zhang Z, Dong S, Zhou Q, Liu X, et al. A near-complete cucumber reference genome assembly and Cucumber-DB, a multi-omics database. Mol Plant 2024; 17(8):1178-1182.
[29]
He Q, Wang C, He Q, Zhang J, Liang H, Lu Z, et al. A complete reference genome assembly for foxtail millet and Setaria-DB, a comprehensive database for Setaria-DB. Mol Plant 2024; 17(2):219-222.
[30]
Gu Z, Gong J, Zhu Z, Li Z, Feng Q, Wang C, et al. Structure and function of rice hybrid genomes reveal genetic basis and optimal performance of heterosis. Nat Genet 2023; 55(10):1745-1756.
[31]
Yang Z, Wang S, Wei L, Huang Y, Liu D, Jia Y, et al. BnIR: a multi-omics database with various tools for Brassica napus research and breeding. Mol Plant 2023; 16(4):775-789.
[32]
Liu Y, Zhang Y, Liu X, Shen Y, Tian D, Yang X, et al. SoyOmics: a deeply integrated database on soybean multi-omics. Mol Plant 2023; 16(5):794-797.
[33]
Li X, Hou S, Feng M, Xia R, Li J, Tang S, et al. MDSi: multi-omics database for Setaria italica. BMC Plant Biol 2023; 23(1):223.
[34]
Yang Z, Wang J, Huang Y, Wang S, Wei L, Liu D, et al. CottonMD: a multi-omics database for cotton biological study. Nucleic Acids Res 2023; 51(D1):D1446-D1456.
[35]
Sun M, Yan H, Zhang A, Jin Y, Lin C, Luo L, et al. Milletdb: a multi-omics database to accelerate the research of functional genomics and molecular breeding of millets. Plant Biotechnol J 2023; 21(11):2348-2357.
[36]
Cui X, Hu M, Yao S, Zhang Y, Tang M, Liu L, et al. BnaOmics: a comprehensive platform combining pan-genome and multi-omics data from Brassica napus. Plant Commun 2023; 4(5):100609.
[37]
Chen Y, Guo Y, Guan P, Wang Y, Wang X, Wang Z, et al. A wheat integrative regulatory network from large-scale complementary functional datasets enables trait-associated gene discovery for crop improvement. Mol Plant 2023; 16(2):393-414.
[38]
Xu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, et al. Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. Mol Plant 2022; 15(11):1664-1695.
[39]
Liu L, Zhan J, Yan J. Engineering the future cereal crops with big biological data: toward an intelligence-driven breeding by design. J Genet Genomics 2024; 51(8):781-789.
[40]
Montesinos-López OA, Montesinos-López A, Luna-Vázquez FJ, Toledo FH, P Pérez-Rodríguez, Lillemo M, et al. An R package for Bayesian analysis of multi-environment and multi-trait multi-environment data for genome-based prediction. G3-Genes Genom Genet 2019; 9(5):1355-1369.
[41]
Sandhu K, Patil SS, Pumphrey M, Carter A. Multitrait machine- and deep-learning models for genomic selection using spectral information in a wheat breeding program. Plant Genome 2021; 14(3):e20119.
[42]
Li J, Zhang D, Yang F, Zhang Q, Pan S, Zhao X, et al. TrG2P: a transfer learning-based tool integrating multi-trait data for accurate prediction of crop yield. Plant Commun 2024; 5(7):100975.
[43]
Zamir D. Where have all the crop phenotypes gone? PLoS Biology 2013; 11 (6): e1001595.
[44]
Li H, Li X, Zhang P, Feng Y, Mi J, Gao S, et al. Smart breeding platform: a web-based tool for high-throughput population genetics, phenomics, and genomic selection. Mol Plant 2024; 17(5):677-681.
[45]
Wang H, Lai J, Wang H, Li X. Bipartite intelligent design of crops-intelligent combination of natural variation and intelligent creation of artificial variation. J Agric Sci Technol 2022; 24(6):1-8.
[46]
Lu X, Chen X. Progress of conservation and research of crop germplasm resources in China. Scienti Agric Sinica 2003; 36(10):1125-1132.
[47]
Wan JM. Accelerate innovation in biological breeding. Public Commun of Sci & Technol 2022;14(8):F2. Chinese.
[48]
Chi P, Xie H, Zhao P, Chen F, Wu N, Tian Z, et al. Development and issues of biotech seed industry in China. Bull Chin Acad Sci 2023; 38(6):845-852.
[49]
Chong K, Li JY. The development of plant science has brought about a new revolution in breeding technology. Scientia Sinica Vitae 2021; 51(10):1353-1355.
[50]
Hickey LT, Hafeez AN, Robinson H, Jackson SA, Leal-Bertioli SCM, Tester M, et al. Breeding crops to feed 10 billion. Nat Biotechnol 2019; 37(7):744-754.
[51]
Yang W, Feng H, Zhang X, Zhang J, Doonan JH, Batchelor WD, et al. Crop phenomics and high-throughput phenotyping: past decades, current challenges, and future perspectives. Mol Plant 2020; 13(2):187-214.
[52]
Liu HJ, Yan J. Crop genome-wide association study: a harvest of biological relevance. Plant J 2019; 97(1):8-18.
[53]
Xu Y, Xu C, Xu S. Prediction and association mapping of agronomic traits in maize using multiple omic data. Heredity 2017; 119(3):174-184.
[54]
Xu Y, Liu X, Fu J, Wang H, Wang J, Huang C, et al. Enhancing genetic gain through genomic selection: from livestock to plants. Plant Commun 2020; 1(1):100005.
[55]
Harfouche AL, Jacobson DA, Kainer D, Romero JC, Harfouche AH, Scarascia G Mugnozza, et al. Accelerating climate resilient plant breeding by applying next-generation artificial intelligence. Trends Biotechnol 2019; 37(11):1217-1235.
[56]
Zheng HG, Zhao JJ, Qin XJ, Jia Q, Qi S. Overview of the global crop seed industry and strategic thinking on its development in China. Strategic Study of CAE 2021; 23(4):45-55.
[57]
Araus JL, Kefauver SC, Zaman-Allah M, Olsen MS, Cairns JE. Translating high-throughput phenotyping into genetic gain. Trends Plant Sci 2018; 23(5):451-466.
[58]
Jiang Y, Li C. Convolutional neural networks for image-based high-throughput plant phenotyping: a review. Plant Phenomics 2020; 2020:4152816.
[59]
Tardieu F, Cabrera-Bosquet L, Pridmore T, Bennett M. Plant phenomics, from sensors to knowledge. Curr Biol 2017; 27(15):R770-R783.
[60]
Wang X, Xuan H, Evers B, Shrestha S, Pless R, Poland J. High-throughput phenotyping with deep learning gives insight into the genetic architecture of flowering time in wheat. Gigascience 2019; 8(11):giz120.
[61]
Furbank RT, Jimenez-Berni JA, George-Jaeggli B, Potgieter AB, Deery DM. Field crop phenomics: enabling breeding for radiation use efficiency and biomass in cereal crops. New Phytol 2019; 223(4):1714-1727.
[62]
Zhang Y, Wang J, Du J, Zhao Y, Lu X, Wen W, et al. Dissecting the phenotypic components and genetic architecture of maize stem vascular bundles using high-throughput phenotypic analysis. Plant Biotechnol J 2021; 19(1):35-50.
[63]
Hammer G, Messina C, Wu A, Cooper M. Biological reality and parsimony in crop models—why we need both in crop improvement! in silico Plants 2019;1(1):diz010.
[64]
Nerkar G, Devarumath S, Purankar M, Kumar A, Valarmathi R, Devarumath R, et al. Advances in crop breeding through precision genome editing. Front Genet 2022; 13:880195.
[65]
Peng B, Guan K, Tang J, Ainsworth EA, Asseng S, Bernacchi CJ, et al. Towards a multiscale crop modelling framework for climate change adaptation assessment. Nat Plants 2020; 6(4):338-348.
[66]
Varshney RK, Sinha P, Singh VK, Kumar A, Zhang Q, Bennetzen JL. 5Gs for crop genetic improvement. Curr Opin Plant Biol 2020; 56:190-196.
[67]
Bailey-Serres J, Parker JE, Ainsworth EA, Oldroyd GED, Schroeder JI. Genetic strategies for improving crop yields. Nature 2019; 575(7781):109-118.
AI Summary AI Mindmap
PDF(2208 KB)

Accesses

Citations

Detail

Sections
Recommended

/