Artificial Intelligence in Cancer Immunotherapy: Navigating Challenges and Unlocking Opportunities

Wei Xiang , Lu Yu , Xiaoyuan Chen , Marco J. Herold

Engineering ›› 2025, Vol. 44 ›› Issue (1) : 13 -17.

PDF (774KB)
Engineering ›› 2025, Vol. 44 ›› Issue (1) :13 -17. DOI: 10.1016/j.eng.2024.12.014
Views & Comments
research-article
Artificial Intelligence in Cancer Immunotherapy: Navigating Challenges and Unlocking Opportunities
Author information +
History +
PDF (774KB)

Graphical abstract

Cite this article

Download citation ▾
Wei Xiang, Lu Yu, Xiaoyuan Chen, Marco J. Herold. Artificial Intelligence in Cancer Immunotherapy: Navigating Challenges and Unlocking Opportunities. Engineering, 2025, 44(1): 13-17 DOI:10.1016/j.eng.2024.12.014

登录浏览全文

4963

注册一个新账户 忘记密码

1. Introduction

Cancer continues to be a major cause of global mortality rates, with conventional treatments such as chemotherapy and radiotherapy exhibiting inconsistent efficacy, high costs, and considerable side effects. Over the past decade, a promising alternative has emerged: cancer immunotherapy, which leverages the body’s immune system to identify and eradicate cancer cells [1]. Significant progress has been made in several novel immunotherapies, including immune checkpoint inhibitors (ICIs), chimeric antigen receptor T-cell therapies, and personalized therapeutic cancer vaccines. However, the heterogeneity and complexity of cancer types still pose enormous challenges. The efficient use of large numbers of medical datasets is essential for overcoming these obstacles. Without artificial intelligence (AI), linking translational and clinical data to derive meaningful insights would remain an insurmountable task. Recently, single-cell RNA sequencing cell typing (scGPT), a pretrained generative model utilizing a database of more than 33 million cells, is developed to accurately extract important biological insights concerning cellular biology [2]. A personalized learning workflow is also proposed as a simple and effective pipeline to discover neoantigens for cancer immunotherapy [3].

AI for medical research is what antibiotics were to medicine in 1910, and the era of digital penicillin has now arrived. AI is becoming a driving force behind cancer research, providing capabilities that were once unimaginable. It is playing a critical role in identifying modifiable risk factors, discovering new drug targets, and creating innovative clinical platforms. As AI continues to evolve, it will become more than a tool but an imperative—driving breakthroughs that are otherwise unattainable and transforming every stage of cancer care, from detection and diagnosis to treatment, recurrence, and survival.

2. Key challenges in AI-powered cancer immunotherapy

AI-powered cancer immunotherapy stands at the forefront of precision medicine, combining advanced computational techniques with biological insights to personalize treatments and improve patient outcomes. By analyzing large, diverse datasets, AI models can uncover and generate previously unrecognized patterns, predict therapeutic responses, and streamline the development of drugs and vaccines [4]. Despite recent advancements, several critical challenges continue to hinder the full potential of AI in this domain. These obstacles are rooted in the inherent complexity of biological systems, the limitations of current computational tools, and the difficulty of translating research findings into practical clinical applications. Some of the key challenges in AI-powered cancer immunotherapy are categorized below.

2.1. Personalized cancer medicine development considering individual biological factors

A fundamental challenge in personalized cancer medicine development lies in the effective integration and interpretation of diverse, often incomplete datasets, such as genomic, proteomic, clinical, and imaging data, specifically for tailoring treatments to individual patients. To achieve this purpose, these varied data types must be synthesized in a way that provides actionable, patient-specific insights. However, current approaches have notable limitations. For example, one multimodal model that is proposed to integrate genomic, imaging, and clinical trial data struggles with incomplete or noisy datasets [5]. He et al. [6] introduced a context–aware autoencoder to predict clinical drug responses; however, it relies solely on cell-line data, which cannot fully capture patient-specific complexities.

These limitations significantly hinder progress in developing personalized cancer medicine. For example, tasks such as optimizing messenger RNA (mRNA) vaccine sequences are complicated by the need to navigate a vast combinatorial space while simultaneously addressing the intricate biological constraints unique to each patient [7]. Similarly, discovering predictive biomarkers requires advanced systems that can analyze noisy and incomplete data, which is essential for identifying meaningful patterns unique to a patient’s tumor [8].

2.2. Modeling complex biological systems with precision and scalability

Biological systems—especially within the context of cancer immunotherapy—exhibit complex dynamics that span the molecular, cellular, and tissue scales [9]. To accurately represent these processes, computational models must be both scalable and precise. Although significant progress has been made through advancements such as AlphaFold [10] in protein structure prediction, these tools primarily focus on static representations of individual protein structures, which limits their ability to model dynamic cellular- and tissue-level interactions. Recent efforts such as AF2Complex [11] offer a promising extension by addressing protein–protein interactions. However, these methods still fall short when applied to tumor microenvironments (TMEs), whose spatial organization and functional dynamics must be comprehensively captured [12].

2.3. Building fair and trustworthy AI systems

Integrating AI into cancer immunotherapy raises important ethical and privacy concerns that must be addressed to ensure its responsible adoption. Many existing AI systems rely on datasets that lack sufficient diversity, resulting in inconsistent performance across different populations [13]. Another challenge lies in the opacity of many AI models, which often function as “black boxes” due to their complex architectures and enormous numbers of parameters. Current methods for improving explainability, such as those outlined in Refs. [14], [15], primarily focus on image datasets, limiting their usefulness in critical areas such as clinical trials and treatment planning [16]. Privacy concerns also pose a significant challenge. Reliance on sensitive patient data introduces the risks of breaches, unauthorized access, and difficulties in meeting regulatory standards [17]. Developing advanced privacy-preserving techniques and adhering to stricter regulatory guidelines are essential for protecting patient information while enabling meaningful AI applications.

3. Opportunities for AI in cancer immunotherapy

3.1. Generative AI in mRNA-based personalized cancer vaccines

While mRNA coronavirus disease 2019 (COVID-19) vaccines have been highly successful, extending this technology to treat cancers and other diseases requires significant advancements, particularly in the design of mRNAs that exhibit elevated expression levels and extended durability. Key sequence components, such as coding sequences (CDSs) and untranslated regions (UTRs), play a critical role in the therapeutic effectiveness of mRNA. Optimizing these sequences is crucial to unlocking the full potential of mRNA therapeutics, yet it remains extremely challenging due to the vast number of possible mRNA sequences [18]. Current strategies for designing 5' UTRs explore only a small portion of the available design space, and most research has focused on comparisons with natural UTRs, lacking a broad investigation of alternative designs. To address these limitations, generative AI offers a promising approach [19]. With their exceptional capacity for translation-based reasoning, generative AI models can contribute to the development of superior mRNA molecules by uncovering potential structural and sequence patterns [7]. Utilizing these models could provide an effective strategy for mRNA design, significantly advancing the development of mRNA-based vaccines and therapeutics (Fig. 1).

3.2. Accelerating cancer drug discovery through quantum machine learning

In recent years, drug discovery has been revolutionized by AI methods [20]. Biotechnology companies focused on drug development are now establishing new business models based on AI-driven molecular modeling techniques. In this evolving landscape, AI tools such as ChatGPT and other large language models (LLMs) are making their mark in drug discovery [21]. However, current methods primarily rely on approximations to model drug–target interactions, which limits their precision and applicability [22].

Quantum machine learning (QML), founded on the principles of quantum mechanics, offers a transformative solution, as it is capable of simulating molecular interactions such as binding affinities with exceptional accuracy [23]. By closely aligning with the physical reality of these interactions, QML has the potential to significantly advance our understanding of drug–target dynamics and improve the precision of computational drug discovery. For example, Li et al. [24] proposed a hybrid quantum generator for discovering drug candidates by generating molecular structures that adhere to established chemical and physical properties. In addition, hybrid quantum-classical workflows offer practical solutions for integrating QML into real-world drug-discovery processes [25]. These advances make QML particularly valuable for applications such as prioritizing promising drug candidates, substantially accelerating the path to therapeutic discovery.

3.3. Chain-of-thought reasoning in biomarker discovery

Although biomarkers are critically important for evaluating the efficacy of cancer immunotherapies [26], the growing number of complex biomarkers increases both the costs and time required for decision-making in oncology practice. LLMs have emerged as useful tools for addressing these challenges, as they can reason across distributed and multimodal evidence sources [27], [28]. However, a prominent concern regarding LLMs is hallucination, in which models generate fabricated or erroneous information that lacks grounding in actual data [29]. Moreover, the autoregressive nature of most LLMs can result in plausible yet incorrect predictions, particularly in cases involving rare or ambiguous clinical scenarios [30].

Retrieval augmented generation (RAG) [31] and chain-of-thought (CoT) reasoning [32] offer promising approaches for addressing some of the inherent limitations of LLMs. RAG-based systems have been tested in healthcare settings, demonstrating their ability to retrieve and integrate clinical trial data for decision-making support [33]. Similarly, studies have shown that LLMs combined with CoT reasoning can be fine-tuned to provide clinically relevant insights, such as predicting patient outcomes, underscoring their potential value in cancer immunotherapy [34]. Despite their promise, these applications require rigorous validation to ensure that the retrieved knowledge is comprehensive, up-to-date, and free from bias. To mitigate these issues, the development of curated, high-quality biomedical databases with rigorous updating protocols is necessary. Additionally, collaborative efforts among healthcare institutions, researchers, and AI developers are essential to establish centralized, unbiased repositories optimized for clinical use.

3.4. Integrating multimodal large language models into tumor modeling analysis

TMEs are intricate niches characterized by cellular, molecular, and genetic diversity, particularly in the context of cancer immunotherapy. AI technologies have been employed in TME discovery to delineate tissue compartments without the use of antibodies. This includes identifying components such as normal ductal epithelium, precursor cells of pancreatic cancer, smooth muscle cells, acini, adipocytes, collagen, islets of Langerhans, lymph nodes, and nerves [35]. For instance, employing machine learning in combination with hematoxylin and eosin (H&E) staining [36] makes it possible to visualize three-dimensional (3D) tissue architecture across large tissue volumes. However, the modeling of biological interactions involving multimodal data continues to be a significant obstacle. Multimodal large language models (MLLMs) can parse and analyze vast amounts of scientific data spanning different data types to extract relevant information about cellular interactions and pathways. This capability is crucial for identifying and modeling complex relationships among various biological entities, such as cells, proteins, and signaling pathways [37].

3.5. Optimizing clinical trials with trustworthy AI

Clinical trials are a key step in developing new therapies, yet nearly 90% of products under clinical trial fail to reach clinical implementation. AI tools such as LLMs can significantly enhance this process at multiple stages [38]. The use of AI in medical imaging also facilitates the early detection of disease progression, enabling more informed and timely decisions compared with traditional methods [39]. However, the complexity and opacity of many AI models, which involve millions of parameters and nonlinear transformations, pose challenges to trust and adoption [40]. To address these issues, explainable AI frameworks must be implemented for tracing decision-making processes and identifying the key features driving predictions. Furthermore, fairness-aware algorithms are essential to mitigate biases and ensure equitable outcomes for diverse populations. Another critical consideration is safeguarding patient privacy. Robust data-anonymization methods, secure sharing protocols, and strict adherence to ethical standards are necessary to protect sensitive information. By embedding fairness, explainability, and privacy protections into AI systems, clinical trials can become more efficient, inclusive, and trustworthy, ultimately paving the way for broader adoption and improved success rates.

4. Conclusions

Over the past decade, remarkable progress has been made in AI-powered cancer immunotherapy, yet its practical realization continues to present substantial challenges. One prominent issue lies in the efficient use of real-world data, which includes electronic health records, medical imaging, and laboratory tests. Unlike structured clinical trial data, which is gathered under well-defined protocols, real-world data is often fragmented, unstructured, and heterogeneous. Utilizing this data with AI holds great potential to advance personalized immunotherapy but necessitates the creation of integrated systems equipped with advanced algorithms to manage the data complexity. Key priorities in this domain also include identifying novel therapeutic targets and developing efficient AI-driven drug-discovery platforms that can effectively match targets to candidate therapies. These initiatives may be supported by scalable and interpretable AI systems capable of processing multimodal datasets and delivering transparent, actionable insights to healthcare providers.

Future efforts should emphasize the establishment of unified frameworks for integrating multimodal data, alongside rigorous standards for model validation, reproducibility, and adherence to ethical principles. Ensuring patient privacy through advanced anonymization methods and secure data-sharing practices is equally essential for maintaining public trust. Addressing these challenges will enable AI to make transformative contributions to cancer immunotherapy, thereby driving innovation, improving treatment outcomes, and enhancing the overall quality of cancer care for patients.

Acknowledgments

This research was supported by Australian Centre for AI in Medical Innovation (ACAMI) funded by the Victoria State Government, National University of Singapore (NUHSRO/2020/133/Startup/08, NUHSRO/2023/008/NUSMed/TCE/LOA, NUHSRO/2021/034/TRP/09/Nanomedicine, NUHSRO/2021/044/Kickstart/09/LOA, and 23-0173-A0001), National Medical Research Council (MOH-001388-00, CG21APR1005, MOH-001500-00, and MOH-001609-00), Singapore Ministry of Education (MOE-000387-00 and MOET32023-0005), and National Research Foundation (NRF-000352-00).

References

[1]

Addala V, Newell F, Pearson JV, Redwood A, Robinson BW, Creaney J, et al. Computational immunogenomic approaches to predict response to cancer immunotherapies. Nat Rev Clin Oncol 2024; 21(1):28-46.

[2]

Cui H, Wang C, Maan H, Pang K, Luo F, Duan N, et al. scGPT: toward building a foundation model for single-cell multi-omics using generative AI. Nat Methods 2024; 21(8):1470-1480.

[3]

Tran NH, Qiao R, Xin L, Chen X, Shan B, Li M. Personalized deep learning of individual immunopeptidomes to identify neoantigens for cancer vaccines. Nat Mach Intell 2020; 2(12):764-771.

[4]

Cao LL, Kagan JC. Targeting innate immune pathways for cancer immunotherapy. Immunity 2023; 56(10):2206-2217.

[5]

Esteva A, Feng J, van D der Wal, Huang SC, Simko JP, DeVries S, et al. Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials. NPJ Digit Med 2022; 5(1):71.

[6]

He D, Liu Q, Wu Y, Xie L. A context–aware deconfounding autoencoder for robust prediction of personalized clinical drug response from cell-line compound screening. Nat Mach Intell 2022; 4(10):879-892.

[7]

Pardi N, Krammer F. mRNA vaccines for infectious diseases—advances, challenges and opportunities. Nat Rev Drug Discov 2024; 23(11):838-861.

[8]

Tay C, Tanaka A, Sakaguchi S. Tumor-infiltrating regulatory T cells as targets of cancer immunotherapy. Cancer Cell 2023; 41(3):450-465.

[9]

Walsh LA, Daniela FQ. Decoding the tumor microenvironment with spatial technologies. Nat Immun 2023; 24(12):1982-1993.

[10]

Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021; 596(7873):583-589.

[11]

Gao M, Nakajima D An, Parks JM, Skolnick J. AF2Complex predicts direct physical interactions in multimeric proteins with deep learning. Nat Commun 2022; 13:1744.

[12]

Brunton SL, Kutz JN. Promising directions of machine learning for partial differential equations. Nat Comput Sci 2024; 4(7):483-494.

[13]

Chen RJ, Wang JJ, Williamson DFK, Chen TY, Lipkova J, Lu MY, et al. Algorithmic fairness in artificial intelligence for medicine and healthcare. Nat Biomed Eng 2023; 7(6):719-742.

[14]

Yan S, Yu Z, Zhang X, Mahapatra D, Chandra SS, Janda M, et al. Towards trustable skin cancer diagnosis via rewriting model’s decision. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; 2023 Jun 18–22; Vancouver, BC, Canada; 2023.

[15]

DeGrave AJ, Cai ZR, Janizek JD, Daneshjou R, Lee SI. Auditing the inference processes of medical-image classifiers by leveraging generative AI and the expertise of physicians. Nat Biomed Eng. In press.

[16]

Perez-Lopez R, Ghaffari LN, Mahmood F, Kather JN. A guide to artificial intelligence for cancer researchers. Nat Rev Cancer 2024; 24(6):427-441.

[17]

Viswanathan VS, Parmar V, Madabhushi A. Towards equitable AI in oncology. Nat Rev Clin Oncol 2024; 21(8):628-637.

[18]

Mihir M, Christopher SP, Melissa JM. Tailor made: the art of therapeutic mRNA design. Nat Rev Drug Discov 2024; 23(1):67-83.

[19]

Ingraham JB, Baranov M, Costello Z, Barber KW, Wang W, Ismail A, et al. Illuminating protein space with a programmable generative model. Nature 2023; 623(7989):1070-1078.

[20]

Jayatunga MK, Xie W, Ruder L, Schulze U, Meier C. AI in smallmolecule drug discovery: a coming wave?. Nat Rev Drug Discov 2022; 21(3):175-176.

[21]

You Y, Lai X, Pan Y, Zheng H, Vera J, Liu S, et al. Artificial intelligence in cancer target identification and drug discovery. Signal Transduct Target Ther 2022; 7(1):156.

[22]

Wang Y, Xia Y, Yan J, Yuan Y, Shen HB, Pan X. ZeroBind: a protein-specific zero-shot predictor with subgraph matching for drug–target interactions. Nat Commun 2023; 14:7861.

[23]

Kim Y, Eddins A, Anand S, Wei KX, van E den Berg, Rosenblatt S, et al. Evidence for the utility of quantum computing before fault tolerance. Nature 2023; 618(7965):500-505.

[24]

Li J, Topaloglu RO, Ghosh S. Quantum generative models for small molecule drug discovery. IEEE Trans Quantum Eng 2021; 2:3103308.

[25]

Li W, Yin Z, Li X, Ma D, Yi S, Zhang Z, et al. A hybrid quantum computing pipeline for real world drug discovery. Sci Rep 2024; 14:16942.

[26]

Kraehenbuehl L, Weng CH, Eghbali S, Wolchok JD, Merghoub T. Enhancing immunotherapy in cancer by targeting emerging immunomodulatory pathways. Nat Rev Clin Oncol 2022; 19(1):37-50.

[27]

Carreras J, Roncador G, Hamoudi R. Artificial intelligence predicted overall survival and classified mature B-cell neoplasms based on immuno-oncology and immune checkpoint panels. Cancers 2022; 14(21):5318.

[28]

Kong J, Ha D, Lee J, Kim I, Park M, Im SH, et al. Network-based machine learning approach to predict immunotherapy response in cancer patients. Nat Commun 2022; 13:3703.

[29]

Huang Q, Dong X, Zhang P, Wang B, He C, Wang J, et al. OPERA: alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; 2024 Jun 17–21; Seattle, WA, USA; 2024.

[30]

Farquhar S, Kossen J, Kuhn L, Gal Y. Detecting hallucinations in large language models using semantic entropy. Nature 2024; 630(8017):625-630.

[31]

Lewis P, Perez E, Piktus A, Petroni F, Karpukhin V, Goyal N, et al. Retrieval-augmented generation for knowledge-intensive NLP tasks. Adv Neural Inf Process Syst 2020; 33:9459-9474.

[32]

Wei J, Wang X, Schuurmans D, Bosma M, Ichter B, Xia F, et al. Chain-of-thought prompting elicits reasoning in large language models. In: Proceedings of the Thirty-Sixth Conference on Neural Information Processing Systems; 2022 Nov 28–Dec 9; New Orleans, LA, USA; 2022.

[33]

Jeong M, Sohn J, Sung M, Kang J. Improving medical reasoning through retrieval and self-reflection with retrieval-augmented large language models. Bioinformatics 2024; 40(Suppl 1):i119-i129.

[34]

Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, et al. Large language models encode clinical knowledge. Nature 2023; 620(7972):172-180.

[35]

Sorin M, Rezanejad M, Karimi E, Fiset B, Desharnais L, Perus LJM, et al. Single-cell spatial landscapes of the lung tumour immune microenvironment. Nature 2023; 614(7948):548-554.

[36]

Kiemen AL, Braxton AM, Grahn MP, Han KS, Babu JM, Reichel R, et al. CODA: quantitative 3D reconstruction of large tissues at cellular resolution. Nat Methods 2022; 19(11):1490-1499.

[37]

McDonald TO, Cheng YC, Graser C, Nicol PB, Temko D, Michor F. Computational approaches to modelling and optimizing cancer treatment. Nat Rev Bioeng 2023; 1(10):695-711.

[38]

Yuan C, Ryan PB, Ta C, Guo Y, Li Z, Hardin J, et al. Criteria2Query: a natural language interface to clinical databases for cohort definition. J Am Med Inform Assoc 2019; 26(4):294-305.

[39]

Lu L, Dercle L, Zhao B, Schwartz LH. Deep learning for the prediction of early on-treatment response in metastatic colorectal cancer from serial medical imaging. Nat Commun 2021; 12:6654.

[40]

Dolezal JM, Wolk R, Hieromnimon HM, Howard FM, Srisuwananukorn A, Karpeyev D, et al. Deep learning generates synthetic cancer histology for explainability and education. NPJ Precis Oncol 2023; 7(1):49.

RIGHTS & PERMISSIONS

THE AUTHOR

PDF (774KB)

6426

Accesses

0

Citation

Detail

Sections
Recommended

/