Prompt learning in computer vision: a survey

2024, Volume 25, Issue 1

Frontiers of Information Technology & Electronic Engineering, 2024, Volume 25, Issue 1. doi: 10.1631/FITEE.2300389

1. Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai 200438, China; 2. Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China; 3. MOE Frontiers Center for Brain Science, Fudan University, Shanghai 200433, China; 4. Shanghai Center for Brain Science and Brain-Inspired Technology, Shanghai 201210, China.

Received: 2023-05-31 Accepted: 2024-02-19 Available online: 2024-02-19

Abstract

Prompt learning has attracted broad attention in computer vision since the rapid rise of large pre-trained vision-language models (VLMs). Building on the close relationship between visual and language information established by VLMs, prompt learning has become a crucial technique in many important applications such as artificial intelligence generated content (AIGC). In this survey, we provide a progressive and comprehensive review of visual prompt learning as related to AIGC. We begin by introducing VLMs, the foundation of visual prompt learning. Then, we review visual prompt learning methods and prompt-guided generative models, and discuss how to improve the efficiency of adapting AIGC models to specific downstream tasks. Finally, we outline some promising research directions concerning prompt learning.

Keywords

Prompt learning; Visual prompt tuning (VPT); Image generation; Image classification; Artificial intelligence generated content (AIGC)
