Engineering is the practical activity in which human beings transform the world with the help of science and technology. An engineering front refers to the key direction that is forward looking, leading, and exploratory. It has a major influence and plays a leading role in the future development of engineering science and technology and serves as an important guide for cultivating the capabilities for innovation in the field of engineering science and technology. According to the innovation stage of the front, Engineering fronts are categorized into engineering research fronts which focuses on the theoretical exploration and engineering development fronts which focuses on the practical application of engineering science and technology.

Underpinned by experts’ evaluation and data, the Global Engineering Fronts 2022 project has adopted multi-round interactions between experts and data for iterative research and analysis, realizing a deep integration of judgments of experts and data analyses. In 2022, 95 global engineering research fronts and 93 global engineering development fronts are selected, with 29 engineering research fronts and 29 engineering development fronts being listed as the key focus of interpretation. The distribution of engineering research and engineering development fronts among the nine fields is shown in Table 1.1.

The research of fronts consists of three stages: data preparation, data analysis, and expert review. In the data preparationstage, the domain, library, and information experts formulate the literature and patent search strategy according to the technological systems to define the scope of data mining. In the data analysis stage, the co-citation clustering method is used to obtain clustered literature topics and the co-word clustering method to obtain the Theme Scape patent maps. In the expert review stage, the fronts are gradually selected and determined through front topic section, front name revision, expert panel discussions, and other methods. To address the problem of lacking of novelty due to algorithm limitations or lags in data mining, experts from different fields were encouraged to check the results of the data analysis to revise, combine, and expand the engineering fronts. A flowchart of the operation procedure of the Global Engineering Fronts project is illustrated in Figure 1.1, in which the green, purple, and red boxes indicate the data analysis, expert research, and multi-round iterative interactions between experts and data, respectively.

《1 Identification of engineering research fronts》

1 Identification of engineering research fronts

The identification of the engineering research fronts is performed in two steps. The first step involves determining the clustered literature topics through the co-citation clustering method according to the SCI journal papers and

《Table 1.1》

Table 1.1 Distribution of engineering research and engineering development fronts among the nine fields

Field Number of engineering research fronts Number of engineering development fronts
Mechanical and Vehicle Engineering 10 10
Information and Electronic Engineering 10 10
Chemical, Metallurgical, and Materials Engineering 12 10
Energy and Mining Engineering 12 12
Civil, Hydraulic, and Architectural Engineering 10 10
Environmental and Light Textile Engineering 10 10
Agriculture 11 11
Medicine and Health 10 10
Engineering Management 10 10
Total 95 93

data of conference proceedings collected from the Web of Science Core Collection. The second step is calling for the engineering research themes through expert nomination. Candidate engineering research fronts that were identified through expert argumentation and refinement went through questionnaire surveys and multiple rounds of expert discussions, yielding 95 engineering research fronts in the nine fields.

《1.1 Acquisition and preprocessing of paper data》

1.1 Acquisition and preprocessing of paper data

Firstly, the mapping relationship between the technology systems of nine academic departments of the CAE and Web of Science disciplines was constructed, and the lists of academic journals and academic conferences corresponding to each field were obtained. After the correction and supplementation by domain experts, the sources for data analysis in the nine fields were determined to be 12 709 journals and 48 260 conferences. In addition, for articles from 79 multidisciplinary sciences journals, the field of each article was reassigned to the most relevant subject area according to the subjects cited in its references.

Accordingly, the articles and conference papers published between 2016 and 2021 were retrieved (the cut-off date of the citations was January 2022). For each field, the differences between journals and conferences, the publication year, and so on were comprehensively considered. By processing journals and conference proceedings separately, papers with high impact that are ranked among the top 10% of the citations were selected as the original dataset for the analysis of the research fronts, referring to the highly cited papers in Web of Science, as shown in Table 1.1.1.

《1.2 Mining of clustered literature topics》

1.2 Mining of clustered literature topics

Through the co-citation clustering analysis of the top 10% highly cited papers in the aforementioned nine data datasets, all the clustered literature topics in the nine fields were obtained. The topics of the journal and conference papers published during 2016–2019 were selected according to the number of core papers, total number of citations, mean publication year, and the proportion of consistently cited papers. Consequently, 35 diverse literature topics were extracted. The topics of the journal and conference papers published during 2020–2021 were selected according to the number of core papers, total number of citations, and proportion of consistently cited papers. Thereafter, 25 different literature topics were obtained. Overlapping topics were replaced by topics that did not intersect with other fields. In addition, subjects that were not covered by clustering topics were extracted separately by keywords. Finally, 772 clustered literature topics in the nine fields were obtained (Table 1.2.1).

《Figure 1.1》

Figure 1.1 Operation procedure of the Global Engineering Fronts project

《Table 1.1.1》

Table 1.1.1 Number of journals, conferences, and top 10% highly cited papers in each field

No. Field Number of journals Number of conferences Number of top 10% highly cited papers
1 Mechanical and Vehicle Engineering 533 3 026 96 918
2 Information and Electronic Engineering 999 21 306 220 667
3 Chemical, Metallurgical, and Materials Engineering 1 209 4 259 292 796
4 Energy and Mining Engineering 938 2 682 149 334
5  Civil, Hydraulic, and Architectural Engineering 650 1 284 71 484
6 Environmental and Light Textile Engineering 1 362 1 376 225 197
7 Agriculture 1 363 934 165 566
8 Medicine and Health 4 835 12 072 499 690
9 Engineering Management 820 1 321 55 778

《Table 1.2.1》

Table 1.2.1 Statistics of co-citation clustering results in each field

No. Field Number of Clustered topics Number of core papers Number of candidate engineering research hotspots
1 Mechanical and Vehicle Engineering 10734 43833 103
2 Information and Electronic Engineering 22342 96506 71
3 Chemical, Metallurgical, and Materials Engineering 29447 119038 61
4 Energy and Mining Engineering 16204 68338 96
5 Civil, Hydraulic, and Architectural Engineering 7893 34302 126
6 Environmental and Light Textile Engineering 24309 98407 93
7 Agriculture 17736 69902 78
8 Medicine and Health 50805 214345 65
9 Engineering Management 5662 22140 79

《1.3   Determination and interpretation of research fronts》

1.3   Determination and interpretation of research fronts

While processing and mining the paper data, domain experts put forward research front issues via a comprehensive analysis of data pertaining to science and technology news and policies, and integrated them into each stage of front determination.

In the data preparation stage, the library and information experts transform the front research issues raised by domain experts into search strategies, which are an important part of the initial data source. In the data analysis stage, for subjects that are not covered by clustered literature topics, domain experts provide keywords, representative papers, or representative journals for customized search and mining. In the expert review stage, domain experts will check for omissions based on the clustered literature results and conduct the second round of nomination for fronts that do not exist in the data mining results but are considered important. Library and information experts provide data support. Finally, the domain experts merge, revise, and refine the engineering research front topics obtained through data mining and expert nomination. Subsequent to questionnaire surveys and multiple rounds of conference discussions, approximately 10 engineering research fronts were selected for each field.

In each field, three or four key research fronts were selected according to the development prospects and the significance. Authoritative experts in the front direction were invited to interpret the fronts in detail from the perspectives of national and institutional layout, cooperation networks, development trends, and R&D priorities.

《2 Identification of engineering development fronts》

2 Identification of engineering development fronts

The identification of the engineering development fronts is primarily performed using two methods. First, based on the Derwent Innovation patent database, the top 10 000 patent families of 53 subjects in the nine fields with highest citations were clustered, and 53 Theme Scape maps were obtained. The domain experts interpreted the candidate engineering development fronts from these maps. The second approach involves nomination by experts. The candidate development fronts obtained through these two methods went through multiple rounds of expert discussion and questionnaire surveys. Consequently, approximately 10 engineering development fronts were identified in each field.

《2.1 Acquisition and preparation of patent data》

2.1 Acquisition and preparation of patent data

In the data preparation stage, based on the Derwent Innovation patent database, the initial patent data retrieval scope and search strategies for the 53 disciplines of the nine fields were determined using the Derwent World Patents Index (DWPI) Manual Codes, International Patent Classification numbers, United States Patent Classification numbers, and other patent classification numbers and specific technical keywords. Domain experts deleted, supplemented, and improved the patent search strategies to determine the patent retrieval criteria and nominated candidate front topics, which were then transformed into patent search strategies by library and information experts. The above two parts of the search strategies were integrated to determine the patent search strategies of the 53 disciplines, searched in the “enhanced patent data— DWPI and Derwent Patent Citation Index (DPCI)” collection, and obtained the patent literature of the corresponding disciplines. The retrieved patents were published between 2016 and 2021; the cut-off date of the citations was January 2022. To further concentrate patent literature, the millions of patent documents were screened according to the annual average number of citations and technical coverage width indicators, thereby obtaining the top 10 000 patent families in each discipline.

《2.2 Mining of patent topics》

2.2 Mining of patent topics

Semantic similarity analysis of patent texts were conducted for the top 10 000 highly cited patents on 53 disciplines in the nine fields. Based on literature topic clustering using DWPI titles and abstracts, 53 Theme Scape patent maps were obtained, which effectively display the distribution of the engineering development techniques and show the overall technical information of the collected patents in the form of keywords.

Experts from various fields, with the assistance of library and information experts, selected the engineering development fronts from Theme Scape maps, merged similar fronts, and determined the final development fronts. Finally, they selected the candidate engineering development fronts of each specialty group. To avoid missing emerging fronts, domain experts interpreted the data from patents with few citations and poor correlation in the Theme Scape maps.

《2.3 Determination and interpretation of development fronts》

2.3 Determination and interpretation of development fronts

While processing and mining the patent data, domain experts identified issues on development fronts based on a comprehensive analysis of other data, such as science and technology news and policies, and integrated them into each stage of front determination.

In the data preparation stage, the library and information experts transformed the key front issues raised by domain experts into patent search strategies as an important part of the basic dataset. In the data analysis stage, domain experts conducted the second round of front nomination to supplement the emerging technology points that are significant but have been submerged in data mining with few patents. In the expert review stage, domain experts studied highly cited patents, and library and information experts assisted them in interpreting patent maps from multiple perspectives, such as “peaks” “blue oceans” and “islands”. Finally, domain experts merged, revised, and refined the interpreted results of the patent maps and fronts nominated by experts to obtain candidate engineering development fronts, and then selected approximately 10 engineering development fronts in each field through questionnaire surveys or multiple rounds of seminars.

In each field, three or four key development fronts were selected according to the development prospects and the significance. Authoritative experts in the front direction were invited to interpret the fronts in detail from the perspectives of national and institutional layout, cooperation networks, development trends, and R&D priorities.

《3 Development roadmap》

3 Development roadmap

Technology roadmaps are an important tool to depict the development trend of technologies. To strengthen the academic leading role of the engineering fronts, the Global Engineering Fronts 2022 project conducted detailed analysis on the development focuses and trends of three key engineering research fronts and three key engineering development fronts for each field, and drew a development roadmap in a visual way for each front in the next five to ten years.

《4 Terminologies》

4 Terminologies

Publications/Papers: This includes peer-reviewed and published journal articles, reviews, and conference papers retrieved from Web of Science.

High-impact papers: Papers that are in the top 10% in terms of citation frequency are considered to be of high impact, taking into account the year of publication and journal subject category.

Clustered literature topic: A combination of topics and keywords obtained through a co-citation clustering analysis of high-impact papers.

Core papers: Depending on how the research front is obtained, core papers have two meanings. If it originates from a front that is obtained from data mining and revised by experts, the core paper is considered as a high-impact paper. If it comes from a front nominated by domain experts, the core paper is included in the top 10% of papers in terms of citation frequency obtained using the corresponding search strategy.

Percentage of core papers: The proportion of core papers in which a country or institution participates among the total number of core papers produced by all countries or institutions.

Citing papers: Collection of papers that have cited core papers.

Citation number: The number of times the paper has been cited by the Web of Science Core Collection.

Mean publication year: Average publication years for all papers among the clustered literature topics.

Consistently cited papers: Papers included in the top 10% in terms of citation velocity.

Citation velocity: An indicator used to measure the growth rate of the cumulative number of citations for a certain period. In this study, the citation velocity of each paper begins with the month of publication, and the cumulative number of citations per month was recorded.

Highly cited patents: The around top 10 000 DWPI families ranked by the average annual DPCI citations.

Core patents: According to the different ways of obtaining a development front, core patents have two meanings. If they come from the fronts of the patent map, core patents refer to highly cited patents. If they are from the fronts nominated by domain experts, core patents refer to all patents obtained by topic search.

Percentage of published patents: The proportion of core patents in which a country (priority country) or institution participates among the total number of core patents produced by all countries or institutions.

Theme Scape map: A themed landscape representing the overall outlook of a specific industry or technical field. It is a visual presentation in the form of a map obtained by analyzing the semantic similarity of value added DWPI information of patents to gather the patents of related technologies.

Technical coverage width: It is measured by the number of DWPI Classes to which each DWPI patent family covers. This indicator can reflect the breadth of the technology coverage of each patent.

Specialty division criteria system of the academic divisions of the CAE: This is specified in the Specialty Division Criteria of the Academic Divisions of the Chinese Academy of Engineering for Member Election (Trial). It refers to 53 specialized disciplines covered by the nine academic divisions of engineering science and technology, including Mechanical and Vehicle Engineering; Information and Electronic Engineering; Chemical, Metallurgical, and Materials Engineering; Energy and Mining Engineering; Civil, Hydraulic, and Architectural Engineering; Environmental and Light Textile Engineering; Agriculture; Medicine and Health; and Engineering Management.