An engineering front refers to the key direction that is forwardlooking, leading, and exploratory. It has a major influence  and a leading role in the future development of engineering  science and technology and serves as an important guide  for cultivating the capabilities for innovation in the field of  engineering science and technology. The front is focused  on the theoretical research or application development of  engineering science and technology. Engineering fronts are  divided into engineering research fronts and engineering  development fronts. In this research, the engineering front  identification is based on public data and expert research;  therefore, it does not involve nonpublic domains.  

Underpinned by experts’ evaluation and data, the 2021  Global Engineering Fronts project has adopted multiround  interactions between experts and data for iterative research  and analysis, realizing a deep integration of judgments of  experts and data analyses. In 2021, 93 global engineering  research fronts and 93 global engineering development fronts  are selected, with 28 engineering research fronts and 28  engineering development fronts being listed as the key focus  of interpretation. The distribution of engineering research  and engineering development fronts among the nine fields is  shown in Table 1.1.  

The research of fronts consists of three stages: data  preparation, data analysis, and expert review. In the data  preparation stage, domain, library, and information experts  revise the initial literature and patent data to clarify the scope  of data mining. In the data analysis stage, the co-citation  clustering method is used to obtain clustered literature topics  and ThemeScape patent maps. In the expert review stage, the  fronts are gradually selected and determined through patent  map interpretation, expert panel discussions, questionnaire  surveys, and other methods. Then the list of the Top 10 fronts  is modified, and the front naming is improved based on  the performance of the front in literature or patent data. To  address the problem of lacking of novelty due to algorithm  limitations or lags in data mining, experts from different fields  were encouraged to check the results of the data analysis to  fill in the gaps and nominate engineering fronts. A flowchart  of the operation procedure of the Global Engineering Fronts  project is illustrated in Figure 1.1, in which the green, purple,  and red boxes indicate the data analysis, expert research, and  multiround iterative interactions between experts and data,  respectively.  

《1  Identification of engineering research  fronts》

1  Identification of engineering research  fronts

 The identification of the engineering research fronts is  performed in two steps. The first step involves determining  the clustered literature topics through the clustering method  of co-citation according to the SCI journal papers and data of  

《Table 1.1》

Table 1.1 Distribution of engineering research and engineering development fronts among the nine fields  

Field Number of engineering research fronts Number of engineering development fronts
Mechanical and Vehicle Engineering 10 10
Information and Electronic Engineering 10 10
Chemical, Metallurgical, and Materials Engineering 11 11
Energy and Mining Engineering 12 12
Civil, Hydraulic, and Architectural Engineering 10 10
Environmental and Light Textile Engineering 10 10
Agriculture 10 10
Medicine and Health 10 10
Engineering Management 10 10
Total 93 93

 

《Figure 1.1》

Figure 1.1 Operation procedure of the Global Engineering Fronts project

conference proceedings collected from the Web of Science  Core Collection of Clarivate. The second step is defining the  engineering research fronts through expert nomination.  Alternative engineering research fronts that were identified  through expert argumentation and refinement went through  questionnaire surveys and multiple rounds of expert  discussions, yielding 93 engineering research fronts in the  nine fields.  

《1.1 Acquisition and preprocessing of paper  data》

1.1 Acquisition and preprocessing of paper  data

 Clarivate mapped the fields of Web of Science and the nine  academic division fields of the CAE and obtained a list of  journals and conferences in each field. After the correction  and supplementation by domain experts, the sources for  data analysis in the nine fields were determined to be 12 215  journals and 44 153 conferences. In addition, for articles from  72 multidisciplinary sciences journals such as Nature and  Science, the field of each article was reassigned to the most  relevant subject area according to the subjects cited in its  references. Accordingly, the articles and conference papers  published between 2015 and 2020 were retrieved (the cut-off  date of the citations was January 2021).  

For each field, Clarivate comprehensively considered the  differences between journals and conferences, the publication  year, and so on. Next, the list of aforementioned papers  was retrieved and extracted. By processing journals and  conference proceedings separately, papers with high impact  that are ranked among the top 10% of the citations were  selected as the original dataset for the analysis of the research  hotspots, as shown in Table 1.1.1.  

《1.2 Mining of clustered literature topics》

1.2 Mining of clustered literature topics

 Through the co-citation clustering analysis of the top 10%  highly cited papers in the aforementioned nine data datasets,  all the clustered literature topics in the nine fields were  obtained. The topics of papers published during 2019–2020  were selected according to the number of core papers, total  number of citations, and proportion of consistently cited  papers. Thereafter, 25 different literature topics were obtained.  The topics of the papers published before 2019 were selected  according to the mean publication year of core publications  and the proportion of consistent citations. Consequently, 35  diverse literature topics were extracted. Overlapping topics  were replaced by topics that did not intersect with other fields.  In addition, subjects that were not covered by clustering topics  were extracted separately by keywords. Finally, 775 clustered  literature topics in the nine fields were obtained (Table 1.2.1).  

《1.3 Determination and interpretation of research fronts  》

1.3 Determination and interpretation of research fronts  

While processing and mining the paper data, domain experts  put forward research front issues by a comprehensive analysis  of data pertaining to science and technology news and  national strategic layouts of different countries, and integrated  them into each stage of front determination.  

In the data preparation stage, the library and information  experts transform the front research questions raised by the  domain experts into search formulas, which are an important  part of the initial data source. In the data analysis stage, for  subjects that are not covered by clustered literature topics,  the domain experts provide keywords, representative  papers, or representative journals to support Clarivate for  customized search and mining. In the expert review stage,  the domain experts will check for omissions based on the  clustered literature results provided by Clarivate and conduct  the second round of nomination for fronts that do not exist in  the data mining results but are considered important. Library  and information experts provide data support. Finally, the  domain experts merge, revise, and refine the engineering  research front topics obtained through data mining and  expert nomination. Subsequent to questionnaire surveys and  multiple rounds of conference discussions, approximately 10  engineering research fronts were selected for each field.

 In each field, three key research fronts were selected  according to the development prospects and the significance.

《Table 1.1.1》

Table 1.1.1 Number of journals and conferences in each field and number of top 10% highly cited papers

No. Field Number of journals Number of conferences Number of top10% highly cites papers
1 Mechanical and Vehicle Engineering 521 2 779 73 481
2 Information and Electronic Engineering 987 18 590 204 705
3 Chemical, Metallurgical, and Materials Engineering 1 188 4 068 274 485
4 Energy and Mining Engineering 616 2 338 115 816
5 Civil, Hydraulic, and Architectural Engineering 576 1 154 63 930
6 Environmental and Light Textile Engineering 1 345 1 288 207 518
7 Agriculture 1 484 1 093 204 873
8 Medicine and Health 4 685 11 583 476 629
9 Engineering Management 813 1 260 50 986
10 IPN 3 7.69% 127

《Table 1.2.1 》

Table 1.2.1 Statistics of co-citation clustering results in each field

No. Field Number of topics Number of top10% highly cites papers Number of alternative engineering research hotspots
1 Mechanical and Vehicle Engineering 8 158 33 822 114
2 Information and Electronic Engineering 20 495 88 795 67
3 Chemical, Metallurgical, and Materials Engineering 28 481 117 286 66
4 Energy and Mining Engineering 12 763 54 261 91
5 Civil, Hydraulic, and Architectural Engineering 7 223 31 099 102
6 Environmental and Light Textile Engineering 22 872 94 186 94
7 Agriculture 22 006 89 460 92
8 Medicine and Health 49 923 211 212 67
9 Engineering Management 5 147 21 001 82
10 IPN 3 7.69% 127

Authoritative experts in the front direction were invited to  interpret the fronts in detail from the perspectives of national  and institutional layout, cooperation networks, development  trends, and R&D priorities.  

《2  Identification of engineering  development fronts  》

2  Identification of engineering  development fronts  

The identification of the engineering development fronts is  primarily performed using two methods. First, based on the  Derwent Innovation patent database of Clarivate, the top 10  000 patent families of 53 subjects in the nine fields with high  citations were clustered, and 53 ThemeScape maps were  obtained. The domain experts interpreted the alternative  engineering development fronts from these maps. The  second approach involves nomination by expert or patent  analysis by small peer group. The alternative development  fronts obtained through these two methods went through  questionnaire sur veys and several special seminars.  Consequently, approximately 10 engineering development  fronts were identified in each field.  

《2.1 Acquisition and preparation of the  ThemeScape maps》

2.1 Acquisition and preparation of the  ThemeScape maps

 In the data preparation stage, based on the Derwent  Innovation patent database, Clarivate developed the initial  patent data retrieval scope and search strategies for the 53  disciplines of the nine fields using the Derwent World Patents  Index (DWPI) Manual Codes, International Patent Classification  numbers, United States Patent Classification numbers , and  other patent classification numbers, and specific technical  keywords. Domain experts deleted, supplemented, and  improved the DWPI Manual Codes to determine the patent  retrieval criteria; further, the nominated alternative front  topics were selected, which were then transformed into patent  search formulas by library and information experts. Clarivate  integrated the above two parts of the search formulas,  determined the patent search formulas of the 53 disciplines,  searched in the “enhanced patent data - DWPI and DPCI  (Derwent Patent Citation Index)” collection, and obtained  the patent literature of the corresponding disciplines. The  retrieved patents were published between 2015 and 2020; the  cut-off date of the citations was January 2021.  

To further concentrate patent literature, the millions of patent  documents were screened according to the annual average  number of citations and technical coverage width indicators,  thereby obtaining the top 10 000 patent families in each  discipline.  

《2.2 Mining of patent topics  》

2.2 Mining of patent topics  

Semantic similarity analysis of patent texts were conducted  for the top 10 000 highly cited patents on 53 disciplines in  the nine fields. Based on literature topic clustering using  DWPI titles and abstracts, 53 ThemeScape patent maps were  obtained, which effectively display the distribution of the  engineering development techniques and show the overall  technical information of the collected patents in the form of  keywords.  

Experts from various fields, with the assistance of library and  information experts, selected the engineering development  fronts from ThemeScape maps, merged similar fronts, and  determined the final development fronts. Finally, they  selected the alternative engineering development fronts  of each specialty group. To avoid missing emerging fronts,  domain experts interpreted the data from patents with few  citations and poor correlation in the ThemeScape maps.  

《2.3 Determination and interpretation of  development fronts  》

2.3 Determination and interpretation of  development fronts  

While processing and mining the patent data, domain  experts identified issues on development front based on a  comprehensive analysis of other data, such as science and  technology news and national strategic layouts of different  countries, and integrated them into each stage of front  determination.  In the data preparation stage, the library and information  experts transformed the key front issues raised by the domain  experts into patent search formulas as an important part  of the basic dataset.

In the data analysis stage, the domain  experts conducted the second round of front nomination to supplement the emerging technology points that are  significant, but have been submerged in data mining with few  patents. In the expert review stage, the domain experts studied  highly cited patents, and the library and information experts  assisted them in interpreting patent maps from multiple  perspectives, such as “peaks” and “blue oceans”. Finally, the  domain experts merged, revised, and refined the interpreted  results of the patent maps and fronts nominated by experts to  obtain candidate engineering development fronts, and then  selected approximately 10 engineering development fronts in  each field through questionnaire surveys or multiple rounds  of seminars.

 In each field, three key development fronts were selected  according to the development prospects and the significance.  Authoritative experts in the front direction were invited to  interpret the fronts in detail from the perspectives of national  and institutional layout, cooperation networks, development  trends, and R&D priorities.  

《3  Terminologies  》

3  Terminologies  

Publications/Papers: This includes peer-reviewed and  published journal articles, reviews, and conference papers  retrieved from Web of Science.  

High-impact papers: Papers that are in the top 10% in terms  of citation frequency are considered to be of high impact,  taking into account the year of publication and journal subject  category.  

Clustered literature topic: A combination of topics and  keywords obtained through a co-citation clustering analysis of  high-impact papers.  

Core papers: Depending on how the research front is  obtained, core papers have two meanings. If the paper  originates from a front revised by data mining experts, then  the core paper is considered as a high-impact paper. If it  comes from a front nominated by domain experts, the core  paper is included in the top 10% of papers in terms of citation  frequency obtained using the corresponding search formula.  

Percentage of core papers: The proportion of core papers  in which a country or institution participates among the  total number of core papers produced by all countries or  institutions.

Citing papers: Collection of papers that have cited core  papers.  

Citation number: The number of times the paper has been  cited by the Web of Science Core Collection of Clarivate .  Mean publication year: Average publication years for all  papers among the clustered literature topics.  

Citation velocity: An indicator used to measure the growth  rate of the cumulative number of citations for a certain period.  In this study, the citation velocity of each paper begins with  the month of publication, and the cumulative number of  citations per month was recorded.  

Consistently cited papers: Papers included in the top 10% in  terms of citation velocity.  

Highly cited patents: The around top 10 000 DWPI families  ranked by the average annual DPCI citations.  

Core patents: According to the different ways of obtaining  the development front, core patents have two meanings. If it  comes from the front of the patent map, the core patent refers  to the highly cited patent; if it is from the front nominated by  domain experts, the core patent refers to all patent obtained  by topic search.  

Percentage of published patents: The proportion of  published patents in which a country (priority country) or  institution participates among the total number of published  patents produced by all countries or institutions.  

ThemeScape map: A themed landscape representing the  overall outlook of a specific industry or technical field. It is a  visual presentation in the form of a map obtained by analyzing  the semantic similarity of value added DWPI information of  patents to gather the patents of related technologies.  

Technical coverage width: It is measured by the number of  DWPI Classes to which each DWPI patent family covers. This  indicator can reflect the breadth of the technology coverage of  each patent.  

Specialty division criteria system of the academic divisions  of the CAE: This includes 53 specialized fields covered by nine  academic divisions of engineering science and technology. It is  determined according to the Academic Divisions and Specialty  Division Criteria of the Chinese Academy of Engineering for the  Election of Academicians (for Trial Implementation).