Journal Home Online First Current Issue Archive For Authors Journal Information 中文版

Frontiers of Information Technology & Electronic Engineering >> 2021, Volume 22, Issue 9 doi: 10.1631/FITEE.2000286

A review on cyber security named entity recognition

Affiliation(s): School of Software, Yunnan University, Kunming 650091, China; Key Laboratory of Software Engineering of Yunnan Province, Kunming 650091, China; Engineering Research Center of Cyberspace, Kunming 650091, China; less

Received: 2020-06-13 Accepted: 2021-09-10 Available online: 2021-09-10

Next Previous

Abstract

With the rapid development of Internet technology and the advent of the era of big data, more and more texts are provided on the Internet. These texts include not only security concepts, incidents, tools, guidelines, and policies, but also risk management approaches, best practices, assurances, technologies, and more. Through the integration of large-scale, heterogeneous, unstructured information, the identification and classification of entities can help handle issues. Due to the complexity and diversity of texts in the domain, it is difficult to identify security entities in the domain using the traditional methods. This paper describes various approaches and techniques for NER in this domain, including the rule-based approach, dictionary-based approach, and based approach, and discusses the problems faced by NER research in this domain, such as conjunction and disjunction, non-standardized naming convention, abbreviation, and massive nesting. Three future directions of NER in are proposed: (1) application of unsupervised or semi-supervised technology; (2) development of a more comprehensive ontology; (3) development of a more comprehensive model.

Related Research