Journal Home Online First Current Issue Archive For Authors Journal Information 中文版

Strategic Study of CAE >> 2022, Volume 24, Issue 4 doi: 10.15302/J-SSCAE-2022.04.014

Next-Generation Database Benchmark for Financial Scenarios

1. Institute of Financial Technology, Fudan University, Shanghai 200433, China;

2. School of Computer Science, Fudan University, Shanghai 200433, China

Funding project:Chinese Academy of Engineering project “Strategic Research on Financial Data Security Governance Intelligentization" (2022-XY-12); National Natural Science Foundation of China project (92046024, 92146002) Received: 2022-06-19 Revised: 2022-07-14 Available online: 2022-08-04

Next Previous

Abstract

As the major financial entity in China, banks have high performance and security requirements for databases and data service solutions. With the progression of data application services in banking, the data types and business scenarios become more diverse, and it is difficult for users to make optimal choices among a wide diversity of database products and data service solutions. In combination with the data application demands of the financial industry, this study comprehensively analyzes the current status of applications of databases in banking, particularly the status and challenges of database localization in recent years, by using literature research and theoretical analysis. In addition, we systematically investigate the database benchmarks of China and other countries, and further prospect the necessity and importance of constructing next-generation database benchmarks for financial scenarios. We find that current database benchmarks have many deficiencies and face various challenges in dealing with the database testing in financial scenarios owing to the complex business logic, diverse data patterns, and high security requirements. Therefore, to build a nextgeneration database benchmark that can meet the requirements of financial scenarios, we propose several suggestions to address these challenges, which involve the aspects of workloads, data schemes, metrics, and technical architecture.

Figures

Fig. 1

Fig. 2

Fig. 3

References

[ 1 ] Lin Y F, Fu C H, Ren X M. How Financial Innovation Promotes High Quality Development: A Perspective of New Structural Economics [J]. Finance Forum, 2019, 24(11): 3-13. Chinese.

[ 2 ] The People’s Bank of China. Total assets of financial institutions reached 381.95 trillion yuan at end-2021 [EB/OL]. (2022-03-15) [2022-06-22]. Chinese. link1

[ 3 ] Jazzyear. China Fintech Report Series [R/OL]. Online: Jazzyear, 2020. Chinese. link2

[ 4 ] The People’s Bank of China. The People’s Bank of China issued the Fintech Development Plan for 2022-2025 [EB/OL]. (2022-01-04) [2022-06-06]. Chinese. link1

[ 5 ] Hu L M. Application and Prospect of Distributed Database in Financial Industry [J]. FinTech Time, 2020(05): 25-33. Chinese.

[ 6 ] Poess M and Floyd C. New TPC Benchmarks for Decision Support and Web Commerce [J]. ACM Special Interest Group on Management of Data Record, 2000, 29(4): 64-71.

[ 7 ] Nambiar R O and Poess M. The Making of TPC-DS [C]. Seoul: Proceedings of the 32nd International Conference on Very Large Data Bases, 2006: 1049-1058.

[ 8 ] China Academic of Information and Communications Technology. Database Development Research Report (2021) [R]. Beijing: China Academic of Information and Communications Technology, 2021. Chinese.

[ 9 ] ITpub Technology Stack. Thirty Years of Turbulence: The Development and Changes of Bank Databases [EB/OL]. (2021-04-02) [2022-06-06]. Chinese. link1

[10] The People’s Bank of China. The People’s Bank of China issued the Fintech Development Plan for 2019-2022 [EB/OL]. (2019-08-22) [2022-06-06]. Chinese. link1

[11] China Financial Standardization Technical Committee. Three Financial Industry Standards Including “Financial Application Specification of Distributed Database Technology Technical Architecture” were Officially Released [EB/OL]. (2020-12-25) [2022-06-06]. Chinese. link1

[12] Wang F P. Pursue Excellence and Forge Ahead——China CITIC Bank GoldenDB Distributed Database Transformation Practice [J]. Financial Computerizing, 2020, (02): 76-78. Chinese.

[13] Li Z N. Distributed database financial applications advance steadily and orderly [J]. Financial Computerizing, 2020(12): 34-35. Chinese.

[14] Dai G W. Build A "New Ecology" and Explore the Development of Distributed Databases in The Financial Industry [J]. Financial Computer of China, 2021, (07): 85-86. Chinese.

[15] Leutenegger S T and Dias D M. A modeling study of the TPC-C benchmark [C]. Washington: Proceedings of the 1993 ACM International Conference on Management of Data, 1993: 22-31.

[16] CCF Technical Committee on Database, Tsinghua University, modb.pro. Research on Classification and Evaluation of Database System [EB/OL]. (2021-12-22) [2022-06-06]. Chinese. link1

[17] Jin Z Q, Qian W N, Zhou M Q, et al. Benchmarking Data Management Systems: From Traditional Database to Emergent Big Data [J]. Chinese Journal of Computers, 2015, 38(01): 18-34. Chinese.

[18] Yan Y B, Zhu W Q, Yang T, et al. Overview on Benchmark Test of Big Data System [J]. Network New Media Technology, 2018, 7(03): 6-13. Chinese.

[19] Bitton D, DeWitt D J, Turbyfill C. Benchmarking database systems-A systematic approach [R]. University of Wisconsin-Madison Department of Computer Sciences, 1983.

[20] Xin R, Mokhtar M. Databricks Sets Official Data Warehousing Performance Record [EB/OL]. (2021-11-02) [2022-06-06]. link1

[21] Dageville B, Cruanes T. Industry Benchmarks and Competing with Integrity [EB/OL]. (2021-11-12) [2022-06-06]. link1

[22] Mokhtar M, Tavakoli-Shiraji A, Xin R, Zaharia M. Snowflake Claims Similar Price/Performance to Data-bricks, but Not So Fast! [EB/OL]. (2021-11-15) [2022-06-06]. link1

[23] Cao P, Gowda B, Lakshmi S, et al. From BigBench to TPCx-BB: Standardization of a Big Data Benchmark [C]. New Delhi: 8th TPC Technology Conference, 2016: 24-44.

[24] Hao Y, Qin X, Chen Y, et al. TS-Benchmark: A Benchmark for Time Series Databases [C/OL]. Chania: 37th IEEE International Conference on Data Engineering, 2021: 588-599. link1

[25] Murphy R C, Wheeler K B, Barrett B W, et al. Introducing the graph 500 [J]. Cray Users Group (CUG), 2010, 19: 45-74.

[26] Dreseler M, Boissier M, Rabl T, et al. Quantifying TPC-H choke points and their optimizations [J]. Proceedings of the VLDB Endowment, 2020, 13(8): 1206-1220.

[27] O’Neil P E, O’Neil E J, Chen X, et al. The star schema benchmark and augmented fact table indexing [C]. Lyon: First TPC Technology Conference, 2009: 237-252.

[28] Ghazal A, Rabl T, Hu M, et al. Bigbench: Towards an industry standard benchmark for big data analytics [C]. New York: Proceedings of the 2013 ACM International Conference on Management of Data, 2013: 1197-1208.

[29] Eichmann P, Zgraggen E, Binnig C, et al. IDEBench: A benchmark for interactive data exploration [C/OL]. Portland: Proceedings of the 2020 ACM International Conference on Management of Data, 2020: 1555-1569. link1

[30] Funke F, Kemper A, Krompass S, et al. Metrics for Measuring the Performance of the Mixed Workload CH-benCHmark [C]. Seattle: Third TPC Technology Conference, 2011: 10-30.

[31] Cooper B F, Silberstein A, Tam E, et al. Benchmarking cloud serving systems with YCSB [C]. Indianapolis: Proceedings of the 1st ACM Symposium on Cloud Computing, 2010: 143-154.

[32] Patil S, Polte M, Ren K, et al. YCSB++: benchmarking and performance debugging advanced features in scalable table stores [C]. Cascais: ACM Symposium on Cloud Computing in conjunction with SOSP 2011, 2011: 9.

[33] Chintapalli S, Dagit D, Evans B, et al. Benchmarking streaming computation engines: Storm, flink and spark streaming [C]. Chicago: 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016: 1789-1792.

[34] Angles R, Antal J B, Averbuch A, et al. The LDBC social network benchmark [J/OL]. CoRR, 2020, abs/2001.02299. link1

[35] Zhang C, Lu J H, Xu P F, et al. UniBench: A Benchmark for Multi-model Database Management Systems [C]. Riode Janeiro: 10th TPC Technology Conference, 2018: 7-23.

[36] Tian J F, Jiang C Y. Database Performance Evaluation Tool Under Financial Scenario [J]. Information and Communi-cations Technology and Policy, 2020, 46(4): 85-90. Chinese.

[37] Jiang C, Tian J, Ma P. Databench-T: A Transactional Database Benchmark for Financial Scenarios [C]. Shen-yang: 2021 IEEE 20th International Conference on Trust, Security and Privacy in Computing and Communications, 2021: 1418-1421.

[38] Liew S P, Takahashi T, Ueno M. PEARL: Data Synthesis via Private Embeddings and Adversarial Reconstruction Learning [C/OL]. Online: The Tenth International Conference on Learning Representations, 2022. link1

Related Research