Journal Home Online First Current Issue Archive For Authors Journal Information 中文版

Engineering >> 2022, Volume 17, Issue 10 doi: 10.1016/j.eng.2022.02.007

Data-Driven Discovery of Stochastic Differential Equations

a School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
b State Key Laboratory of Digital Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
c Department of Mechanical Engineering, University of Kansas, Lawrence, KS 66045, USA
d HUST-Wuxi Research Institute, Wuxi 214174, China
e Key Laboratory of Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China
f Department of Plant Sciences, University of Cambridge, Cambridge CB2 3EA, UK
g Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux 4367, Luxembourg
h Department of Physics, Humboldt University of Berlin, Berlin 12489, Germany
i Department of Complexity Science, Potsdam Institute for Climate Impact Research, Potsdam 14473, Germany

Received: 2021-10-13 Revised: 2022-02-06 Accepted: 2022-02-15 Available online: 2022-03-23

Next Previous

Abstract

Stochastic differential equations (SDEs) are mathematical models that are widely used to describe complex processes or phenomena perturbed by random noise from different sources. The identification of SDEs governing a system is often a challenge because of the inherent strong stochasticity of data and the complexity of the system’s dynamics. The practical utility of existing parametric approaches for identifying SDEs is usually limited by insufficient data resources. This study presents a novel framework for identifying SDEs by leveraging the sparse Bayesian learning (SBL) technique to search for a parsimonious, yet physically necessary representation from the space of candidate basis functions. More importantly, we use the analytical tractability of SBL to develop an efficient way to formulate the linear regression problem for the discovery of SDEs that requires considerably less time-series data. The effectiveness of the proposed framework is demonstrated using real data on stock and oil prices, bearing variation, and wind speed, as well as simulated data on well-known stochastic dynamical systems, including the generalized Wiener process and Langevin equation. This framework aims to assist specialists in extracting
stochastic mathematical models from random phenomena in the natural sciences, economics, and engineering fields for analysis, prediction, and decision making.

SupplementaryMaterials

Figures

Fig. 1

Fig. 2

Fig. 3

Fig. 4

Fig. 5

References

[ 1 ] Einstein A. Über die von der molekularkinetischen Theorie der Wärme geforderte Bewegung von in ruhenden Flüssigkeiten suspendierten Teilchen. Ann Phys 1905;322(8):549–60. German. link1

[ 2 ] Bose T, Trimper S. Stochastic model for tumor growth with immunization. Phys Rev E 2009;79:051903. link1

[ 3 ] Hull JC. Options, futures, and other derivatives. 9th ed. Boston: Pearson; 2015. link1

[ 4 ] Wilkinson DJ. Stochastic modelling for quantitative description of heterogeneous biological systems. Nat Rev Genet 2009;10(2):122–33. link1

[ 5 ] Chong KL, Shi JQ, Ding GY, Ding SS, Lu HY, Zhong JQ, et al. Vortices as Brownian particles in turbulent flows. Sci Adv 2020;6(34):eaaz1110. link1

[ 6 ] Rigas G, Morgans AS, Brackston RD, Morrison JF. Diffusive dynamics and stochastic models of turbulent axisymmetric wakes. J Fluid Mech 2015;778 (R2):1–10. link1

[ 7 ] Calif R. PDF models and synthetic model for the wind speed fluctuations based on the resolution of Langevin equation. Appl Energy 2012;99:173–82. link1

[ 8 ] Friedrich R, Siegert S, Peinke J, Lück St, Siefert M, Lindemann M, et al. Extracting model equations from experimental data. Phys Lett A 2000;271:217–22. link1

[ 9 ] Lamouroux D, Lehnertz K. Kernel-based regression of drift and diffusion coefficients of stochastic processes. Phys Lett A 2009;373:3507–12. link1

[10] Rajabzadeh Y, Rezaie AH, Amindavar H. A robust nonparametric framework for reconstruction of stochastic differential equation models. Phys A 2016;450:294–304. link1

[11] Papaspiliopoulos O, Pokern Y, Roberts GO, Stuart AM. Nonparametric estimation of diffusions: a differential equations approach. Biometrika 2012;99:511–31. link1

[12] Van der Meulen F, Schauer M, Van Zanten H. Reversible jump MCMC for nonparametric drift estimation for diffusion processes. Comput Stat Data Anal 2014;71:615–32. link1

[13] Batz P, Ruttor A, Opper M. Approximate Bayes learning of stochastic differential equations. Phys Rev E 2018;98:022109. link1

[14] Garcia CA, Otero A, Felix P, Jesus P, Marquez DG. Nonparametric estimation of stochastic differential equations with sparse Gaussian processes. Phys Rev E 2017;96:022104. link1

[15] Bandi FM, Phillips PCB. A simple approach to the parametric estimation of potentially nonstationary diffusions. J Econom 2007;137:354–95. link1

[16] Brunton SL, Proctor JL, Kutz JN. Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proc Natl Acad Sci USA 2016;113:3932–7. link1

[17] Boninsegna L, Nuske F, Clementi C. Sparse learning of stochastic dynamical equations. J Chem Phys 2018;148:241723.

[18] Callaham JL, Loiseau JC, Rigas G, Brunton SL. Nonlinear stochastic modeling with Langevin regression. Proc R Soc A Math Phys Eng Sci 2021;477:20210092. link1

[19] Tipping ME. Sparse Bayesian learning and the relevance vector machine. J Mach Learn Res 2001;1:211–44. link1

[20] Wipf DP, Rao BD. Sparse Bayesian learning for basis selection. IEEE Trans Signal Process 2004;52:2153–64. link1

[21] Pan W, Yuan Y, Goncalves J, Stan GB. A sparse Bayesian approach to the identification of nonlinear state–space systems. IEEE Trans Auto Control 2016;61:182–7. link1

[22] Yuan Y, Tang X, Zhou W, Pan W, Li X, Zhang HT, et al. Data driven discovery of cyber physical systems. Nat Commun 2019;10:1–9. link1

[23] Ping Z, Li X, He W, Yang T, Yuan Y. Sparse learning of network-reduced models for locating low frequency oscillations in power systems. Appl Energy 2020;262:114541. link1

[24] Zhou W, Ardakanian O, Zhang HT, Yuan Y. Bayesian learning-based harmonic state estimation in distribution systems with smart meter and DPMU data. IEEE Trans Smart Grid 2020;11:832–45. link1

[25] Yuan Y, Zhang H, Wu Y, Zhu T, Ding H. Bayesian learning-based modelpredictive vibration control for thin-walled workpiece machining processes. IEEE-ASME Trans Mechatron 2017;22:509–20. link1

[26] Mao X. Numerical solutions of stochastic differential delay equations under the generalized Khasminskii-type conditions. Appl Math Comput 2011;217:5512–24. link1

[27] Mao X. The truncated Euler–Maruyama method for stochastic differential equations. J Comput Appl Math 2015;290:370–84. link1

[28] Ghasemi F, Sahimi M, Peinke J, Friedrich R, Jafari GR, Tabar MRR. Markov analysis and Kramers–Moyal expansion of nonstationary stochastic processes with application to the fluctuations in the oil price. Phys Rev E 2007;75:060102. link1

[29] Langevin P. Sur la théorie du mouvement Brownien. C R Acad Sci 1908;146:530–3. French.

[30] Coffey W, Kalmykov YP. The Langevin equation: with applications to stochastic problems in physics, chemistry and electrical engineering. 3rd ed. Singapore: World Scientific; 2012. link1

[31] Shao H, Jiang H, Zhang X, Niu M. Rolling bearing fault diagnosis using an optimization deep belief network. Meas Sci Technol 2015;26:115002. link1

[32] Yuan Y, Ma G, Cheng C, Zhou B, Zhao H, Zhang HT, et al. A general end-toend diagnosis framework for manufacturing systems. Natl Sci Rev 2020;7:418–29. link1

[33] Cheng C, Ma G, Zhang Y, Sun M, Teng F, Ding H, et al. A deep learning-based remaining useful life prediction approach for bearings. IEEE-ASME Trans Mechatron 2020;25(3):1243–54. link1

[34] Safizadeh MS, Latifi SK. Using multi-sensor data fusion for vibration fault diagnosis of rolling element bearings by accelerometer and load cell. Inf Fusion 2014;18:1–8. link1

[35] Fama EF. Efficient capital markets: a review of theory and empirical work. J Financ 1970;25:383–417. link1

[36] Black F, Scholes M. The pricing of options and corporate liabilities. J Polit Econ 1973;81:637–54. link1

[37] Merton RC. Theory of rational option pricing. Bell J Econ Manag Sci 1973:141–83. link1

[38] Zárate-Miñano R, Anghel M, Milano F. Continuous wind speed models based on stochastic differential equations. Appl Energy 2013;104:42–9. link1

[39] Zárate-Miñano R, Milano F. Construction of SDE-based wind speed models with exponentially decaying autocorrelation. Renew Energy 2016;94:186–96. link1

[40] National Institute of Water and Atmospheric Research Limited. CliFlo: NIWA’s National Climate Database [Internet]. Auckland: National Institute of Water and Atmospheric Research Limited; [cited 2020 Dec 8]. Available from: http://cliflo. niwa.co.nz/.

[41] Kusiak A, Li W, Song Z. Dynamic control of wind turbines. Renew Energy 2010;35:456–63. link1

[42] Melício R, Mendes VMF, Catalão JPS. Transient analysis of variable-speed wind turbines at wind speed disturbances and a pitch control malfunction. Appl Energy 2011;88:1322–30. link1

[43] Chang Y, Wong JF. Oil price fluctuations and Singapore economy. Energ Policy 2003;31:1151–65. link1

[44] Lizardo RA, Mollick AV. Oil price fluctuations and US dollar exchange rates. Energy Econ 2010;32:399–408. link1

Related Research