aState Key Laboratory of Chemical Engineering & Shanghai Engineering Research Center of Hierarchical Structure Nanomaterials, School of Chemical Engineering, East China University of Science and Technology, Shanghai 200237, China
bSchool of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai 200237, China
cProduct Planning and New Auto Technologies Research Institute, BYD Auto Industry Company Limited, Shenzhen 518118, China
Reactive transport equations in porous media are critical in various scientific and engineering disciplines, but solving these equations can be computationally expensive when exploring different scenarios, such as varying porous structures and initial or boundary conditions. The deep operator network (DeepONet) has emerged as a popular deep learning framework for solving parametric partial differential equations. However, applying the DeepONet to porous media presents significant challenges due to its limited capability to extract representative features from intricate structures. To address this issue, we propose the Porous-DeepONet, a simple yet highly effective extension of the DeepONet framework that leverages convolutional neural networks (CNNs) to learn the solution operators of parametric reactive transport equations in porous media. By incorporating CNNs, we can effectively capture the intricate features of porous media, enabling accurate and efficient learning of the solution operators. We demonstrate the effectiveness of the Porous-DeepONet in accurately and rapidly learning the solution operators of parametric reactive transport equations with various boundary conditions, multiple phases, and multi-physical fields through five examples. This approach offers significant computational savings, potentially reducing the computation time by 50-1000 times compared with the finite-element method. Our work may provide a robust alternative for solving parametric reactive transport equations in porous media, paving the way for exploring complex phenomena in porous media.
Porous media play a pivotal role in a wide array of industrial sectors [1,2]-encompassing new energy [3], petrochemicals [4], photocatalysis [5], and biomedicine [6]-due to their intricate pore network and substantial specific surface area. The transport and reaction phenomena occurring within porous media are critical determinants influencing essential parameters such as energy storage efficiency, catalytic performance, and adsorption rates of industrial products [7⇓-9]. To ensure the rational design and optimization of industrial products, it is imperative to model these complex transport and reaction processes using partial differential equations (PDEs) [10]. In the realm of industrial research, a significant challenge faced by researchers is the rapid and accurate resolution of parametric PDEs [11⇓-13]. These specialized PDEs involve parameters such as coefficients, domain geometries, initial conditions (ICs), and boundary conditions (BCs) that vary within defined ranges [14]. However, traditional methodologies (e.g., the finite-element method, FEM) demand extensive computational resources, as independent simulations must be conducted for each parameter variation. The intricacies of grid partitioning and nonlinear numerical computations further intensify these challenges, particularly when addressing multi-physical field interactions within intricate porous structures. Consequently, there is a critical need for innovative approaches to expedite the resolution of parametric PDEs in porous media. By overcoming the limitations of traditional methods, such novel strategies have the potential to significantly enhance the iteration rate of industrial product development, thereby fostering advancements in industrial research and applications.
Using deep learning techniques based on neural networks (NNs) presents a promising avenue for addressing forward and inverse problems in numerical simulation technology, potentially revolutionizing the field [15,16]. As the universal approximation theorem indicates, NNs can solve equations by approximating nonlinear functions that satisfy given constraints. Various network architectures, including recurrent NNs (RNNs) [17], convolutional NNs (CNNs) [18], residual networks (ResNets) [19], physics-informed NNs (PINNs) [11], and derivative architectures [20,21], have been applied to solve PDEs. However, existing NNs are limited in their ability to adapt to changes in input parameters, such as alterations in domain geometries, and require retraining to accommodate new constraints. To address this limitation, NNs must learn operators rather than functions, as per a lesser-known but powerful theorem that demonstrates that NNs can approximate continuous nonlinear operators [22]. Building upon this concept, the deep operator network (DeepONet) was introduced by Lu et al. [23], designed to learn explicit and implicit operators-such as integrals, fractional Laplacians, and deterministic and stochastic differential equations-from data. The DeepONet has demonstrated successful applications in solving parametric PDEs in simple one-dimensional (1D) domain geometries across diverse fields, including quantum chemistry [24], hydromechanics [25,26], battery technology [27], power grid analysis [28], organism modeling [29], and contagion dynamics [30]. Furthermore, researchers have extended the DeepONet framework to address more complex parametric PDEs [31,32], deal with low-accuracy data [33,34], and even train NNs without data [11,35]. For example, Cai et al. [31] developed DeepM&Mnet to simulate multi-physics problems in the bulk phase by coupling multiple pretrained DeepONets, while Wang et al. [11] proposed a physics-informed DeepONet inspired by PINNs to alleviate data requirements. These advancements highlight the potential of the DeepONet and related frameworks in revolutionizing the efficient solution of parametric PDEs in the bulk phase across a wide spectrum of industrial and scientific domains.
Although the DeepONets mentioned above have demonstrated impressive capabilities, they encounter challenges when learning the solution operators of parametric PDEs in complex domain geometries such as porous media. These frameworks struggle to extract representative features from intricate porous structures, which limits their effectiveness in resolving parametric PDEs in this context. While some pioneering research [34,36⇓-38] has been conducted on applying DeepONets in complex domain geometries, the studied structures, such as triangles and five-pointed stars, still deviate from those of real porous media [34]. To overcome this challenge, we take inspiration from the powerful image-processing capabilities of CNNs [39]. For example, Cawte and Bazylak [40] successfully utilized a three-dimensional (3D) CNN to accurately predict gas diffusion layer material permeability from the image data of porous fibrous materials. Their success highlights the potential of CNNs for extracting useful features from intricate porous structures, offering promising avenues for solving parametric PDEs in complex domains such as porous media.
In this work, we investigate an innovative extension of the DeepONet framework called the Porous-DeepONet, which is specifically tailored to solve parametric PDEs in porous media by incorporating CNNs into the DeepONet architecture, with a focus on reactive transport equations. First, the efficacy of the Porous-DeepONet is demonstrated through its application in learning solution operators within complex and random porous structures, utilizing the Fick diffusion equation as a representative example. Then, we further illustrate the power of the Porous-DeepONet by changing the type of BCs on the surface of porous media, such as whether there is a reaction, whether there is a boundary layer, and whether it is permeable. The studied equations include the Fick diffusion and surface reaction equation, the advection equation, and the heat conduction equation. Furthermore, to enhance the versatility of the Porous-DeepONet in addressing coupled equations from multi-physical fields, a combination of the Porous-DeepONet and the DeepM&Mnet is employed to predict the charging process in porous electrodes by solving the Poisson-Nernst-Planck (PNP) equations. To assess the generalizability of the Porous-DeepONet in identifying solutions for non-real porous structures, a simplified porous model is utilized to generate data for validation purposes. In addition, a comparison is made between the solution speed of the Porous-DeepONet and that of the FEM, shedding light on the computational efficiency of the proposed approach. Lastly, the feasibility of employing a physics-informed Porous-DeepONet is discussed, indicating the potential for leveraging domain knowledge to further enhance the model’s performance.
2. Methodologies
In the subsequent sections, we first introduce the DeepONet, which is based on the generalized universal approximation theorem for the operator. Subsequently, we discuss an extension of the DeepONet called the Porous-DeepONet, which is specifically designed to learn parametric PDEs’ solution operators in porous media. Moreover, we outline the approaches for coupling the Porous-DeepONet with other DeepONet frameworks. Finally, we describe the data-generation process, including the generation of real porous structures and simplified porous models, as well as the numerical details of the FEM.
2.1. DeepONets applied to porous media: The Porous-DeepONet
2.1.1. Generalized universal approximation theorem for the operator
Suppose that(U, V, S)is a triplet of Banach spaces and is a linear or nonlinear differential operator. Assume that the input functions are expressed as and the solutions of the parametric PDE as , respectively. Therefore, the general form of parametric PDEs can be denoted as . For particular ICs and BCs, we consider that there exists a unique solution to . Then, the solution operator can be defined as . According to the generalized universal approximation theorem [23], a nonlinear continuous operator satisfies the following inequality for any positive number :
where is the absolute value; is the dot product in ; and are continuous vector functions; , are dimensional vectors, and refers to the points in the domain of . However, this theorem only informs us whether the operator can be learned and does not indicate how to learn efficiently.
2.1.2. The DeepONet
The DeepONet provides a flexible and powerful deep learning framework that encapsulates the generalized universal approximation theorem for operators above [11,23]. A DeepONet comprises two sub-networks: a trunk net and a branch net. The trunk net is responsible for extracting the latent representation of space-time locations , where is time, and the and are space location, while the branch net takes as input the functions in the form of discretized points. The outputs of the trunk net and branch net are and , respectively. More specifically, the solution operators of a function evaluated at can be expressed by
To reduce the generalization error of the DeepONet, a bias is added in the last stage: . It is worth noting that there are no constraints on the locations of and , which can be chosen randomly or at equal distances. For random sampling, adopting a fully connected NN (FNN) structure is appropriate for the trunk net and branch net [23].
2.1.3. The Porous-DeepONet
To learn the solution operators of parametric PDEs in porous media, the various structures of porous media should be inputted into the branch network as the function . These structures can be considered to be image-like data with equal-spaced grids. However, due to the complexity and randomness of these structures, the FNNs commonly used in DeepONets [25,27,30] may not be able to encode the local information of porous structures well. Therefore, other NN architectures may be considered. CNNs have powerful image-processing capabilities and have been successfully applied to predict the transport property of porous media and reconstruct porous structures [41]. Inspired by the DeepONet and CNNs, we propose a framework for solving parametric PDEs in porous media called the Porous-DeepONet, depicted in Fig. 1(a). By inputting porous structure and space-time coordinates , we obtain the output in the domain of the corresponding solution function . For example, if we aim to solve the Fick diffusion equation in different porous structures, the inputs and outputs of the Porous-DeepONet are the images of the porous structures, space-time coordinates, and the corresponding concentration. In addition, other parameters (e.g., the diffusion coefficient) can be inputted into the Porous-DeepONet by adding one more branch net [37]. Although a few works [29,42] have introduced CNNs into DeepONets, this DeepONet based on CNNs has not been applied to solve parametric PDEs in porous media. For example, Kontolati et al. [42] solved the 2D time-dependent Brusselator reaction-diffusion system with uncertainty sources in the bulk phase using a DeepONet based on CNNs to model an autocatalytic chemical reaction between two species. The function of the CNN in that case was to extract representative features of spatially dependent uncertainty sources, not porous media.
When training, suppose that we sample porous structures from function space and other parameters ; in addition, for each input and , we collect points (i.e., ) to evaluate the solution function. Therefore, the total number of training samples should be . Notably, the Porous-DeepONet is trained by adopting the Adamax and Huber loss as an optimizer and loss function, respectively:
where is a hyper-parameter, is the true solution operator, and is the true solution. Huber loss combines the excellent performance of both mean squared error (MSE) loss and mean absolute error (MAE) loss to achieve fast convergence speed and good robustness [43]. The average value of the absolute value of absolute error is selected as the performance metric for the test dataset:
where is the result predicted by the Porous-DeepONet and is the real result calculated by the FEM. The Porous-DeepONet framework was implemented using the Python package PyTorch. The Porous-DeepONet training and the results prediction were performed using an NVIDIA GeForce RTX 3090 graphic processing unit (GPU) and an Intel Xeon Platinum 9242 central processing unit (CPU), respectively. These hardware resources accelerated the computational processes involved in the training and inference tasks.
2.2. Coupling the Porous-DeepONet with other frameworks
2.2.1. The Porous-DeepM&Mnet
While the Porous-DeepONet excels in learning solution operators for individual physical fields, the coupling between these fields can sometimes impact prediction accuracy. The integration of the Porous-DeepONet with the DeepM&Mnet, as proposed by Cai et al. [31], offers a powerful framework for accurately solving multi-physics PDEs in porous media. The DeepM&Mnet addresses this challenge by coupling multiple pretrained DeepONets to infer elec-troconvection multi-physical fields effectively. In the combined framework, depicted in Fig. 1(b) [31], we leverage both the Porous-DeepONet and the DeepM&Mnet to handle the interactions between different physical fields (e.g., , and ) in porous media. To account for the influence of and on , these fields are treated as input parameters in the prediction process, namely . The same approach is applied to and . One limitation arises when using a pretrained Porous-DeepONet for , as it requires the availability of and , which may not always be feasible in practice. To address this issue, the Porous-DeepM&Mnet introduces an NN to approximate , and simultaneously, while leveraging the pretrained Porous-DeepONets as constraints during the solution process. Finally, calculate the difference between the output of these pretrained Porous-DeepONets (e.g., , and ) and the true value as a loss for training. This hybrid approach ensures accurate predictions while incorporating the knowledge learned from the pre-trained models. For more in-depth information on the DeepM&Mnet framework and its applications, readers may refer to the original literature by Cai et al. [31].
2.2.2. The Porous-physics-informed (PI)-DeepONet
When dealing with complex porous media, the number of samples required to represent the function space in a DeepONet can be substantial, leading to increased memory requirements for training. To address this issue and reduce the demand for data, Wang et al. [11] introduced a PI-DeepONet framework, as depicted in Fig. 1(c). The PI-DeepONet incorporates the principles of physics-based constraints into the training process, enabling the network to learn the underlying physical laws and reducing the reliance on a large volume of data. In the PI-DeepONet framework, the residuals of the governing equation and BCs and ICs are formulated as regularized losses or constraints using automatic differentiation techniques. This approach not only facilitates the learning of the underlying physics but also mitigates the demand for an extensive amount of data, making it particularly well-suited for scenarios involving complex porous media, where data availability may be limited. Readers interested in delving deeper into the details of the PI-DeepONet framework and its implementation are referred to the original literature by Wang et al. [11] for a comprehensive understanding of this innovative approach.
2.3. Data generation
2.3.1. Generation and simplification of porous structures
To demonstrate the effectiveness of the Porous-DeepONet in handling complex porous media structures, we selected amorphous porous structures as the model system. While experimental characterization techniques such as computed tomography (CT) and scanning electron microscopy (SEM) can provide nondestructive ways of obtaining such structures [13], the high cost of these methods made generating random structures a more feasible option. To this end, we generated 5000 disordered structures of varying porosity, morphology, and random seed using the Python package PoreSpy [44], as shown in Figs. 2(a)-(d). The setups used to generate these structures are listed in Table S1 in Appendix A. In addition to these randomly generated structures, we considered a simplified porous model called the stacked electrodes model [9,45], which represents the structure of disordered porous electrodes and can be modified by changing the number and position of the plates. Fig. 2(e) shows an example of this model. By considering both random and simplified structures, we demonstrated how the Porous-DeepONet can effectively handle a wide range of porous media structures.
2.3.2. Numerical details of the FEM
The governing equations and their associated BCs and ICs were solved using the FEM implemented in COMSOL Multiphysics 5.3. The direct solver employed for the computations was the parallel direct sparse matrix solver (PARDISO), while the fully coupled solver utilized an automatic Newton solver. A relative tolerance factor of was set for the solvers. In the grid generation process, conventionally sized cells were used to discretize the porous structures, as illustrated in Fig. S1 in Appendix A. The grid scheme for the stacked electrodes model followed the approach is detailed in the literature [45]. The FEM computations were carried out on a server equipped with an Intel Xeon Platinum 9242 CPU.
3. Results and discussion
As described below, we conducted a series of comprehensive numerical studies to demonstrate the effectiveness of the Porous-DeepONet in solving various types of parametric PDEs related to reactive transport processes. Table 1 summarizes the different examples considered, while the associated training details can be found in Section S1 and Tables S2 and S3 in Appendix A. By evaluating their performance across these diverse examples, we demonstrated the broad applicability of Porous-DeepONets in handling complex PDEs related to porous media structures. This comprehensive approach underscores the potential of the Porous-DeepONet as a powerful tool for advancing our understanding of porous media behavior and improving the design and optimization of related technologies.
3.1. The Fick diffusion equation
The first example in this study focuses on showcasing the capability of the Porous-DeepONet to handle the complex structures of porous media. We considered the transient-state Fick diffusion equation within various porous structures in this context. The Fick diffusion equation is commonly used to model the diffusion of molecules, ions, and other substances in porous media such as zeolites, membranes, and cells. Fig. 2(a) illustrates a typical application scenario of the Fick diffusion equation in porous media, such as seawater desalination. In the figure, the white regions represent pores, while the black regions represent solid material. The inlet concentration and outlet concentration remain constant over time. In addition, the surface of the porous medium, top boundary, and bottom boundary are impermeable. As time progresses, substances within the porous medium diffuse from regions of high concentration to regions of low concentration until equilibrium is reached. The corresponding transient-state Fick diffusion equation for this scenario can be expressed as follows:
with the following ICs and BCs:
where represents the concentration of the diffusing substance; is the diffusion coefficient; is the Hamiltonian operator; is the initial concentration; represents the unit normal vector of the surface; denotes the surface molar flux and is calculated as and and represent the top and bottom molar flux, respectively. For the specific values of these parameters, please refer to Table S1.
We use the Porous-DeepONet to map the solution operator from a porous structure to the full spatiotemporal solution of the Fick diffusion equation. The comparison between the predicted solution by the Porous-DeepONet and the exact solution by the FEM for a random input porous structure in the test dataset shows excellent agreement from the initial state (i.e., ) to the equilibrium state (i.e., ), as shown in Fig. 3. The prediction errors are mainly concentrated where the concentration gradient is steep, which aligns with observations by Wang et al. [11]. More visualizations for different input porous structures are provided in Fig. S9 in Appendix A. In addition, the distributions of the absolute error in the training dataset and test dataset indicate that most of the values of absolute errors are within a certain range of -25 to , and the resulting test error is 6.6 , as shown in Fig. S10 in Appendix A. Based on these observations, it can be concluded that Porous-DeepONets are an effective extension to enhance and generalize the capabilities of DeepONets in solving parametric PDEs in porous media.
3.2. The Fick diffusion and surface reaction equation
The next example we provided involves a steady-state Fick diffusion and surface reaction equation with various reaction rates, where the surface morphology influences the surface reaction and is more intricate than a homogeneous reaction. Fig. 2(b) illustrates the diffusion of material in a porous medium and its reaction on the surface-a crucial phenomenon in catalysts and adsorbents. In this scenario, we assume that the reaction rate follows a power exponential function form. It is important to note that, to capture the diffusion-reaction process more realistically, the outlet concentration is not fixed. The corresponding equation can be expressed as follows:
with the following BCs:
where represents the reaction rate constant, and is the reaction order. By adjusting the values of and , and the structures of the catalyst or adsorbent, Eqs. (8) and (9) can effectively model the diffusion and reaction phenomena in a wide range of catalysts or adsorbents. The specific values and permissible ranges for these parameters are detailed in Table S1.
The results obtained from applying the Porous-DeepONet to learn the solution operator mapping the porous structure and the reaction parameters and to the solution are presented in Fig. 4. The test dataset includes four representative input and , showcasing a range of reaction depths from high to low, resulting in a gradual increase in the outlet concentration. The comparison between the Porous-DeepONet predictions and the ground truth demonstrates a good agreement across different scenarios, capturing the transition from large reaction rates and complex structures to small reaction rates and simpler structures. However, some inaccuracies can be observed in regions where the concentration exhibits significant gradients, similar to the example of the Fick diffusion discussed earlier.
To evaluate the performance of the Porous-DeepONet model, the distributions of the absolute error in the training and testing datasets are illustrated in Fig. S11(a) in Appendix A. The test error is reported to be , indicating that the Porous-DeepONet can serve as an effective surrogate for PDEs with complex and variable flux BCs. This finding highlights the capability of the Porous-DeepONet to accurately capture the behavior of the system under diverse and challenging conditions, demonstrating its potential for practical applications in modeling complex porous media with varying reaction parameters and BCs.
3.3. Advection equation
To further demonstrate the ability of the Porous-DeepONet to handle boundary layers, the advection equation is considered, as shown in Fig. 2(c). In this scenario, the inlet velocity differs across different structures, and the outlet pressure is set to atmospheric pressure. The surface of the porous media is subjected to a no-slip BC. Moreover, to accurately simulate the change in velocity at the entrance, it is assumed that the porous medium is connected to the bulk phase. The corresponding advection equation is as follows:
with the following BCs:
where is the velocity, is the magnitude of the velocity, is the mass density, is the pressure, is a unit matrix, is the viscous resistance, and is the viscosity. Eqs. (10)-(13) likely capture the complex phenomena associated with mass transport in these systems. These types of equations are crucial for understanding and optimizing the performance of fuel cells and reaction towers. The values and ranges of the parameters involved in these equations are listed in Table S1, providing important information for characterizing the system’s behavior and potentially enabling further analysis and modeling.
The Porous-DeepONet is employed to predict the magnitude of the velocity by inputting porous structure and the magnitude of the inlet velocity . As depicted in Fig. 5, the results demonstrate that the trained Porous-DeepONet exhibits an overall excellent agreement with the reference PDE solution, even though the velocity is not zero at certain surface parts. The distributions of the absolute error further indicate that the test error amounts to , as illustrated in Fig. S11(b) in Appendix A. This finding underscores the capability of the Porous-DeepONet to effectively solve parametric PDEs with BCs related not only to flux but also to variable parameters such as .
3.4. Heat conduction equation
This example aims to explore the potential of the Porous-DeepONet in solving multiphase parametric PDEs, as illustrated in Fig. 2(d). In contrast to the previous examples, let us consider a scenario in which heat can be transferred between two phases. More specifically, we assume that there is no thermal resistance between the two phases in porous media when the inlet temperature is higher than the outlet temperature. To clearly differentiate between the two phases, we designate phase 1 as air with a low
thermal conductivity and phase 2 as metal with a high thermal conductivity. The corresponding equation is provided below:
with the following BCs:
where is the heat conductivity, is temperature, and is the heat flux, given by . The specific values and permissible ranges of the parameters in Eqs. (14)-(16) are provided in Table S1.
Fig. 6 demonstrates that the trained Porous-DeepONet model can predict the solution of the heat conduction equation for any given heat conductivity of phase and porous structure in the test dataset. The average of the absolute value of the absolute error is found to be , as detailed in Fig. S11(c) in Appendix A. Interestingly, unlike the single-phase examples discussed earlier, the maximum error does not occur where the gradient is the largest but rather occurs on the surface. This phenomenon suggests that, when learning the solution of parametric PDEs in porous structures, special attention needs to be given to regions with steep gradients and the surface of porous structures. These findings highlight the importance of accurately modeling and capturing the behavior of the surface and regions with significant gradients when using Porous-DeepONets to solve PDEs in porous structures.
3.5. The PNP equations
The last example aims to solve multi-physics parametric PDEs in porous media using the Porous-DeepM&Mnet. In addition, the possibility of combining the Porous-DeepM&Mnet with other porous models is explored by replacing real porous structures with the porous model. Considering the representativeness of the model and the complexity of the PDEs, we chose the stack electrode model [9,45] and the PNP equations to investigate the charging process in porous electrodes. The simplification process of real porous electrodes is shown in Fig. 2(e). The simplified porous structure can be regarded as a 1D image, which is easily inputted into the Porous-DeepONet and Porous-DeepM&Mnet. After the potential is applied to the cathode and anode, the anions and cations diffuse from the center of the two plates and the bulk phase to the vicinity of the plate. They then adsorb onto the plate to form electric double layers (EDLs). The following PDEs describe this phenomenon:
with the following ICs and BCs:
where represents the cation and anion, respectively; while denotes the valence of the ion. In addition, represents the potential, corresponds to the space charge concentration (given by ), is the ideal gas constant and represents the Faraday constant. and denote the vacuum and relative dielectric constant, respectively. It is important to note that varying the number and position of plates generates different porous structures. Furthermore, and are variables used to represent various combinations of anions and cations. More detailed information is given in Table S1.
Fig. 7 presents the results of , and predicted by the Porous-DeepM&Mnet, which is based on the Porous-DeepONet. The displayed results are taken from a randomly selected entry in the test dataset (further visualizations can be found in Fig. S12 in Appendix A). These results demonstrate that the Porous-DeepM&Mnet effectively captures the details of the EDLs adsorbed on the electrode plate. The test errors for , and are , and , respectively, as can be easily observed in Figs. S11(d)-(f) in Appendix A. Therefore, the Porous-DeepONet can seamlessly integrate with other Deep-ONet frameworks to solve more complex parametric PDEs.
3.6. Comparison of the computation time between the Porous- DeepONet and FEM
To demonstrate the high solving efficiency of the Porous-DeepONet, we measured the time required for the the Porous-DeepONet and the FEM to solve the same problem on an Intel Xeon Platinum 9242 CPU with a single CPU core, as shown in Fig. 8. The time cost of the trained Porous-DeepONet includes loading the trained model and inferring the results, whereas the time cost of the FEM encompasses importing the porous structure, meshing, assembling the matrix, and solving. It was found that the Porous-DeepONet was up to three orders of magnitude faster than the conventional FEM in solving the same problem. When the Porous-DeepM&Mnet was based on the Porous-DeepONet, the solution speed improved by about 50 times when solving the PNP equations. It should be noted that the first and last examples took significantly more time than the other examples because they consider time , which requires a greater number of inferences (Table 1). To reduce the inference time, the Porous-DeepONet can be used to infer results in parallel on a CPU or GPU.
4. Conclusions
This work introduces the Porous-DeepONet, a deep-learning framework designed for learning the solution operators of parametric PDEs in porous media, with a focus on reactive transport equations. We employed CNNs to extend the DeepONet, resulting in a simple yet highly effective approach capable of handling complex domain geometries. This extension enables significant improvements in solving efficiency compared with conventional FEM methods. To evaluate the accuracy and applicability of the Porous-DeepONet, we solved various reactive transport equations, including the Fick diffusion equation, the Fick diffusion and surface reaction equation, the advection equation, and the heat conduction equation. The results demonstrated that the Porous-DeepONet can efficiently solve single-phase and multiphase parametric PDEs with complex BCs, achieving up to three orders of magnitude faster computation times than the conventional FEM. We also combined the Porous-DeepONet with the DeepM&Mnet to tackle the challenge of solving multi-physical field coupled PNP equations, leading to a substantial reduction in computation time by a factor of 50.
In addition to the examples mentioned above, the Porous-DeepONet has a wide range of applications in various scientific and engineering fields involving porous media, including quantum chemistry and social sciences. Despite the promising initial results, several technical challenges require further investigation. Firstly, to address inaccuracies in step gradient regions and near surfaces, integrating a self-attention mechanism or utilizing data-augmentation techniques in the Porous-DeepONet could be beneficial. Secondly, the requirement of numerous paired input-output observations for the Porous-DeepONet may be costly. An alternative approach could involve transitioning the Porous-DeepONet from being data-driven to being driven by physical information, as demonstrated by the PI-DeepONet [11]. However, due to the complexity of BCs in porous media and challenges with convergence in the loss function of PINNs, exploring the Porous-PI-DeepONet remains a valuable research direction. Thirdly, the Porous-DeepONet is currently limited to solving parametric PDEs in fixed-size porous structures. To enhance its adaptability to more complex input porous structures (e.g., random-size SEM images post-segmentation from databases), advanced architectures such as fully CNNs need to be integrated into the Porous-DeepONet. Lastly, due to limited video memory size and training time constraints, the Porous-DeepONet encounters limitations when handling super-large 3D structures with an RTX 3090 graphics card. Drawing inspiration from examples like the PNP equations, simplified porous models such as the stacked electrodes model [45] and pore network model [46] can serve as alternative inputs to the Porous-DeepONet. Addressing these technical challenges not only has the potential to improve the performance of the Porous-DeepONet but also positions it as a viable alternative to the FEM for solving PDEs in diverse structures.
Acknowledgments
This work was supported by the National Key Research and Development Program of China (2022YFA1503501), the National Natural Science Foundation of China (22378112, 22278127, and 22078088), the Fundamental Research Funds for the Central Universities (2022ZFJH004), and the Shanghai Rising-Star Program (21QA1401900).
Compliance with ethics guidelines
Pan Huang, Yifei Leng, Cheng Lian, and Honglai Liu declare that they have no conflict of interest or financial conflicts to disclose.
G.Cai, P.Yan, L.Zhang, H.C.Zhou, H.L.Jiang. Metal-organic framework-based hierarchically porous materials: synthesis and applications. Chem Rev, 121 (20) (2021), pp. 12278-12326.
[2]
M.H.Sun, S.Z.Huang, L.H.Chen, Y.Li, X.Y.Yang, Z.Y.Yuan, et al. Applications of hierarchically structured porous materials from energy storage and conversion, catalysis, photocatalysis, adsorption, separation, and sensing to biomedicine. Chem Soc Rev, 45 (12) (2016), pp. 3479-3563.
[3]
J.Zhou, B.Wang. Emerging crystalline porous materials as a multifunctional platform for electrochemical energy storage. Chem Soc Rev, 46 (22) (2017), pp. 6927-6945.
[4]
C.Perego, R.Millini. Porous materials in catalysis: challenges for mesoporous materials. Chem Soc Rev, 42 (9) (2013), pp. 3956-3976.
[5]
X.Wang, L.Chen, S.Y.Chong, M.A.Little, Y.Wu, W.H.Zhu, et al. Sulfone-containing covalent organic frameworks for photocatalytic hydrogen evolution from water. Nat Chem, 10 (12) (2018), pp. 1180-1189.
[6]
H.Yuk, T.Zhang, S.Lin, G.A.Parada, X.Zhao. Tough bonding of hydrogels to diverse non-porous surfaces. Nat Mater, 15 (2) (2016), pp. 190-196.
[7]
H.Pan, Y.Shao, P.Yan, Y.Cheng, K.S.Han, Z.Nie, et al. Reversible aqueous zinc/manganese oxide energy storage from conversion reactions. Nat Energy, 1 (5) (2016), p. 16039.
[8]
S.Tian, B.Wang, W.Gong, Z.He, Q.Xu, W.Chen, et al. Dual-atom Pt heterogeneous catalyst with excellent catalytic performances for the selective hydrogenation and epoxidation. Nat Commun, 12 (1) (2021), p. 3181.
[9]
C.Lian, M.Janssen, H.Liu, R. vanRoij. Blessing and curse: how a supercapacitor’s large capacitance causes its slow charging. Phys Rev Lett, 124 (7) (2020), Article 076001.
[10]
H.Tao, C.Lian, H.Liu. Multiscale modeling of electrolytes in porous electrode: from equilibrium structure to non-equilibrium transport. Green Energy Environ, 5 (3) (2020), pp. 303-321.
[11]
S.Wang, H.Wang, P.Perdikaris. Learning the solution operator of parametric partial differential equations with physics-informed DeepONets. Sci Adv, 7 (40) (2021), Article eabi8605.
[12]
L.Lu, X.Meng, Z.Mao, G.E.Karniadakis. DeepXDE: a deep learning library for solving differential equations. SIAM Rev, 63 (1) (2021), pp. 208-228.
[13]
R.Bostanabad, Y.Zhang, X.Li, T.Kearney, L.C.Brinson, D.W.Apley, et al. Computational microstructure characterization and reconstruction: review of the state-of-the-art techniques. Prog Mater Sci, 95 (2018), pp. 1-41.
[14]
A.Quarteroni, G.Rozza, A.Manzoni. Certified reduced basis approximation for parametrized partial differential equations and applications. J Math Ind, 1 (1) (2011), p. 3.
[15]
R.B.Yunus, S.A. AbdulKarim, A.Shafie, M.Izzatullah, A.Kherd, M.K.Hasan, et al. An overview on deep learning techniques in solving partial differential equations. S.A. Abdul Karim (Ed.), Intelligent systems modeling and simulation II:machine learning, neural networks, efficient numerical algorithm and statistical methods, Springer International Publishing, Cham (2022), pp. 37-47.
[16]
G.E.Karniadakis, I.G.Kevrekidis, L.Lu, P.Perdikaris, S.Wang, L. Yang. Physics-informed machine learning. Nat Rev Phys, 3 (6) (2021), pp. 422-440.
C.Beck, M.Hutzenthaler, A.Jentzen, B.Kuckuck. An overview on deep learning-based approximation methods for partial differential equations. Discrete Contin Dyn Syst B, 28 (6) (2023), pp. 3697-3746.
[22]
T.Chen, H.Chen. Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems. IEEE Trans Neural Netw, 6 (4) (1995), pp. 911-917.
[23]
L.Lu, P.Jin, G.Pang, Z.Zhang, G.E.Karniadakis. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nat Mach Intell, 3 (3) (2021), pp. 218-229.
[24]
M. SharmaPriyadarshini, S.Venturi, I.Zanardi, M.Panesi. Efficient quasi-classical trajectory calculations by means of neural operator architectures. Phys Chem Chem Phys, 25 (20) (2023), pp. 13902-13912.
[25]
C.Lin, M.Maxey, Z.Li, G.E.Karniadakis. A seamless multiscale operator neural network for inferring bubble dynamics. J Fluid Mech, 929 (2021), p. A18.
Q.Zheng, X.Yin, D.Zhang. Inferring electrochemical performance and parameters of Li-ion batteries based on deep operator networks. J Energy Storage, 65 (2023), Article 107176.
[28]
C.Moya, S.Zhang, G.Lin, M.Yue. DeepONet-grid-UQ: a trustworthy deep operator framework for predicting the power grid’s post-fault trajectories. Neurocomputing, 535 (2023), pp. 166-182.
[29]
M.Yin, E.Ban, B.V.Rego, E.Zhang, C.Cavinato, J.D.Humphrey, et al. Simulating progressive intramural damage leading to aortic dissection using DeepONet: an operator-regression neural network. J R Soc Interface, 19 (187) (2022), Article 20210670.
[30]
E.Pickering, S.Guth, G.E.Karniadakis, T.P.Sapsis. Discovering and forecasting extreme events via active learning in neural operators. Nat Comput Sci, 2 (12) (2022), pp. 823-833.
[31]
S.Cai, Z.Wang, L.Lu, T.A.Zaki, G.E.Karniadakis. DeepM&Mnet: inferring the electroconvection multiphysics fields based on operator approximation by neural networks. J Comput Phys, 436 (2021), Article 110296.
[32]
M.Yin, E.Zhang, Y.Yu, G.E.Karniadakis. Interfacing finite elements with deep neural operators for fast multiscale modeling of mechanics problems. Comput Methods Appl Mech Eng, 402 (2022), Article 115027.
[33]
L.Lu, R.Pestourie, S.G.Johnson, G.Romano. Multifidelity deep neural operators for efficient learning of partial differential equations with application to fast inverse design of nanoscale heat transport. Phys Rev Res, 4 (2) (2022), Article 023210.
[34]
L.Lu, X.Meng, S.Cai, Z.Mao, S.Goswami, Z.Zhang, et al. A comprehensive and fair comparison of two neural operators (with practical extensions) based on FAIR data. Comput Methods Appl Mech Eng, 393 (2022), Article 114778.
[35]
W.Li, M.Z.Bazant, J.Zhu. Phase-field DeepONet: physics-informed deep operator neural network for fast simulations of pattern formation governed by gradient flows of free-energy functionals. Comput Methods Appl Mech Eng, 416 (2023), Article 116299.
[36]
S.Goswami, K.Kontolati, M.D.Shields, G.E.Karniadakis. Deep transfer operator learning for partial differential equations under conditional shift. Nat Mach Intell, 4 (12) (2022), pp. 1155-1164.
[37]
S.Goswami, A.Bora, Y.Yu, G.E.Karniadakis. Physics-informeddeep neural operator networks. T.Rabczuk, K.J.Bathe (Eds.), Machine learning in modeling and simulation: methods and applications, Springer, Cham (2023), pp. 219-254.
[38]
J.Sun, J.Li, Y.Hao, C.Qi, C.Ma, H.Sun, et al. Boundary-to-solution mapping for groundwater flows in a Toth basin. Adv Water Resour, 176 (2023), Article 104448.
[39]
K.M.Jablonka, D.Ongari, S.M.Moosavi, B.Smit. Big-data science in porous materials: materials genomics and machine learning. Chem Rev, 120 (16) (2020), pp. 8066-8129.
[40]
T.Cawte, A.Bazylak. A 3D convolutional neural network accurately predicts the permeability of gas diffusion layer materials directly from image data. Curr Opin Electrochem, 35 (2022), Article 101101.
[41]
Y.Wang, C.H.Arns, S.S.Rahman, J.Y.Arns. Porous structure reconstruction using convolutional neural networks. Math Geosci, 50 (7) (2018), pp. 781-799.
[42]
K.Kontolati, S.Goswami, M.D.Shields, G.E.Karniadakis. On the influence of over-parameterization in manifold based surrogates and deep neural operators. J Comput Phys, 479 (2023), Article 112008.
[43]
Q.Sun, W.X.Zhou, J. Fan. Adaptive huberregression. J Am Stat Assoc, 115 (529) (2020), pp. 254-265.
[44]
J.T.Gostick, Z.A.Khan, T.G.Tranter, M.D.R.Kok, M.Agnaou, M.Sadeghi, et al. PoreSpy: a Python toolkit for quantitative analysis of porous media images. J Open Source Softw, 4 (37) (2019), p. 1296.
[45]
P.Huang, H.Tao, J.Yang, C.Lian, H.Liu. Four stages of thermal effect coupled with ion-charge transports during the charging process of porous electrodes. AIChE J, 68 (10) (2022), Article e17790.
Note: Please be aware that the following content is generated by artificial intelligence. This website is not responsible for any consequences arising from the use of this content.