6
Reverse Design of Heat Exchange Systems Using Physics-Informed Machine Learning

Chang He¹,² and Yunquan Chen³

¹Sun Yat-Sen University, School of Chemical Engineering and Technology, No. 135, Xingang Xi Road, Zhuhai, Guangdong 519082, China

²The Key Laboratory of Low-carbon Chemistry & Energy Conservation of Guangdong Province, Guangdong Engineering Center for Petrochemical Energy Conservation, 132 Waihuan East Road, University City, Panyu District, Guangzhou 510275, China

³Sun Yat-Sen University, School of Materials Science and Engineering, No. 135, Xingang Xi Road, Guangzhou 510275, China

6.1 Introduction

Designing high-performance and low-cost heat exchange systems for energy savings and emissions reduction has been a long-standing challenge in electronic devices, power plants, and petrochemical processes [1–3]. To address this challenge, traditional practices typically rely on a trial-and-error or empirical procedure [4–6], in which successive candidate solutions are generated through space-filling experimental designs over the combined space of the inputs (geometric, operating, and uncertain variables; Figure 6.1) until the best-performing one is found by evaluating all objectives {Obj₁, …, Obj_k, …, Obj_K}. For each candidate, accurately capturing the multiphase fluid flow and the interfacial heat and mass transfer often requires a high-fidelity numerical model that solves the first-principles governing equations via mesh-based methods [7–9] (e.g. the finite volume method and the finite element method) with the aid of computational fluid dynamics (CFD) tools. In practice, however, numerical modeling is time-consuming, especially for varying-geometry design problems that require tedious preprocessing and calibration procedures such as mesh regeneration or calibration of initial and boundary conditions [10]. The resulting computational burden is a major bottleneck that slows down the research and development of cutting-edge heat exchange designs.

In contrast to intuition-based approaches, inverse design starts directly from the targeted functionality and sets it as an objective function to be optimized via partial differential equation (PDE)-constrained shape or topology optimization. When the geometry is represented by a parameterized function of discrete geometric components, inverse design is known as shape optimization. When the geometry is instead parameterized by a density function or level set, so that the connectivity, structure, and layout are arbitrary, inverse design is known as topology optimization [11, 12]. In both cases, PDE-constrained inverse design can be performed via gradient-based optimization algorithms, where adjoint methods supply the gradient of the objective function with respect to all involved parameters. These methods are commonly based on numerical PDE solvers built into commercial CFD software, which tracks the interactions and spatial movement of each particle characterized by computationally expensive PDEs [13]. Moreover, data interface and transfer issues lead to inefficient integration of numerical models with equation-oriented optimization frameworks, since a large number of calls to the PDE solver have to be executed before converging to the optimum. The time and memory costs further increase exponentially, which may cause a combinatorial explosion in the computational burden if no effective approach is applied.

**Figure 6.1** Traditional mesh-based numerical method for determining the optimal designs. The schematic shows the geometric, operating, and uncertain variables feeding CFD simulations, followed by optimization.

In recent years, a prevalent approach to reducing this computational burden has been data-driven surrogate-based optimization, which typically builds on either shortcut models or detailed full-order models (FOMs) [14]. Shortcut models, employed to parameterize the geometry, consist of computationally inexpensive analytical or lumped-parameter equations (rather than governing PDEs) with numerous idealizing assumptions [15, 16], including perfect mixing, plug flow, and equilibrium behavior, among others. Consequently, researchers can develop automated procedures that effectively explore the design space by coupling these models with optimization algorithms such as mathematical programming [16, 17] and metaheuristic methods [18]. However, such approaches do not scale well with the system’s size and suffer from inaccuracy and model mismatch because they insufficiently describe the spatially distributed multiphysics phenomena inside a heat exchanger. By contrast, FOM-based surrogate models can accurately capture the distributed phenomena. Neural networks, Kriging, radial basis functions, and specialized regression models are often employed in a supervised-learning paradigm to learn the nonlinear mapping from arbitrary designs to their associated objectives; that is, they serve as surrogate models of the original FOMs to accelerate design and optimization [17, 18]. Nevertheless, forming a high-fidelity surrogate model often requires a large amount of training data, particularly for complex varying-geometry problems, where generating such datasets via conventional PDE solvers could become prohibitively expensive. As a result, the benefit of data-driven surrogate-based optimization must be weighed against the model accuracy and the time cost of generating the data, a trade-off that could impede the development of better designs for heat exchange systems.

With the explosive growth of computing and data resources, deep learning has shown promise for simulation, modeling, and optimization due to its capability of handling strong nonlinearity and high dimensionality. Recently, the physics-informed neural network (PINN) developed by Karniadakis and coworkers [11, 12, 19, 20] has opened up a deep learning, mesh-free route to solving governing PDEs by mathematically encoding the underlying physics (e.g. conservation laws, transfer laws, and kinetic equations) into the loss function of the neural network. The governing PDEs can take various forms, such as integer-order PDEs [21], integrodifferential equations [22], fractional or stochastic PDEs [23–25], and so on. One of the notable strengths of PINN is its ability to efficiently take the derivatives of a neural network by applying the chain rule to the compositions of functions via automatic differentiation [26]. PINN has been successfully applied to diverse areas such as fluid mechanics [27–29], medical diagnosis [30–32], heat transfer analysis [33–35], and materials science [36–38]. Moreover, because it learns physics equations in a rich representation and uses automatic differentiation, which removes the need for mesh generation, physics-informed learning is well placed to become a natural bridge that integrates the numerical model with the optimization procedure for accelerating the discovery of optimal geometry designs.

Despite the recent success and promising prospects of PINN, this approach is currently limited to tasks characterized by relatively simple and well-defined physics [10, 20]. Existing research on PINN generally aims to construct a high-fidelity surrogate model of the investigated system using a single deep neural network to control all variables and derivatives in the PDEs. Such a network often struggles to accurately penalize residuals or even fails to train, particularly when tackling multiphysics and multiscale problems with high-frequency functions [10]. For example, heat sink systems typically involve a conjugate heat transfer process [39–41], which is a combination of heat transfer in fluids and heat transfer in solids. As a result, the involved physical variables or parameters (such as velocity, pressure, and temperature) have distinct meanings and may extend over several orders of magnitude in spatial scales. Moreover, most studies have applied encoding to the parametric inputs using a single variable or parameter in the PDEs, without addressing boundary conditions, geometric dimensions, or multivariable parameterization. When the standard PINN method is applied to approximate this process, the trained neural networks and constructed surrogate models are not sufficiently robust and stable, which is not conducive to subsequent tasks such as real-time prediction and design optimization. Therefore, extending the standard PINN to accomplish the parameterized system representation task is technically challenging and requires significant improvements in the structure and algorithm of the neural network for better accuracy, faster training, and improved generalization.

This chapter aims not only to use the PINN for modeling heat exchange systems with complex geometries but also to propose an inverse design method that seamlessly combines the constructed surrogate model with multi-objective optimization and decision-making algorithms. For simulating the involved heat and mass transfer processes, specialized neural network structures are developed to decompose the standard PINN model into multiple interconnected subnetworks with identical architecture, which are functionally designed to approximate the latent solutions of the governing PDEs. In addition, the developed PINN applies the input encoding to the geometric and operating inputs in a fully decoupled setting and then concatenates them in surrogate modeling. This surrogate model, combined with multi-objective optimization and decision-making methods, not only delivers the Pareto-optimal solutions directly but also allows real-time visualization of the distributions of the monitored state variables for better physical inspection. Results are presented for two illustrative examples: (i) the first example demonstrates the proposed method’s ability to describe the underlying behaviors of conjugate heat transfer inside the heat sink system; (ii) the second example further leverages the capabilities of space decomposition, physics-informed deep learning, and transfer learning to accelerate the multi-objective stochastic optimization of a tubular air cooler system.

This chapter is derived from the work of He and coworkers [42, 43].

6.2 PINN-Based Inverse Design Method

This section begins by providing a brief introduction to the inverse design problem. Subsequently, the standard PINN method is presented, along with the PINN-based optimization and decision-making methods utilized to address this problem.

6.2.1 Overview of Inverse Design

Given a steady-state heat exchanger, the governing PDEs can be represented by a generic form defined on a bounded domain Ω ⊂ ℝ³:

(6.1) $\mathcal{F}_i[\boldsymbol{\mu}(\mathbf{x}_{\mathrm{f}}); \boldsymbol{\lambda}] = 0, \quad i \in [1, \ldots, N_{\mathcal{F}}], \quad \mathbf{x}_{\mathrm{f}} = [x_{\mathrm{f}}, y_{\mathrm{f}}, z_{\mathrm{f}}] \in \Omega$

with suitable boundary conditions

(6.2) $\mathcal{B}_j[\boldsymbol{\mu}(\mathbf{x}_{\mathrm{b}}); \boldsymbol{\lambda}] = 0, \quad j \in [1, \ldots, N_{\mathcal{B}}], \quad \mathbf{x}_{\mathrm{b}} = [x_{\mathrm{b}}, y_{\mathrm{b}}, z_{\mathrm{b}}] \in \partial\Omega$

where ℱ_i and ℬ_j denote the general form of the PDE operator and boundary condition operator, respectively; N_ℱ and N_ℬ are the numbers of corresponding equations involved in the PDEs and boundary conditions. ∂Ω denotes the boundary of domain Ω that is required for defining the constraints. Generally, ℬ_j may consist of differential, nonlinear, and identity terms, which could be subject to the following Dirichlet and Neumann boundary conditions [44], as given by

(6.3) $\text{Dirichlet:} \quad \boldsymbol{\mu}(\mathbf{x}_{\mathrm{D}}) - g(\mathbf{x}_{\mathrm{D}}) = 0, \quad \mathbf{x}_{\mathrm{D}} \in \partial_{\mathrm{D}}\Omega$

(6.4) $\text{Neumann:} \quad \mathbf{n} \cdot \nabla\boldsymbol{\mu}(\mathbf{x}_{\mathrm{N}}) - h(\mathbf{x}_{\mathrm{N}}) = 0, \quad \mathbf{x}_{\mathrm{N}} \in \partial_{\mathrm{N}}\Omega$

where μ = [u, p, T]^T; x_D and x_N are the coordinates on ∂_DΩ and ∂_NΩ, with ∂Ω = ∂_DΩ ∪ ∂_NΩ and ∂_DΩ ∩ ∂_NΩ = ∅; n is the outward unit normal vector to the boundary at x_N. Here, the initial condition can be simply treated as a special type of Dirichlet boundary condition on the domain [45].

The latent solution of the PDEs, μ(x) = [μ₁(x),…, μ_n(x)] ∈ ℝⁿ, is determined by the independent variable set λ = [λ⁽¹⁾, λ⁽²⁾], which is our quantity of interest for the inverse design problem [11]. Here, it is noted that λ includes design variable λ⁽¹⁾ and control variable λ⁽²⁾, which describe the specific shapes/structures to be manufactured and the corresponding operating conditions to be maintained for realizing an optimal design, respectively. To address this complex problem, traditional intuition-based approaches have to indirectly convert it into a large number of forward-design problems via experimental design or even brute-force search. This is done until a feasible solution is found by evaluating the outputs of all objective values of interest.

As presented in Section 6.2.1.1, we utilize the advancements of PINN for inverse design optimization due to its natural capability of embedding physical models, as well as its use of automatic differentiation, which removes the need for mesh generation. In particular, a deep neural network μ_net is developed as a parametric surrogate model of the solution μ over λ = [λ⁽¹⁾, λ⁽²⁾], trained to satisfy all the equality constraints enforced by the governing PDEs and the boundary conditions. With the aid of this surrogate model, we can directly search for the best independent variable set λ^* by minimizing the objective functions of interest, Θ = [Θ₁, …, Θ_m]. In addition, a set of equality/inequality constraints that stem from the multi-objective problem of the system, as well as from the thermal and hydraulic requirements, is also incorporated in the inverse design. Overall, the multi-objective inverse design problem can be formulated as

(6.5)

where h denotes the additional equality or inequality constraints.
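One way to write the formulation just described is sketched below. It is assembled only from the ingredients named in the text (the objectives Θ, the surrogate μ_net, the PDE and boundary constraints, and the additional constraints h); the operator symbols $\mathcal{F}_i$ and $\mathcal{B}_j$ and the choice to write h in inequality form are notational assumptions, so this should be read as an illustration rather than the authors' exact statement.

```latex
\begin{aligned}
\min_{\boldsymbol{\lambda}} \quad
  & \boldsymbol{\Theta}(\boldsymbol{\mu}_{net}, \boldsymbol{\lambda})
    = [\Theta_1, \ldots, \Theta_m] \\
\text{s.t.} \quad
  & \mathcal{F}_i[\boldsymbol{\mu}_{net}(\mathbf{x}_{\mathrm{f}}, \boldsymbol{\lambda}); \boldsymbol{\lambda}] = 0,
    \quad \mathbf{x}_{\mathrm{f}} \in \Omega \\
  & \mathcal{B}_j[\boldsymbol{\mu}_{net}(\mathbf{x}_{\mathrm{b}}, \boldsymbol{\lambda}); \boldsymbol{\lambda}] = 0,
    \quad \mathbf{x}_{\mathrm{b}} \in \partial\Omega \\
  & \boldsymbol{h}(\boldsymbol{\mu}_{net}, \boldsymbol{\lambda}) \le 0
\end{aligned}
```

In this reading the two PDE-related constraints are not handled by the optimizer itself: they are satisfied approximately by the trained surrogate, so the search over λ only needs to evaluate μ_net.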

6.2.1.1 Standard Physics-Informed Neural Networks

The PINN algorithm can parameterize a physical model by applying the input encoding to the temporal/spatial and parametric inputs and then concatenating these inputs. It often uses a conventional feed-forward, fully connected architecture, where a deep neural network represented by μ_net(x, λ; θ) is constructed as a surrogate of the solution μ. As shown in Figure 6.2, μ_net(x, λ; θ) takes the spatial coordinate x = [x, y, z] and the parametric variables λ = [λ₁, λ₂, …, λ_n] as the input and outputs a vector α^L (i.e. u, p, T) that has the same dimension as μ. The training of the PINN aims to obtain the set of best parameters θ^* = [w^*, b^*] by minimizing the loss terms (ℒ_f and ℒ_b) such that the neural network approximates the solution of the original PDEs.

(6.6) $\boldsymbol{\mu}_{net}(\mathbf{x}, \boldsymbol{\lambda}; \boldsymbol{\theta}) \triangleq \boldsymbol{\alpha}^{L}(\mathbf{x}, \boldsymbol{\lambda}; \boldsymbol{w}, \boldsymbol{b}) \rightarrow \boldsymbol{\mu}(\mathbf{x}, \boldsymbol{\lambda})$

**Figure 6.2** Schematic of the standard PINN algorithm for parametric surrogate modeling. The network takes x and λ as inputs, with θ as the network parameters, and its outputs are constrained by the residuals of the governing PDEs and boundary conditions, both parameterized by λ.

Source: [19]/with permission of Elsevier.

The derivative terms (i.e. ∇ and ∇²) are handled via automatic differentiation; a = σ[·] denotes a nonlinear activation function.

where w and b are weight matrices and bias vectors of the neural network, respectively, which will be tuned at the training stage.
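The mapping in Eq. (6.6) can be illustrated with a minimal forward pass. This is a sketch under stated assumptions, not the chapter's implementation: the helper names `init_mlp` and `mu_net`, the layer sizes, the tanh activation, and the Xavier-style initialization are all illustrative choices, and a practical PINN would be written in an automatic-differentiation framework rather than plain Python.

```python
import math
import random


def init_mlp(sizes, seed=0):
    """Create weights w and biases b for a fully connected network.

    sizes = [n_in, hidden..., n_out]; the initialization scale is an
    assumed Xavier-style choice, not taken from the chapter.
    """
    rng = random.Random(seed)
    params = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        scale = math.sqrt(2.0 / (n_in + n_out))
        w = [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]
        b = [0.0] * n_out
        params.append((w, b))
    return params


def mu_net(x, lam, params):
    """Forward pass: concatenate coordinates x and parameters lam, then
    apply affine layers with tanh on every layer except the last."""
    a = list(x) + list(lam)
    last = len(params) - 1
    for k, (w, b) in enumerate(params):
        z = [sum(wij * aj for wij, aj in zip(row, a)) + bi
             for row, bi in zip(w, b)]
        a = z if k == last else [math.tanh(v) for v in z]
    return a  # e.g. [u_x, u_y, u_z, p, T]


# Three spatial coordinates plus two design/operating parameters in,
# five field values out (hypothetical sizes).
params = init_mlp([5, 16, 16, 5])
fields = mu_net([0.1, 0.2, 0.3], [1.0, 2.0], params)
```

Concatenating x and λ at the input is what makes the surrogate parametric: one trained network can then be queried for any design inside the sampled range of λ.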

To train the PINN, the derivative terms of μ_net with respect to its inputs are computed by applying the chain rule to compositions of functions via automatic differentiation. Next, the neural network μ_net is evaluated on a finite set of training points so that it satisfies the physics imposed by the PDEs and boundary conditions. For this purpose, a composite loss function that penalizes the deviation of the neural network from the PDEs and boundary conditions is considered.

(6.7)

where

(6.8)

(6.9)

and ω_f and ω_b are the weights that balance the interplay between the two loss terms ℒ_f and ℒ_b; ‖·‖₂ denotes the L² norm of the residuals. The training set consists of two mutually independent sets of sampled points, one in the domain Ω and one on the boundary ∂Ω. Since the loss function is often highly nonlinear and non-convex, the network parameters θ of the PINN are iteratively optimized by gradient-based optimizers, such as gradient descent, Adam, and L-BFGS [45].

(6.10)
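The structure of such a composite loss and its gradient-based minimization can be sketched on a toy problem. Everything here is an assumption made for illustration: the 1D problem u'' = 6x with u(0) = 0 and u(1) = 1 is not from the chapter, a cubic polynomial stands in for the neural network so that its derivative is available in closed form instead of through automatic differentiation, and a central-difference gradient with plain gradient descent stands in for backpropagation with Adam or L-BFGS. The names `u`, `u_xx`, `loss`, `grad`, and `train` are hypothetical.

```python
import random


def u(x, th):
    """Trial solution (stand-in for mu_net): cubic polynomial in x."""
    return th[0] + th[1] * x + th[2] * x**2 + th[3] * x**3


def u_xx(x, th):
    """Second derivative of the trial solution, analytic for a polynomial."""
    return 2.0 * th[2] + 6.0 * th[3] * x


def loss(th, xs_f, w_f=1.0, w_b=1.0):
    """Weighted sum of the mean squared PDE and boundary residuals."""
    # PDE residual of u'' - 6x = 0 at the interior collocation points
    l_f = sum((u_xx(x, th) - 6.0 * x) ** 2 for x in xs_f) / len(xs_f)
    # Dirichlet residuals for u(0) = 0 and u(1) = 1
    l_b = (u(0.0, th) ** 2 + (u(1.0, th) - 1.0) ** 2) / 2.0
    return w_f * l_f + w_b * l_b


def grad(th, xs_f, eps=1e-6):
    """Central-difference gradient of the loss with respect to th."""
    g = []
    for k in range(len(th)):
        up, dn = list(th), list(th)
        up[k] += eps
        dn[k] -= eps
        g.append((loss(up, xs_f) - loss(dn, xs_f)) / (2.0 * eps))
    return g


def train(n_points=30, steps=3000, lr=0.02, seed=0):
    """Plain gradient descent on the composite loss."""
    rng = random.Random(seed)
    xs_f = [rng.random() for _ in range(n_points)]  # interior samples
    th = [0.0] * 4
    for _ in range(steps):
        th = [t - lr * gk for t, gk in zip(th, grad(th, xs_f))]
    return th, loss(th, xs_f)


theta_star, final_loss = train()
```

The exact solution of the toy problem is u = x³, so the trained parameters can be checked against it. The weights `w_f` and `w_b` play the role of ω_f and ω_b: raising `w_b` forces the boundary conditions to be met more tightly at the expense of the interior residual.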

To improve the training performance, the loss function of the PINN can take the form of a Monte Carlo integration approximation [46], since this ensures the consistency of the loss per volume across the domain. Additionally, the weighting coefficient in the loss function plays a pivotal role in the accuracy of the PINN and can be user-defined or tuned automatically. It is often defined as a fixed value that does not vary with the spatial position of the sampled points in the flow field or on the boundary. Nevertheless, the gradient profile varies greatly across the flow field (e.g. at sampled points near sharp corners or discontinuities), which makes a fixed weighting coefficient unreasonable. Here, the signed distance function (SDF) [47] is introduced in the loss function to weight the loss terms, so that each weighting coefficient becomes a function of the position of the sampled point in the spatial domain. The SDF is defined as follows [48]:

(6.11) $\omega(\mathbf{x}) = \begin{cases} d(\mathbf{x}), & \mathbf{x} \in \Omega \\ 0, & \mathbf{x} \in \partial\Omega \\ -d(\mathbf{x}), & \mathbf{x} \notin \Omega \cup \partial\Omega \end{cases}$

where

(6.12) $d(\mathbf{x}) = \min_{\mathbf{x}_{\mathrm{p}} \in \partial\Omega} \lVert \mathbf{x} - \mathbf{x}_{\mathrm{p}} \rVert, \quad \mathbf{x} \in \Omega$

where d(x) represents the minimum distance of a given point from ∂Ω. According to this definition, the absolute value of the SDF decreases toward zero near the boundary of the domain, where sharp gradients occur. Hence, weighting by the SDF tends to reduce the impact of sharp gradients on convergence, which speeds up training and sometimes also improves prediction accuracy.

Overall, by combining the SDF and Monte Carlo integration approximation, the combined loss function for training the parameters of each subnetwork is as follows:

(6.13)
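The combination of SDF weighting and Monte Carlo integration can be sketched for the simplest geometry, an axis-aligned rectangle, where the distance to the boundary has a closed form. This is an illustrative assumption-laden example: the function names `sdf_box` and `weighted_mc_loss` are hypothetical, the 2D rectangle replaces the chapter's 3D geometry, and the estimator (area times the sample mean of ω·r²) is one common way to write a Monte Carlo loss, not necessarily the exact form of Eq. (6.13).

```python
import random


def sdf_box(x, y, x0, x1, y0, y1):
    """Signed distance to the boundary of the rectangle [x0, x1] x [y0, y1].

    Follows the sign convention of Eq. (6.11): positive inside the
    domain, zero on the boundary, negative outside.
    """
    dx = max(x0 - x, x - x1)  # positive only if the point is outside in x
    dy = max(y0 - y, y - y1)  # positive only if the point is outside in y
    if dx <= 0.0 and dy <= 0.0:
        return min(-dx, -dy)  # inside: distance to the nearest wall
    return -((max(dx, 0.0) ** 2 + max(dy, 0.0) ** 2) ** 0.5)


def weighted_mc_loss(residual, points, area, box):
    """Monte Carlo estimate of the integral of sdf(x) * residual(x)^2.

    points are assumed to be sampled uniformly inside the box, so the
    integral is approximated by area * mean(integrand).
    """
    total = sum(sdf_box(x, y, *box) * residual(x, y) ** 2 for x, y in points)
    return area * total / len(points)


# Uniform samples in the unit square and a constant unit residual.
rng = random.Random(1)
pts = [(rng.random(), rng.random()) for _ in range(20000)]
value = weighted_mc_loss(lambda x, y: 1.0, pts, 1.0, (0.0, 1.0, 0.0, 1.0))
```

With a unit residual the estimate approaches the integral of the distance function over the unit square, which is 1/6 (the volume of a pyramid with unit base and height 0.5). Points next to a wall contribute almost nothing, which is the intended damping of near-boundary residuals.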

Another similar training strategy, namely integral continuity plane [49], can be used for enforcing the conservation constraints (e.g. mass, volume, and energy equations) of a given control volume in the domain. For example, we can specify that the volumetric flow of coolant passing through the heat exchanger system must be equal to that entering the system in the flow channel, as listed in Eq. (6.14). This conservation constraint helps to accelerate the solution of the continuity PDEs. Similarly, the heat flux of coolant entering the flow channel, along with the heat flux generated by the working heat source, must be equal to that leaving the channel. Such heat flux conservation is also expressed in an integral form, as given by Eq. (6.15).

(6.14) $\int (\mathbf{n} \cdot \mathbf{u}) \, \mathrm{d}S_{\mathrm{plane}} = U_{\mathrm{inlet}} A_{\mathrm{inlet}}$

(6.15) $\int \rho_{\mathrm{fluid}} C_{\mathrm{p,fluid}} \left[ (\mathbf{n} \cdot \mathbf{u}) \, T_{\mathrm{fluid}} \right] \mathrm{d}S_{\mathrm{outlet}} = \rho_{\mathrm{fluid}} C_{\mathrm{p,fluid}} U_{\mathrm{inlet}} T_{\mathrm{inlet}} A_{\mathrm{inlet}} + q_{\mathrm{s}} A_{\mathrm{source}}$

where A_inlet and A_source are the areas of the channel inlet plane and heat source surface, respectively.
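The volumetric-flow constraint of Eq. (6.14) can be sketched as a Monte Carlo estimate of the surface integral on one plane, turned into a residual that would be added to the loss. This is a minimal illustration under assumptions: the names `plane_flux` and `continuity_residual` are hypothetical, the plane is a flat rectangle, and an analytic parabolic profile between two plates stands in for the velocity that the network would predict.

```python
import random


def plane_flux(normal_velocity, sample_point, area, n=20000, seed=0):
    """Monte Carlo estimate of the integral of n.u over a plane.

    normal_velocity(*p) returns n.u at point p; sample_point(rng) draws
    a point uniformly on the plane; area is the plane area.
    """
    rng = random.Random(seed)
    total = sum(normal_velocity(*sample_point(rng)) for _ in range(n))
    return area * total / n


def continuity_residual(normal_velocity, sample_point, area, u_inlet, a_inlet):
    """Difference between the flux through the plane and U_inlet * A_inlet."""
    return plane_flux(normal_velocity, sample_point, area) - u_inlet * a_inlet


# Assumed test case: plane of height 2 (y in [-1, 1]) and width 1,
# parabolic profile whose mean value equals the inlet velocity.
U_INLET = 0.5


def profile(y, z):
    return 1.5 * U_INLET * (1.0 - y * y)


def sampler(rng):
    return rng.uniform(-1.0, 1.0), rng.uniform(0.0, 1.0)


residual = continuity_residual(profile, sampler, 2.0, U_INLET, 2.0)
```

For a velocity field that conserves volume the residual is zero up to sampling error; during training its square would be added to the loss for each monitored plane, giving the network a global constraint in addition to the pointwise continuity equation.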

The training hyperparameters of the neural network, e.g. network sizes, learning rates, optimizers, initializations, and regularizations, also need to be fine-tuned to achieve a good level of accuracy. More details of the training strategy can be found in Refs [45, 49].

Finally, it should be pointed out that the standard PINN uses a single neural network μ_net to construct a surrogate of the solution μ. While this approach has demonstrated promising results in certain applications, its limitations become apparent in more complex multiphysics and multiscale tasks that involve high-dimensional and intricate geometric domains. For example, heat sink systems typically involve a convective heat transfer process, which entails a two-way coupling of fluid flow and heat transfer. In practice, however, it is challenging to precisely penalize the residuals when training the neural network of this process in a fully coupled setting. This is because the terms in the combined loss function have distinct physical meanings, such as velocity, pressure, and temperature, and may span several orders of magnitude in spatial scales, making the training process less robust and stable.

6.2.1.2 Design Optimization and Decision-making Methods

In the field of multi-objective optimization, previous studies [50–52] have often relied on evolutionary methods, which mainly include decomposition-based and Pareto-based approaches. The former typically use a scalarizing function to aggregate all the objectives into a single scalar objective function, or retain only one objective while enforcing the others as constraints. A major issue with this approach is that it requires extensive knowledge of the problem structure, and not all scalarizing functions can guarantee that all Pareto-optimal solutions are obtainable [53]. The latter make full use of the Pareto-dominance relations to induce a partial ordering in the objective space, which provides a more convenient way to drive a set of solutions toward the Pareto front and to cover the entire front. One of the most widely used algorithms is the Non-dominated Sorting Genetic Algorithm II (NSGA-II) [54], which is employed in the present study. In the absence of additional preference information, the alternative solutions on the Pareto-optimal front are equally worthy with respect to the optimization objectives, so the most preferred solution still has to be identified for decision-makers in practice. To this end, a classic decision-making method, namely the technique for order preference by similarity to ideal solution (TOPSIS) [55, 56], is subsequently used to identify, as the compromise scheme, the solution that is closest to the positive-ideal solution and furthest from the negative-ideal solution.
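The Pareto-dominance relation that underlies Pareto-based methods can be sketched in a few lines. This is only the dominance test and the extraction of the first non-dominated front for minimization problems; it is not NSGA-II itself, which additionally ranks all fronts, uses crowding distance, and evolves the population. The function names `dominates` and `non_dominated` and the sample objective values are hypothetical.

```python
def dominates(a, b):
    """True if a is no worse than b in every objective and strictly
    better in at least one (all objectives are minimized)."""
    return (all(x <= y for x, y in zip(a, b))
            and any(x < y for x, y in zip(a, b)))


def non_dominated(points):
    """Return the points that no other point dominates (first front)."""
    return [p for p in points
            if not any(dominates(q, p) for q in points if q is not p)]


# Two objectives per candidate design (made-up values).
candidates = [(1.0, 5.0), (2.0, 3.0), (3.0, 4.0), (4.0, 1.0), (5.0, 5.0)]
front = non_dominated(candidates)
```

Here (3.0, 4.0) is dominated by (2.0, 3.0) and (5.0, 5.0) is dominated by (1.0, 5.0), so the front keeps the remaining three candidates. None of those three is better than another in both objectives, which is why a separate decision-making step is needed to pick one.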

Assuming the number of alternative solutions is M, the procedure of the TOPSIS method can be divided into the following steps:

Step I: All objectives of interest are converted into benefit-type (larger-is-better) criteria through a positive transformation

(6.16) $Y_{mn} = \left( \max_{m} \Theta_{mn}^{*} \right) - \Theta_{mn}^{*}, \quad \forall m \in [1, 2, \ldots, M], \; \forall n \in [1, 2, \ldots, N]$

where Θ*_mn denotes the value of the mth alternative solution in terms of the nth objective, and max_m Θ*_mn denotes the maximum value over all the alternative solutions in terms of the nth objective. Y_mn denotes the element of the positive matrix Y.

Step II: The weighted normalized decision matrix Z is constructed by normalizing and weighting the positive matrix Y according to the following equation:

(6.17) $Z_{mn} = \frac{Y_{mn}}{\sqrt{\sum_{m=1}^{M} Y_{mn}^{2}}} \times \omega_{n}, \quad \forall n \in [1, 2, \ldots, N]$

where ω_n is the weighting factor of the nth criterion, ∑ω_n = 1.

Step III: The positive-ideal solution Z⁺ and the negative-ideal solution Z⁻ are determined:

(6.18) $Z_{n}^{+} = \left( \max_{m} Z_{mn} \right), \quad \forall n \in [1, 2, \ldots, N]$

(6.19) $Z_{n}^{-} = \left( \min_{m} Z_{mn} \right), \quad \forall n \in [1, 2, \ldots, N]$

Step IV: The Euclidean distances of each alternative solution from the positive-ideal solution and the negative-ideal solution are calculated from Eqs. (6.20) and (6.21), respectively.

(6.20) $R_{m}^{+} = \sqrt{\sum_{n=1}^{N} \left( Z_{n}^{+} - Z_{mn} \right)^{2}}, \quad \forall m \in [1, 2, \ldots, M]$

(6.21) $R_{m}^{-} = \sqrt{\sum_{n=1}^{N} \left( Z_{n}^{-} - Z_{mn} \right)^{2}}, \quad \forall m \in [1, 2, \ldots, M]$

Step V: The relative closeness to the positive-ideal solution is measured by

(6.22) $S_{m} = \frac{R_{m}^{-}}{R_{m}^{+} + R_{m}^{-}}, \quad \forall m \in [1, 2, \ldots, M]$

Finally, the Pareto solution with the largest S_m is recommended as the desired solution for the bi-objective optimization problem.
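Steps I to V can be collected into one short routine. The sketch below follows Eqs. (6.16) to (6.22) as written, assuming every objective is to be minimized; the function name `topsis`, the guards against division by zero, and the three sample alternatives are illustrative additions rather than part of the original method description.

```python
import math


def topsis(theta, weights):
    """Rank alternatives with TOPSIS.

    theta[m][n] is the value of objective n (to be minimized) for
    alternative m; weights[n] are the criterion weights summing to 1.
    Returns the closeness scores S and the index of the best alternative.
    """
    M, N = len(theta), len(theta[0])
    # Step I, Eq. (6.16): convert to larger-is-better values
    col_max = [max(theta[m][n] for m in range(M)) for n in range(N)]
    Y = [[col_max[n] - theta[m][n] for n in range(N)] for m in range(M)]
    # Step II, Eq. (6.17): vector-normalize each column and apply weights
    norm = [math.sqrt(sum(Y[m][n] ** 2 for m in range(M))) for n in range(N)]
    Z = [[weights[n] * Y[m][n] / norm[n] if norm[n] > 0.0 else 0.0
          for n in range(N)] for m in range(M)]
    # Step III, Eqs. (6.18)-(6.19): ideal and negative-ideal solutions
    z_pos = [max(Z[m][n] for m in range(M)) for n in range(N)]
    z_neg = [min(Z[m][n] for m in range(M)) for n in range(N)]
    # Step IV, Eqs. (6.20)-(6.21): Euclidean distances to both ideals
    r_pos = [math.sqrt(sum((z_pos[n] - Z[m][n]) ** 2 for n in range(N)))
             for m in range(M)]
    r_neg = [math.sqrt(sum((z_neg[n] - Z[m][n]) ** 2 for n in range(N)))
             for m in range(M)]
    # Step V, Eq. (6.22): relative closeness to the positive-ideal solution
    S = [r_neg[m] / (r_pos[m] + r_neg[m]) if r_pos[m] + r_neg[m] > 0.0 else 0.0
         for m in range(M)]
    best = max(range(M), key=S.__getitem__)
    return S, best


# Three Pareto-optimal alternatives with two objectives (made-up values).
scores, best = topsis([[1.0, 5.0], [2.0, 3.0], [4.0, 1.0]], [0.5, 0.5])
```

With equal weights the balanced alternative (index 1) obtains the highest closeness score, ahead of the two extreme points of the front. Changing `weights` shifts the recommendation toward the objective that the decision-maker values more.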

6.3 Example 1: Finned Heat Sink Model

6.3.1 System Description and Objectives

A 3D finned heat sink system with forced air cooling [57] is used as a motivating example in this chapter. As shown in Figure 6.3, the base plate with a built-in chip is placed on the bottom center of a rectangular channel and is directly in contact with the flat base of the heat sink system through a thermally conductive adhesive. In this way, almost all the heat flux generated by the hot chip can be assumed to be transferred to the fins on the flat base. Meanwhile, the coolant medium (air) introduced from the channel inlet flows over the surface of the fins to promptly take away the transferred heat and prevent the junction temperature of the chip from reaching the threshold value. In summary, the heat sink system involves a 3D, steady-state conjugate heat transfer process.

**Figure 6.3** A schematic view of the geometry of the finned heat sink system. (a) 3D view, showing the fins, flat base, base plate, and channel; (b) left view, showing the fin height and width and the base width and height; and (c) bottom view, showing the heat source of length L. In 3D modeling, the center of the channel is used as the origin of the Cartesian coordinate system.

Source: [42]/with permission of Elsevier.

Conjugate heat transfer combines heat transfer in fluids with heat transfer in solids [58]. In fluids, heat convection dominates the heat transfer process. Assuming that the fluid passing through the system is incompressible and laminar, the governing continuity and Navier–Stokes equations are as follows [59]:

(6.23) $\rho_{\mathrm{fluid}} (\nabla \cdot \mathbf{u}) = 0$

(6.24) $\rho_{\mathrm{fluid}} (\mathbf{u} \cdot \nabla) \mathbf{u} + \nabla p - \mu \nabla^{2} \mathbf{u} = 0$

where u = [u_x, u_y, u_z] denotes the fluid velocity vector; ∇ denotes the Hamiltonian operator; p, ρ_fluid, and μ denote the static pressure, density, and dynamic viscosity of the fluid, respectively. Here, it is assumed that the thermophysical properties of the fluid and solid, as listed in Table 6.1, are constant and insensitive to temperature change in the flow field.

Table 6.1 Thermophysical properties of fluid and solid.

Property	ρ (kg m⁻³)	κ (W (m K)⁻¹)	Cp (J (kg K)⁻¹)	μ (kg (m s)⁻¹)
Fluid	1.0	1.0	50	0.02
Solid	1.0	5.0	80	—

Heat transfer in the fluid is driven by the fluid motion. In the absence of internal heat sources, the governing equation of fluid heat transfer is given by

(6.25) $\rho_{\mathrm{fluid}} C_{\mathrm{p,fluid}} \, \mathbf{u} \cdot \nabla T_{\mathrm{fluid}} - \kappa_{\mathrm{fluid}} \nabla^{2} T_{\mathrm{fluid}} = 0$

where T_fluid, C_p,fluid, and κ_fluid are the temperature, specific heat, and thermal conductivity of the fluid, respectively. The first and second terms are the convection term and the diffusion term, which describe the thermal convection and heat conduction of the fluid, respectively.
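The balance between the convection and diffusion terms of Eq. (6.25) can be checked in one dimension, where an exact solution exists. The following is only an illustrative sketch and not part of the original study: the 1D domain length (0.1 m), the uniform velocity (1 m s⁻¹), and the end temperatures are assumed values, while the fluid properties are those of Table 6.1.

```python
import math

# Fluid properties from Table 6.1
RHO, CP, KAPPA = 1.0, 50.0, 1.0

# Assumed (hypothetical) 1D setup: length [m] and end temperatures [degC]
LENGTH, T_LEFT, T_RIGHT = 0.1, 20.0, 30.0


def exact_temperature(x, u):
    """Exact solution of rho*Cp*u*T' - kappa*T'' = 0 with fixed end temperatures."""
    k = RHO * CP * u / KAPPA  # inverse length scale [1/m]
    return T_LEFT + (T_RIGHT - T_LEFT) * math.expm1(k * x) / math.expm1(k * LENGTH)


def equation_terms(x, u, h=1e-4):
    """Central finite-difference estimates of the two terms of Eq. (6.25) in 1D."""
    t_minus = exact_temperature(x - h, u)
    t_mid = exact_temperature(x, u)
    t_plus = exact_temperature(x + h, u)
    convection = RHO * CP * u * (t_plus - t_minus) / (2.0 * h)
    diffusion = KAPPA * (t_plus - 2.0 * t_mid + t_minus) / h ** 2
    return convection, diffusion


convection, diffusion = equation_terms(0.05, u=1.0)
residual = convection - diffusion
print(f"convection={convection:.3f}, diffusion={diffusion:.3f}, residual={residual:.2e}")
```

The two terms nearly cancel, and the small remaining residual comes from the finite-difference truncation error. A PINN drives the same kind of residual toward zero at its collocation points, using automatic differentiation instead of finite differences.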

Since heat conduction plays the major role in solids, the temperature distribution can be determined directly by solving Laplace’s equation:

(6.26) $\dfrac{\kappa_{\mathrm{solid}}}{\rho_{\mathrm{solid}} C_{\mathrm{p,solid}}} \nabla^{2} T_{\mathrm{solid}} = 0$

where T_solid, C_p,solid, and κ_solid are the temperature, specific heat, and thermal conductivity of the solid, respectively.

An essential condition for solving the aforementioned conjugate heat transfer problem is to provide well-defined boundary and initial conditions as model constraints, especially at the fluid–solid interface. First, for simulating the fluid flow process, a no-slip velocity condition (u = 0) is applied to both the channel wall and the fluid–solid interface. The air velocity at the channel inlet is known (u_x = U_inlet, u_y = 0, u_z = 0), while the air pressure at the channel outlet is set to zero. For simulating the heat transfer process, the channel wall is modeled using an adiabatic condition (n·(κ_fluid·∇T_fluid) = 0) with a known temperature at the channel inlet (T_fluid = T_inlet). It is assumed that the heat source creates a uniform heat flow at the bottom of the flat base. With this assumption, a constant heat flux q_s can be specified as a thermal boundary constraint for the heat sink model, as given by

(6.27) $-\mathbf{n} \cdot \left( \kappa_{\mathrm{solid}} \cdot \nabla T_{\mathrm{sb}} \right) = q_{\mathrm{s}}$

where ∇T_sb denotes the temperature gradient at the bottom of the flat base and n denotes the outward unit normal vector of the surface. To guarantee the continuity of heat flux and temperature at the fluid–solid interface, the following constraints are enforced as boundary conditions [60]:

(6.28) $T_{\mathrm{sw}} - T_{\mathrm{fw}} = 0$

(6.29) $\mathbf{n} \cdot \left( \kappa_{\mathrm{solid}} \cdot \nabla T_{\mathrm{sw}} \right) - \mathbf{n} \cdot \left( \kappa_{\mathrm{fluid}} \cdot \nabla T_{\mathrm{fw}} \right) = 0$

where the subscripts sw and fw denote the solid wall and fluid wall at the fluid–solid interface, respectively.

The heat sink system generally aims at cost-effectively cooling down the running heat source to prevent thermal shutdown by exchanging sensible heat with air flowing over the surface of this system. Based on this premise, the first objective of interest is the mean surface temperature of the heat source (T_mean), which is often used as a key indicator of the cooling capacity of a given heat sink system. In addition, decision-makers are concerned about the operating cost of achieving a higher cooling capacity. For example, using complex fins with a larger heat exchange area would help to reduce the working temperature of the heat source, but it also incurs an additional pressure drop (Δp) in the airflow, which in turn requires proportionally more power from the intake fan. These two conflicting objectives can be calculated through the solid heat transfer network and the fluid flow network, combined with the Monte Carlo integral approximation, as given by:

(6.30) $\begin{aligned} T_{\mathrm{mean}} &= \frac{1}{A_{\mathrm{source}}} \int_{\mathrm{source}} \boldsymbol{\mu}_{\mathrm{heat\_net\_solid}} \left( \mathbf{x}_{\mathrm{source}}, \boldsymbol{\lambda}; \boldsymbol{\theta}^{*} \right) \mathrm{d}\mathbf{x} \\ &\approx \frac{1}{N_{\mathrm{source}}} \sum_{k=1}^{N_{\mathrm{source}}} \boldsymbol{\mu}_{\mathrm{heat\_net\_solid}} \left( \mathbf{x}_{\mathrm{source}}^{k}, \boldsymbol{\lambda}; \boldsymbol{\theta}^{*} \right) \end{aligned}$

(6.31) $\begin{aligned} \Delta p &= p_{\mathrm{in}} - p_{\mathrm{out}} \\ &= \frac{1}{A_{\mathrm{in}}} \int_{\mathrm{in}} \boldsymbol{\mu}_{\mathrm{flow\_net}} \left( \mathbf{x}_{\mathrm{in}}, \boldsymbol{\lambda}; \boldsymbol{\theta}^{*} \right) \mathrm{d}\mathbf{x} - \frac{1}{A_{\mathrm{out}}} \int_{\mathrm{out}} \boldsymbol{\mu}_{\mathrm{flow\_net}} \left( \mathbf{x}_{\mathrm{out}}, \boldsymbol{\lambda}; \boldsymbol{\theta}^{*} \right) \mathrm{d}\mathbf{x} \\ &\approx \frac{1}{N_{\mathrm{in}}} \sum_{k=1}^{N_{\mathrm{in}}} \boldsymbol{\mu}_{\mathrm{flow\_net}} \left( \mathbf{x}_{\mathrm{in}}^{k}, \boldsymbol{\lambda}; \boldsymbol{\theta}^{*} \right) - \frac{1}{N_{\mathrm{out}}} \sum_{k=1}^{N_{\mathrm{out}}} \boldsymbol{\mu}_{\mathrm{flow\_net}} \left( \mathbf{x}_{\mathrm{out}}^{k}, \boldsymbol{\lambda}; \boldsymbol{\theta}^{*} \right) \end{aligned}$

where A_out denotes the cross-sectional area at the outlet of the channel; $\mathbf{x}_{\mathrm{source}}^{k}$, $\mathbf{x}_{\mathrm{in}}^{k}$, and $\mathbf{x}_{\mathrm{out}}^{k}$ are the coordinates of samples collected on the heat source surface, channel inlet, and outlet sections, respectively; N_source, N_in, and N_out are the corresponding numbers of these samples.
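The Monte Carlo approximations in Eqs. (6.30) and (6.31) reduce to sample means of the surrogate outputs when the sample points are drawn uniformly on each surface. The sketch below illustrates this under stated assumptions: `toy_heat_net_solid` and `toy_flow_net_pressure` are hypothetical analytic stand-ins for the trained sub-networks, and the surface extents and sample counts are arbitrary illustrative choices.

```python
import random


def mc_average(surrogate, points, lam):
    """Sample-mean estimate of a surface-averaged surrogate output."""
    values = [surrogate(x, lam) for x in points]
    return sum(values) / len(values)


def toy_heat_net_solid(x, lam):
    """Hypothetical stand-in for heat_net_solid: temperature [degC] on the source."""
    return 70.0 + 5.0 * (1.0 - x[0] ** 2) * (1.0 - x[1] ** 2)


def toy_flow_net_pressure(x, lam):
    """Hypothetical stand-in for the pressure output of flow_net [Pa]."""
    return 4.5 - 9.0 * x[0]


rng = random.Random(0)   # fixed seed for reproducibility
lam = {"U_inlet": 1.0}   # placeholder parameters (ignored by the toy surrogates)

# Uniform samples on an assumed square source surface and on inlet/outlet planes
source_pts = [(rng.uniform(-1, 1), rng.uniform(-1, 1), 0.0) for _ in range(20000)]
inlet_pts = [(-0.5, rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(1000)]
outlet_pts = [(0.5, rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(1000)]

t_mean = mc_average(toy_heat_net_solid, source_pts, lam)
delta_p = (mc_average(toy_flow_net_pressure, inlet_pts, lam)
           - mc_average(toy_flow_net_pressure, outlet_pts, lam))
print(f"T_mean ~ {t_mean:.2f} degC, delta_p ~ {delta_p:.2f} Pa")
```

For the toy temperature field, the exact surface average is 70 + 5 × 4/9 ≈ 72.22 °C, so the printed estimate gives a feel for the sampling error at this sample count.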

The peak surface temperature of the heat source (T_peak) must be lower than the maximum allowable junction temperature (T_max) to ensure the safety and reliability of the system [61].

(6.32) $\begin{aligned} T_{\mathrm{peak}} &= \max \boldsymbol{\mu}_{\mathrm{heat\_net\_solid}} \left( \mathbf{x}_{\mathrm{source}}, \boldsymbol{\lambda}; \boldsymbol{\theta}^{*} \right) \\ &\approx \max \boldsymbol{\mu}_{\mathrm{heat\_net\_solid}} \left( \mathbf{x}_{\mathrm{source}}^{k}, \boldsymbol{\lambda}; \boldsymbol{\theta}^{*} \right) \end{aligned}$

A bi-objective optimization model is formulated to minimize the mean surface temperature and the pressure drop of the system, as given by:

(6.33)

where Eqs. (6.30)–(6.32) are embedded as a surrogate model to predict the temperature and pressure drop. The superscripts lb and ub denote the lower and upper bounds of the decision variables of interest in the present study, as listed in Table 6.2. For simplicity, all fins of the heat sink system share the same height (H) and thickness (D), while the length of the central fin (L_ctrl) and the length of the two side fins (L_sd) are independent of each other. As a result, there are four design variables in total, λ⁽¹⁾ = [H, D, L_ctrl, L_sd]. Besides, only the inlet velocity of the coolant is considered as the operating variable, λ⁽²⁾ = [U_inlet].
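To make the notion of a constrained bi-objective optimum concrete, the sketch below filters a set of candidate designs down to those that satisfy the junction temperature limit and are not dominated in (T_mean, Δp). It is a plain non-dominated filter rather than the evolutionary search used in the study, and the candidate values are hypothetical.

```python
T_MAX = 90.0  # maximum allowable junction temperature [degC], Table 6.2


def pareto_front(candidates, t_max=T_MAX):
    """Return feasible, non-dominated candidates when minimizing (t_mean, dp)."""
    feasible = [c for c in candidates if c["t_peak"] < t_max]
    front = []
    for c in feasible:
        dominated = any(
            o["t_mean"] <= c["t_mean"] and o["dp"] <= c["dp"]
            and (o["t_mean"] < c["t_mean"] or o["dp"] < c["dp"])
            for o in feasible
        )
        if not dominated:
            front.append(c)
    return front


# Hypothetical surrogate predictions for five candidate designs
candidates = [
    {"t_mean": 70.7, "dp": 12.2, "t_peak": 79.0},
    {"t_mean": 68.1, "dp": 16.3, "t_peak": 77.0},
    {"t_mean": 66.3, "dp": 25.2, "t_peak": 75.0},
    {"t_mean": 69.0, "dp": 20.0, "t_peak": 78.0},  # dominated by the second design
    {"t_mean": 82.0, "dp": 9.0, "t_peak": 91.5},   # violates T_peak < T_max
]

front = pareto_front(candidates)
for design in front:
    print(design)
```

Only the first three candidates survive: the fourth is worse than the second in both objectives, and the fifth exceeds the temperature limit even though it has the lowest pressure drop.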

Table 6.2 Summary of key variables and constraints.

Variable/constraint	Symbol	Unit	Lower bound	Upper bound
Operating variable, λ⁽²⁾
Inlet air velocity	U_inlet	m s⁻¹	0	3.0
Design variable, λ⁽¹⁾
Fin height	H	m	0	0.6
Fin thickness	D	m	0.05	0.15
Central fin length	L_ctrl	m	0.50	1.0
Side fin length	L_sd	m	0.50	1.0
Process constraint
Inlet air temperature	T_inlet	°C	20
Heat flux	q_s	W m⁻²	1800
Max allowable junction temperature	T_max	°C	90

6.3.2 Improved PINN Structure

In this example, we develop a specialized neural network structure, namely the hybrid PINN, which decomposes the standard PINN into multiple small sub-networks so that the different state variables of interest can be distinguished in the combined loss function. According to the characteristics of fluid flow and heat transfer in the conjugate process, the neural network representation of this system is decomposed into three interconnected sub-networks with identical architecture, namely flow_net, heat_net_fluid, and heat_net_solid, which approximate the latent solutions of the Navier–Stokes equations, heat transfer in the fluid, and heat transfer in the solid, respectively. In Figure 6.4, each sub-network applies the input encoding to the same spatial and parametric inputs [x, λ]. According to the relationship between the governing PDEs involved in these sub-networks, it is assumed that there is a one-way coupling between the fluid flow and heat transfer processes. Meanwhile, heat transfer in the fluid and heat transfer in the solid are treated as parallel processes that are coupled through the fluid–solid interface. On this basis, it is possible to compute the temperature field once the training of the flow field has converged. More specifically, once the training of flow_net is completed and satisfies the convergence criteria, the resulting velocity field u^* is used as an intermediate for training heat_net_fluid and heat_net_solid simultaneously. Besides, the continuity condition at the fluid–solid interface is encoded into the loss function as a boundary condition, which can lead to a significant speedup of the multiphysics learning task.
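The interface continuity conditions of Eqs. (6.28) and (6.29) can be turned into a loss term by penalizing their residuals at sampled interface points. The sketch below is one possible form, not the authors' implementation: the unweighted sum of the two mean-squared residuals is an assumption, and the inputs are plain lists standing in for sub-network outputs and their normal derivatives. The conductivities are taken from Table 6.1.

```python
KAPPA_FLUID, KAPPA_SOLID = 1.0, 5.0  # thermal conductivities from Table 6.1


def interface_loss(t_sw, t_fw, dtdn_sw, dtdn_fw):
    """Mean-squared residuals of Eqs. (6.28)-(6.29) over interface samples.

    t_sw, t_fw       : solid-side and fluid-side temperatures at the same points
    dtdn_sw, dtdn_fw : normal derivatives n . grad(T) on each side
    """
    n = len(t_sw)
    temperature_term = sum((ts - tf) ** 2 for ts, tf in zip(t_sw, t_fw)) / n
    flux_term = sum(
        (KAPPA_SOLID * gs - KAPPA_FLUID * gf) ** 2
        for gs, gf in zip(dtdn_sw, dtdn_fw)
    ) / n
    return temperature_term + flux_term  # equal weighting is an assumption


# Consistent interface data: equal temperatures and matching heat fluxes
consistent = interface_loss(
    t_sw=[40.0, 42.0], t_fw=[40.0, 42.0],
    dtdn_sw=[-2.0, -3.0], dtdn_fw=[-10.0, -15.0],
)
# A 1 degC temperature jump at every point adds 1.0 to the loss
mismatched = interface_loss(
    t_sw=[41.0, 43.0], t_fw=[40.0, 42.0],
    dtdn_sw=[-2.0, -3.0], dtdn_fw=[-10.0, -15.0],
)
print(consistent, mismatched)
```

Because κ_solid is five times κ_fluid here, flux continuity requires the fluid-side normal gradient to be five times the solid-side one, which is why the consistent case evaluates to zero.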

6.3.3 Results

The neural network training and CFD simulation are implemented on the TensorFlow-based NVIDIA Modulus [46] and COMSOL Multiphysics® 5.5 [62], respectively. Each sub-network of the hybrid PINN model consists of six hidden layers with 256 neurons in each layer. A minimum of 1 × 10⁶ training steps for flow_net, and 1 × 10⁶ for both heat_net_fluid and heat_net_solid, are required to guarantee the convergence of the neural networks. The corresponding training times are 38.3 and 75.6 hours. After training, the obtained networks are transferred to the MATLAB 2021b platform for implementing the optimization and decision-making algorithms. These computational models were solved on a workstation with two Intel Xeon E5-2695v4 CPUs @ 2.1 GHz, 96 GB RAM, and two NVIDIA GeForce RTX 3070 GPUs, running the Ubuntu Linux operating system.

A schematic diagram for a hybrid Physics-Informed Neural Network P I N N strategy to simulate conjugate heat transfer processes. The flow network takes input variables like the geometry and boundary conditions and outputs the velocity u and pressure p fields. The heat net fluid takes the velocity field u and other relevant parameters as input and outputs the temperature T subscript fluid. The heat net solid takes the boundary conditions and other relevant parameters as input and outputs the temperature T subscript solid. — **Figure 6.4** Schematic of the hybrid PINN strategy for simulating the conjugate heat transfer processes.

Source: [42]/with permission of Elsevier.

The performance of the PINN-derived surrogate model in predicting the behavior of the conjugate heat transfer process is evaluated against the results obtained from CFD simulations. To facilitate the model validation, a benchmark design is used with representative conditions (H = 0.4 m, D = 0.1 m, L_ctrl = 1 m, L_sd = 1 m, and U_inlet = 1 m s⁻¹). To quantitatively measure the divergence of the field distributions, the normalized mean absolute percentage error (NMAPE [63]) is employed as an evaluation index:

(6.34) $\mathrm{NMAPE} = 100\% \times \dfrac{1}{N_{i}} \sum_{i=1}^{N_{i}} \dfrac{\left| \varphi_{i,\mathrm{CFD}} - \varphi_{i,\mathrm{PINN}} \right|}{\max \left( \varphi_{i,\mathrm{CFD}} \right) - \min \left( \varphi_{i,\mathrm{CFD}} \right)}$

where φ denotes the evaluated state variable, and the subscripts CFD and PINN denote the results predicted by the CFD simulation and the PINN-derived surrogate model, respectively.
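Eq. (6.34) is straightforward to evaluate once both fields are sampled at the same points. The short sketch below shows one way to do so; the function name and the sample values are illustrative only.

```python
def nmape(phi_cfd, phi_pinn):
    """Normalized mean absolute percentage error of Eq. (6.34), in percent."""
    if len(phi_cfd) != len(phi_pinn):
        raise ValueError("both fields must be sampled at the same points")
    span = max(phi_cfd) - min(phi_cfd)  # range of the CFD reference field
    if span == 0:
        raise ValueError("reference field is constant; NMAPE is undefined")
    total_abs_error = sum(abs(c - p) for c, p in zip(phi_cfd, phi_pinn))
    return 100.0 * total_abs_error / (len(phi_cfd) * span)


# Illustrative values: four sample points of some field variable
reference = [0.0, 1.0, 2.0, 4.0]
prediction = [0.1, 0.9, 2.2, 4.0]
error_percent = nmape(reference, prediction)
print(f"NMAPE = {error_percent:.2f}%")
```

Normalizing by the range of the reference field, rather than by each pointwise value, keeps the metric well defined where the field passes through zero, as a velocity component does at a no-slip wall.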

The NMAPE statistics for the distributed velocity, pressure, and temperature are listed in Table 6.3. The PINN-derived surrogate model achieves acceptable prediction accuracy, with errors of 0.40–1.14% for velocity, 1.53% for pressure, and 0.38–4.17% for temperature. Besides, the thermal and hydraulic behaviors of the heat sink system in terms of the pressure drop, the mean surface temperature, and the peak temperature on the heat source are also used to validate the PINN-derived surrogate model. The PINN-derived results for these indicators are 9.178 Pa, 70.34 °C, and 79.38 °C, which deviate by only 0.45%, 2.66%, and 1.64% from the reference CFD results, respectively. From the comparison, it is concluded that the PINN-derived surrogate model offers good quantitative agreement and sufficient confidence to describe the underlying behaviors of the conjugate heat transfer process inside the heat sink system.

Based on the surrogate model, the design optimization of the heat sink system was performed using the NSGA-II algorithm, with the initial population and the maximum number of generations set to 40 and 30, respectively. Figure 6.5 presents the Pareto-optimal solutions of the system under fixed inlet air velocities, which cover the passive cooling mode (U_inlet = 0 m s⁻¹) and the active cooling mode (U_inlet > 0 m s⁻¹). The corresponding optimal geometric dimensions and operating conditions are provided in Figure 6.6. Note that the solutions on the Pareto-optimal curve are all non-dominated and feasible, indicating that the heat source surface temperature is minimized relative to the specified pressure drop limit. In other words, these solutions are all optimal options that balance cooling capacity against energy cost. Selecting the most preferred solution from these options is therefore critical for designing the heat sink system. Here, we consider three representative scenarios using the TOPSIS method: high-performance design, equilibrium design, and low-cost design. These scenarios correspond to weighting factors of 0.8 (0.2), 0.5 (0.5), and 0.2 (0.8) allocated to the objective T_mean (Δp) in the decision-making process.
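The effect of the weighting factors can be illustrated with a compact TOPSIS implementation. This is a minimal sketch under stated assumptions: it uses the common vector-normalization variant of TOPSIS (the chapter does not state which normalization was used), treats both objectives as costs, and operates on a hypothetical five-point Pareto front rather than the actual results of Figure 6.5.

```python
import math


def topsis_scores(points, weights):
    """Closeness scores in [0, 1]; every criterion is a cost (lower is better)."""
    n_crit = len(weights)
    # Vector normalization of each criterion, followed by weighting
    norms = [math.sqrt(sum(p[j] ** 2 for p in points)) for j in range(n_crit)]
    weighted = [[weights[j] * p[j] / norms[j] for j in range(n_crit)] for p in points]
    ideal = [min(row[j] for row in weighted) for j in range(n_crit)]
    anti_ideal = [max(row[j] for row in weighted) for j in range(n_crit)]
    scores = []
    for row in weighted:
        d_ideal = math.dist(row, ideal)
        d_anti = math.dist(row, anti_ideal)
        scores.append(d_anti / (d_ideal + d_anti))
    return scores


def pick(points, weights):
    """Index of the alternative with the highest closeness score."""
    scores = topsis_scores(points, weights)
    return max(range(len(points)), key=scores.__getitem__)


# Hypothetical Pareto front: (T_mean [degC], delta_p [Pa])
pareto = [(90.0, 5.0), (80.0, 10.0), (72.0, 20.0), (68.0, 35.0), (66.0, 60.0)]

high_performance = pick(pareto, (0.8, 0.2))
equilibrium = pick(pareto, (0.5, 0.5))
low_cost = pick(pareto, (0.2, 0.8))
print(pareto[high_performance], pareto[equilibrium], pareto[low_cost])
```

Shifting weight toward T_mean moves the preferred solution toward lower temperature and higher pressure drop, which is the trade-off that the three scenarios are meant to span.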

Table 6.3 Error evaluation between PINN and CFD solutions.

Field variable	u_x (m s⁻¹)	u_y (m s⁻¹)	u_z (m s⁻¹)
NMAPE	0.88%	1.14%	0.40%
Field variable	p (Pa)	T_fluid (°C)	T_solid (°C)
NMAPE	1.53%	4.17%	0.38%
Indicator	Δp (Pa)	T_mean (°C)	T_peak (°C)
CFD	9.137	68.51	78.10
PINN	9.178	70.34	79.38
Relative error	0.45%	2.66%	1.64%

A graph of temperature ranges from 60 to 95 versus pressure drop ranges from 0 to 63. It represents a plot graph with different shades labeled as u subscript inlet = 0 ms power -1 are passive cooling, 1 ms power -1, 2 ms power -1, 3 ms power -1 are active cooling. — **Figure 6.5** The Pareto front of multi-objective optimization under fixed inlet air velocities.

Source: [42]/with permission of Elsevier.

A set of six graphs plotted against the Pareto solutions (0 to 40), each with curves for U_inlet = 0, 1, 2, and 3 m s⁻¹. A. Fin height, ranging from 0 to 0.8. B. Fin thickness, ranging from 0 to 0.8. C. Inlet air velocity, ranging from 0 to 3.5. — **Figure 6.6** The optimal results of geometric dimensions and operating conditions under fixed inlet air velocities. (a) fin height, (b) fin thickness, (c) central fin length, (d) side fin length, (e) inlet air velocity, (f) wall area.

Though it requires 38.3 + 75.6 = 113.9 hours to train the networks, the time cost of the subsequent multi-objective optimization and decision-making is almost negligible, as they take less than 10 seconds. This brings a huge advantage over the traditional trial-and-error or empirical methods. For example, to obtain similar optimal solutions (e.g. active cooling, U_inlet = variable), at least 6⁴ = 1296 CFD runs are required (four variables, each with six evenly distributed values in its range). A single CFD run takes about 10 minutes, considering the time for mesh regeneration, communication, and computation. The total computational time required by CFD simulation is therefore up to 1296 × 10 minutes = 216 hours. Besides, the traditional methods require additional time to screen the desired solutions from these 1296 candidates by analyzing the CFD simulation results. In contrast, the proposed inverse design method starts directly with the desired thermal and hydraulic objectives and works backward to identify the optimal geometry and corresponding operating conditions. It should be pointed out that a further increase in the number of geometric variables or their values would amplify the advantages of the proposed method in terms of time cost and computational efficiency.
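The cost comparison above is simple arithmetic, and writing it out shows how quickly a full-factorial CFD sweep grows. The sketch below reproduces the figures quoted in the text; the five-variable case is an added hypothetical illustration, and it assumes the CFD runs are executed one after another.

```python
def cfd_sweep_cost(levels, n_variables, minutes_per_run=10.0):
    """Number of runs and total hours for a full-factorial CFD sweep."""
    runs = levels ** n_variables
    return runs, runs * minutes_per_run / 60.0


pinn_training_hours = 38.3 + 75.6  # flow_net plus the two heat networks

runs_4, hours_4 = cfd_sweep_cost(levels=6, n_variables=4)  # case quoted in the text
runs_5, hours_5 = cfd_sweep_cost(levels=6, n_variables=5)  # hypothetical extra variable

print(f"PINN training: {pinn_training_hours:.1f} h (one-off)")
print(f"4 variables: {runs_4} runs, {hours_4:.0f} h")
print(f"5 variables: {runs_5} runs, {hours_5:.0f} h")
```

Adding one variable multiplies the sweep cost by the number of levels, whereas the trained surrogate is evaluated in seconds. How the PINN training time itself scales with extra inputs is not quantified in the text, so this sketch leaves it fixed.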

In the passive cooling mode, the heat generated by the heat source can only dissipate through thermal conduction since there is no forced airflow. In Figure 6.5, the mean temperature of the heat source surface exactly coincides with its peak temperature at T_peak = 91.53 °C and Δp = 8.90 Pa, which exceeds the maximum allowable junction temperature (T_max = 90 °C). As a result, the passive cooling mode would render the operation of the heat source inefficient and short-lived due to long-term overheating. In the active cooling mode, by contrast, the co-presence of thermal convection and conduction effectively reduces the thermal resistance between the fins and the surrounding airflow, which improves the cooling capability of the heat sink system. All generated Pareto-optimal solutions lie in the safe region, and even the peak temperatures are below 90 °C. The mean temperature of the heat source surface is lower than its peak temperature by 8–10 °C, while the two exhibit similar trends. As the pressure drop increases from 0 to 27 Pa, both the mean temperature and the peak temperature drop markedly, particularly at high inlet air velocity (e.g. U_inlet = 3 m s⁻¹). Once the pressure drop exceeds 27 Pa, the temperature decline becomes very slow and almost all the Pareto-optimal curves overlap, indicating that the change in inlet air velocity has a negligible impact on the thermal performance of the heat sink system.

From Figure 6.6a–d, it is clear that in the passive cooling mode, the geometric dimensions remain unchanged at the upper limits of their allowable ranges (H = 0.6 m, D = 0.15 m, L_ctrl = 1.0 m, L_sd = 1.0 m). The resulting wall area is up to 5.38 m² for maximizing heat removal. In the active cooling mode, although the fin thickness remains almost unchanged at D = 0.15 m, the fin height exhibits a strong linear growth trend from 0 to 0.6 m across the Pareto solutions. This growth trend suggests that the evolutionary algorithm trades off the conflicting objectives primarily by adjusting the fin height of the heat sink system. After the fin height, the fin length appears to play a secondary role in the multi-objective optimal design of the heat sink system; in general, the total fin length (L_tot = 2L_sd + L_ctrl) increases from 1.6 to 2.8 m along the Pareto-optimal curve. Due to the growth of fin height and length, as seen in Figure 6.6f, the wall area of the fins in the active mode also grows linearly from 0 to 3.90 m².

Figure 6.7 visualizes the distributed flow and pressure for the representative scenarios at U_inlet = 2.0 m s⁻¹. Due to the different serrated shapes of the fins, the flow field exhibits distinct patterns as the air passes through the heat sink system. For example, the height and total length of the spaced fins rise from 0.14 and 1.70 m for the low-cost design to 0.45 and 2.30 m for the high-performance design, respectively. The increase in fin size reduces the flow gap between the heat sink system and the channel wall. Note that, as the air flows past the heat sink system, it must change direction, which creates different hydrodynamic zones in front of and behind the system. A stagnation zone is formed where the air encounters the heat sink system, in which zero velocity (Figure 6.7a–c) and maximum pressure (Figure 6.7d) are observable. Similarly, the blockage in the flow direction causes a decrease in air velocity and an increase in air static pressure, forming a wake zone behind the heat sink system. After that, vortex flow is created by the separation and reattachment of the airflow. Overall, the stagnation and wake zones of the high-performance design are 1.5 to 3.0 times larger than those of the low-cost design.

A comparison of flow and pressure fields among three different scenarios (high-performance, equilibrium, and low-cost) at an inlet velocity of U_inlet = 2.0 m s⁻¹ in a cross-section. A. u_x, the x-component of the velocity field, with a color scale ranging from 0 to 5 m/s. B. u_y, the y-component of the velocity field, with a color scale ranging from -0.4 to 0.8 m/s. C. u_z, the z-component of the velocity field. D. p, the pressure field, with a color scale ranging from 0 to 35 Pa. — **Figure 6.7** Comparison of flow and pressure fields among the representative scenarios at U_inlet = 2.0 m s⁻¹, in the cross-section (z = −0.2 m) of the coordinate system.

A. T_fluid, the fluid temperature field, for the high-performance, equilibrium, and low-cost designs, with a color scale ranging from 20 to 36 degrees Celsius. B. T_solid, the solid temperature field, for the high-performance, equilibrium, and low-cost designs, with a color scale ranging from 30 to 80 degrees Celsius. — **Figure 6.8** The optimal temperature field of the heat sink system at U_inlet = 2.0 m s⁻¹, in the cross-section (z = −0.2 m; x = −0.5 m) of the coordinate system. (a) fluid temperature, (b) solid temperature.

As mentioned above, increasing the size of the fins enlarges the stagnation and wake zones, which increases thermal resistance as the air contacts the heat sink system. Despite this, the increased height and length of the fins enlarge the heat transfer area, thereby helping the inlet air to remove more heat from the heat source. Furthermore, at the same inlet velocity, the increase in fin size forces the majority of the inlet air to pass through the reduced gap between the heat sink system and the channel wall. As shown in Figure 6.7a, this in turn enhances the velocity component u_x according to the Bernoulli equation. The combined increase in heat transfer area and flow velocity contributes to an overall improvement in heat transfer efficiency. At U_inlet = 2.0 m s⁻¹, the total heat transfer area and coefficient increase from 1.84 m² and 5.91 W m⁻² K⁻¹ for the low-cost design to 3.72 m² and 6.29 W m⁻² K⁻¹ for the high-performance design, respectively. Correspondingly, the mean temperatures of the heat source surface and the fin wall decline from 70.68 to 66.33 °C and from 34.13 to 26.75 °C, respectively. These results are consistent with the change in temperature fields seen in Figure 6.8. On the other hand, the improvement in heat removal comes at the cost of a greater pressure drop, which increases by a factor of about 2.1, from 12.19 Pa for the low-cost design to 25.16 Pa for the high-performance design. By comparison, the equilibrium design offers a more moderate option, with a mean temperature of 68.12 °C and a pressure drop of 16.27 Pa.

6.4 Illustrative Example 2: Tubular Air Cooler Model

6.4.1 System Description and Objectives

In the second example, we utilize a tubular air cooling system to demonstrate the potential of PINN in conjunction with transfer learning. As seen in Figure 6.9a, the air cooler system consists of eight rows and eight columns of plain-type tubes in a staggered configuration. Figure 6.9b presents the corresponding cross-section of these tube bundles, where the region enclosed by the red dashed line is defined as the computational domain due to the symmetry of the geometry. The tube bundles can be divided into multiple heat exchange units, and each unit is composed of four copper tubes that have the same centerline spacing (Figure 6.9c). To accurately characterize this specific topology, in Figure 6.9d, we define the key geometric variables by using three tube pitch parameters, namely the transverse tube pitch (S_T), longitudinal tube pitch (S_L), and cross tube pitch (S_C). S_C is a newly introduced geometric parameter and is defined as a multiple of S_L, i.e., S_C = F × S_L, F ∈ [−1, 1], where F is the ratio between them. Given the same tube diameter (D), the combination of these geometric variables results in countless tube bundle arrangements, which are our quantity of interest since they inherently determine the thermal and hydraulic behaviors of the system.

A. A diagram depicts a three-dimensional view of a heat exchanger with multiple tubes arranged in a specific pattern. B. A diagram illustrates the top of the heat exchanger, indicating the directions of the air inlet and outlet as well as the arrangement of the tubes. C. A diagram represents the symmetry in the design of the tube walls. D. A diagram represents different tube arrangements labeled as rotated square, regular triangle, square. — **Figure 6.9** Illustrations of the geometry of the air-cooled heat exchanger system. (a) Schematic view. (b) Cross-section of the tube bundle. (c) Computational domain with assigned boundary conditions. (d) Geometric variable definition and three representative topology designs that are termed rotated square, regular triangle, and standard square, here ϕ is the intersection angle between tube centers.

The uncertain variables related to the inlet air can complicate the operation and management of the heat exchanger system in practice. In Figure 6.9a, the ambient air flows upwards and traverses the outer wall of the tube bundles, while the hot fluid pumped from the main tube is evenly divided and enters the tube bundles. A forced convective heat transfer process occurs inside the heat exchanger, where the sensible heat of the hot fluid is indirectly absorbed by the upward airflow outside the tube bundles. In this process, the temperature drop of the hot fluid relies only on sensible heat exchange with the inlet air to achieve the required cooling capacity. This means that the thermal properties of the inlet air can strongly impact the characteristics of fluid flow and heat transfer inside the system. Specifically, the dry-bulb temperature of the inlet air, which depends on fluctuating weather conditions, is considered an uncertain variable, while the velocity of the inlet air is the major controllable variable that can mitigate the impact of the uncertainty and improve the system’s behavior. To obtain the optimal geometric and operating variables while fully accounting for the impacts of weather uncertainty, a stochastic optimization is conducted in this example. The ranges of variation for the geometric, operating, and uncertain variables are listed in Table 6.4.

Table 6.4 The input space of different variables.

Input variables	Symbol	Unit	Lower bound	Upper bound
Geometric variables
Transverse tube pitch	S_T	m	2D	4D
Longitudinal tube pitch	S_L	m	3D	5D
Cross tube pitch	S_C	m	−5D	5D
Tube diameter	D	m	0.01
Operating variables
Velocity of inlet air	U_in	m s⁻¹	1	2
Uncertain variables
Dry-bulb temperature of inlet air	T_in	K	259.15	306.15

As shown in Figure 6.10, the reverse design starts with a decomposition of the geometric space, generating a batch of discrete geometric designs through the Halton sequence sampling method. That is, the entire design space is represented using M discrete geometric designs ℊ⁽ᵐ⁾, m ∈ {1, …, M}, and each design ℊ⁽ᵐ⁾ represents a combined set of geometric variables. Meanwhile, the uncertainty of weather conditions is realized by extracting N uncertain samples that can represent the actual environment, and these samples are fed into each optimization model. We employ a non-intrusive sampling approach called the stochastic reduced-order model (SROM, see details in Chapter 8) to generate a finite set of stochastic samples with varying probabilities, where the nth sample, n ∈ {1, …, N}, pairs a realization of the operating variable, ℴ⁽ⁿ⁾, with a realization of the uncertain variable.
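A Halton sequence fills the design space more evenly than independent random draws, which is why it suits a small batch of geometric designs. The sketch below shows one way to generate such a batch from the bounds in Table 6.4. It is an illustration under assumptions, not the authors' code: the ratio F is sampled in [−1, 1] and S_C is derived as F × S_L, and the bases (2, 3, 5) and the batch size are arbitrary choices.

```python
def radical_inverse(index, base):
    """Van der Corput radical inverse of a positive integer in the given base."""
    result, fraction = 0.0, 1.0 / base
    while index > 0:
        result += (index % base) * fraction
        index //= base
        fraction /= base
    return result


def halton(n_points, bases=(2, 3, 5)):
    """First n_points of the Halton sequence in the unit hypercube."""
    return [[radical_inverse(i, b) for b in bases] for i in range(1, n_points + 1)]


D = 0.01  # tube diameter [m], Table 6.4
BOUNDS = {"S_T": (2 * D, 4 * D), "S_L": (3 * D, 5 * D), "F": (-1.0, 1.0)}


def scale(unit_value, bounds):
    """Map a value from [0, 1] onto [lower, upper]."""
    lower, upper = bounds
    return lower + unit_value * (upper - lower)


def geometric_designs(n_designs):
    """Discrete geometric designs (S_T, S_L, S_C) sampled with a Halton sequence."""
    designs = []
    for u_t, u_l, u_f in halton(n_designs):
        s_l = scale(u_l, BOUNDS["S_L"])
        ratio = scale(u_f, BOUNDS["F"])
        designs.append({"S_T": scale(u_t, BOUNDS["S_T"]), "S_L": s_l, "S_C": ratio * s_l})
    return designs


designs = geometric_designs(8)
for design in designs[:3]:
    print({name: round(value, 4) for name, value in design.items()})
```

Because S_C is derived from S_L, every sampled design satisfies |S_C| ≤ S_L, which is consistent with the definition S_C = F × S_L given earlier.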

A set of three frameworks that combines sequential decomposition, Physics-Informed Neural Networks P I N Ns, and Transfer Learning T L for reverse design. The sequential decomposition step involves discretizing various variables geometric variables, operating variables, and uncertain variables. P I N N - T L involves leveraging knowledge from source P I N Ns to target P I N Ns. The stochastic optimization represents decision making, optimization objects, and surrogate models. — **Figure 6.10** The proposed framework combines sequential decomposition, PINN, and TL for reverse design. (a) sequential decomposition; (b) PINN-TL; (c) stochastic optimization.

In this example, if we construct a separate PINN model for each sample, there will be a total of M × N PINN models to train. For each geometric design, we randomly select a source model from the N PINN models and then leverage it to train the remaining P = N − 1 target models via transfer learning. The N PINN models obtained for each design can be concatenated to construct the surrogate models that map all input and output variables of interest. Here, we consider the following two conflicting objectives [64] that are often used in engineering design:

(6.35) $\text{Pressure drop:} \quad \Delta p = \bar{p}_{\mathrm{in}} - \bar{p}_{\mathrm{out}}$

(6.36) $\text{Nusselt number:} \quad Nu = \dfrac{h_{\mathrm{a}} D}{\lambda}$

where $\Delta p = \bar{p}_{\mathrm{in}} - \bar{p}_{\mathrm{out}}$ is the pressure drop between the inlet and outlet of the airflow in the computational domain, and h_a is the air-side convection heat transfer coefficient, which can be calculated from:

(6.37) $h_{\mathrm{a}} = \dfrac{Q_{\mathrm{a}}}{A \, \Delta T_{\mathrm{m}}} = \dfrac{\dot{m}_{\mathrm{a}} C_{p\mathrm{a}} \left( \bar{T}_{\mathrm{a,out}} - \bar{T}_{\mathrm{a,in}} \right)}{A \left\{ \left( \bar{T}_{\mathrm{a,out}} - \bar{T}_{\mathrm{a,in}} \right) / \ln \left[ \left( T_{\mathrm{wall}} - \bar{T}_{\mathrm{a,in}} \right) / \left( T_{\mathrm{wall}} - \bar{T}_{\mathrm{a,out}} \right) \right] \right\}}$

where A and Q_a are the total heat transfer area and the air-side heat transfer rate, ΔT_m is the logarithmic mean temperature difference between the tube wall and the air, upper T overbar Subscript normal a comma in and are the mean temperatures of air at inlet and outlet, T_wall is the temperature of the tube wall.
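As an illustration of Eqs. (6.35)–(6.37), the following minimal Python sketch evaluates the two objectives from averaged inlet and outlet quantities. The function name `air_side_metrics` and all numerical inputs are hypothetical placeholders chosen for the example; they are not values from this study.

```python
import math

def air_side_metrics(p_in, p_out, m_dot_a, cp_a, t_in, t_out, t_wall,
                     area, d_char, lam):
    """Return (pressure drop, h_a, Nu) following Eqs. (6.35)-(6.37)."""
    delta_p = p_in - p_out                      # Eq. (6.35)
    q_a = m_dot_a * cp_a * (t_out - t_in)       # air-side heat transfer rate
    # Logarithmic mean temperature difference between tube wall and air
    lmtd = (t_out - t_in) / math.log((t_wall - t_in) / (t_wall - t_out))
    h_a = q_a / (area * lmtd)                   # Eq. (6.37)
    nu = h_a * d_char / lam                     # Eq. (6.36)
    return delta_p, h_a, nu

# Hypothetical inputs in SI units, for illustration only
delta_p, h_a, nu = air_side_metrics(
    p_in=3.0, p_out=0.2, m_dot_a=0.05, cp_a=1006.0,
    t_in=300.0, t_out=310.0, t_wall=330.0,
    area=0.5, d_char=0.02, lam=0.026,
)
print(f"dp = {delta_p:.2f} Pa, h_a = {h_a:.2f} W/(m2 K), Nu = {nu:.2f}")
```

Because the air temperature rise appears in both the numerator and the logarithmic mean temperature difference, Eq. (6.37) is well defined only when the outlet air temperature stays below the wall temperature.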

We therefore consider two types of surrogate models regarding the bi-objectives of Nu and Δp of the system, which are embedded in the stochastic optimization models. These surrogate models are constructed by using the generalized polynomial chaos (gPC) expansion [43] based on the dataset obtained from the source and target PINN models. In this way, the original optimization model can be reformulated as a set of sampling-based stochastic nonlinear programming models. The goal is to obtain optimal solutions through the expected maximization of the distribution of the objectives, as given by:

(6.38)

$\boldsymbol{\Psi} \in \left[ \boldsymbol{\Psi}^{lb}, \boldsymbol{\Psi}^{ub} \right]$

$\forall m \in \{1, \ldots, M\}, \; \forall n \in \{1, \ldots, N\}$

where the subscript n marks the realizations of the operating and uncertain variables in a specific sample n; lb and ub are the corresponding lower and upper bounds for these variables; prob_n is the probability of occurrence of sample n; 𝚿 is the design constraint that requires the outlet temperature of the hot fluid to meet the cooling requirement; and f_m,k(·) is the surrogate model corresponding to the kth objective under the mth geometric design, which serves as the objective function depending on the operating and uncertain variables.

The Pareto solutions $\{\Delta p_{m,n}^{*}, Nu_{m,n}^{*}\}$ are generated by solving this optimization model M × N times, according to the numbers of uncertain samples and geometric designs. Similar to example 1, NSGA-II [54] is adopted to handle each optimization problem since it can efficiently search for the Pareto solutions. After that, the TOPSIS method is applied to identify the optimal operating conditions for each uncertain sample and to determine the expected values of the optimal objective functions $\{\mathbb{E}(\overline{\Delta p}_{m}^{*}), \mathbb{E}(\overline{Nu}_{m}^{*})\}$ for each geometric design. Finally, the most preferred solutions regarding the best-performing geometric design are obtained by re-applying the TOPSIS method.
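The expectation step described above can be sketched as a probability-weighted sum over the uncertain samples. This is a minimal illustration that assumes the expectation is taken as Σ prob_n × (optimal value in sample n); the helper name `expected_objective` and the four-sample data are hypothetical.

```python
def expected_objective(values, prob):
    """Probability-weighted mean of per-sample optimal objective values."""
    total = sum(prob)  # normalize in case the weights do not sum to one
    return sum(p * v for p, v in zip(prob, values)) / total

# Hypothetical per-sample optima for one geometric design (N = 4 samples)
prob_n = [0.1, 0.4, 0.3, 0.2]
dp_opt = [1.8, 2.5, 3.0, 3.5]       # optimal pressure drops, Pa
nu_opt = [20.0, 24.0, 27.0, 30.0]   # optimal Nusselt numbers

e_dp = expected_objective(dp_opt, prob_n)
e_nu = expected_objective(nu_opt, prob_n)
print(f"E(dp*) = {e_dp:.2f} Pa, E(Nu*) = {e_nu:.2f}")
```

Repeating this calculation for every geometric design gives one pair of expected objectives per design, which is the input to the final TOPSIS ranking.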

6.4.2 Improved PINN Structure

Herein, we propose a specialized segregated-network architecture to effectively decompose the standard PINN architecture into multiple small-size sub-networks. This segregation allows for a better distinction between the state variables of interest within the combined loss function. Figure 6.11 illustrates the segregated-network architecture for surrogate modeling of multi-physical fields in the heat exchanger system. As shown in this figure, the neural network representing this system is decomposed into two serial sub-networks (namely the flow sub-network and the heat sub-network) that take the spatial coordinate [x] = [x, y] as inputs and are designed to successively approximate the latent solutions of the Navier–Stokes and energy equations. According to the correlation between these two governing equations for incompressible flow, it is assumed that there is a one-way coupling between the fluid flow process and the heat transfer process. This assumption implies that once the fluid flow sub-network has been trained and satisfies the convergence criteria, the resulting velocity field {u*, v*} can be used as an intermediate input for training the heat transfer sub-network. Meanwhile, we disable the gradient computation for the velocity components, which prevents the parameters of the flow sub-network from being updated. In this way, it is possible to substantially speed up the learning of the multiphysics field distributions of the key state variables, including velocity, pressure, and temperature, i.e. [x] → [u, v, p], [x] → [T].

**Figure 6.11** A schematic diagram of the segregated-network PINN architecture for surrogate modeling of fluid flow and heat transfer behavior. For the governing equations of the heat exchanger system, the boundary conditions include differential, nonlinear, and identity terms, which could be subject to Dirichlet or Neumann conditions.

It is well known that the network architecture plays a crucial role in the prediction ability of PINN, particularly for multiphysics learning tasks. Here, we address the convective heat transfer modeling problem for the heat exchanger system by comparing three representative types of architecture, i.e. fully-connected network (FCN), Fourier network (FN), and modified Fourier network (MFN). The theories for the three network architectures are detailed in Ref. [43]. To compute arbitrary-order derivatives, a smooth and differentiable activation function is needed. Note that commonly used activation functions, such as ReLU, do not have continuous second-order derivatives. To address this issue, the Swish function with continuous derivatives, σ(x) = x × sigmoid(βx), with a fixed parameter β = 1, is employed in each layer except the last one [65]. To improve the convergence performance, the loss function of each PINN can take the form of a Monte Carlo integration approximation, since this keeps the loss per unit area consistent across the domain and ensures that the loss is minimized throughout the entire spatial domain.
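The point about second-order derivatives can be checked numerically. The sketch below, which is only an illustration and not part of the original implementation, compares the analytical second derivative of Swish (β = 1) with a central finite-difference estimate, and shows that ReLU carries no second-derivative information away from its kink. All function names are chosen for this example.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def swish(x, beta=1.0):
    """Swish activation: x * sigmoid(beta * x)."""
    return x * sigmoid(beta * x)

def swish_d2(x):
    """Analytical second derivative of Swish for beta = 1."""
    s = sigmoid(x)
    return s * (1.0 - s) * (2.0 + x * (1.0 - 2.0 * s))

def relu(x):
    return max(0.0, x)

def central_d2(f, x, h=1e-3):
    """Second derivative estimated by central finite differences."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)

# Swish: smooth, non-trivial curvature everywhere
for x in (-2.0, 0.0, 2.0):
    print(f"x = {x:+.1f}: analytical = {swish_d2(x):.6f}, "
          f"finite difference = {central_d2(swish, x):.6f}")

# ReLU: the second derivative vanishes on both sides of the kink at x = 0,
# so second-order PDE terms (e.g. diffusion) receive no signal from it
print(central_d2(relu, 1.0), central_d2(relu, -1.0))
```

For second-order governing equations such as the Navier–Stokes and energy equations, a vanishing second derivative means that the diffusion terms of the residual cannot be represented, which is why a smooth activation is preferred here.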

6.4.3 Transfer Learning

A generic description of transfer learning is as follows [66]: Given a source domain $\mathcal{D}_S = \{\mathcal{X}_S, P(X_S)\}$ and learning task $\mathcal{T}_S = \{\mathcal{Y}_S, f_S(\cdot)\}$, a target domain $\mathcal{D}_T = \{\mathcal{X}_T, P(X_T)\}$ and learning task $\mathcal{T}_T = \{\mathcal{Y}_T, f_T(\cdot)\}$, transfer learning aims to improve the learning of the target predictive function $f_T(\cdot)$ using the knowledge in $\mathcal{D}_S$ and $\mathcal{T}_S$, where $\mathcal{D}_S \neq \mathcal{D}_T$ or $\mathcal{T}_S \neq \mathcal{T}_T$; $\mathcal{X}$ and $\mathcal{Y}$ are the feature and label spaces; and $P(X)$ is a marginal probability distribution, where $X \in \mathcal{X}$. In this study, we perform design space exploration and operating parameter optimization tasks, which are based on constructing a surrogate model using the solutions obtained from a batch of PINN models. In theory, neural network solvers can transfer knowledge across these PINN models via transfer learning since both the source PINN and the target PINN originate from the decomposition of the same design space of the heat exchanger system. Here, we employ a parameter-transfer learning approach, which assumes that the source and target share partial parameters or prior distributions of the hyper-parameters of the source model, so that the knowledge acquired by a source PINN for a specific geometry is encoded into the shared parameters or priors. By discovering the shared parameters or priors, the acquired knowledge can be applied to the construction of target PINN models that have the same geometry and slightly different operating and uncertain conditions.

A schematic of the parameter-transfer learning approach is shown in Figure 6.12. First, the source PINN model is pre-trained with two serial sub-networks for fluid flow and heat transfer using the source sample set, which is composed of one subset for the domain (subscript f) and one for the boundary (subscript b). The parameters of these two sub-networks are tuned by minimizing the combined losses of the flow and heat sub-networks. The trained optimal network parameter set $\boldsymbol{\theta}_{\mathrm{S}}^{*} = [\boldsymbol{\theta}_{\mathrm{S,Flow}}^{*}, \boldsymbol{\theta}_{\mathrm{S,Heat}}^{*}]$ is then used to initialize the network parameters of each target PINN model. Based on the target sample set, which has the same two-part structure, the initialized target PINN model is re-trained to cope with the changes in operating and uncertain conditions without training the network from scratch. In this stage, the parameters of the two corresponding sub-networks in the target PINN models are fine-tuned by minimizing their combined losses with a limited number of iterations, a smaller learning rate, and a faster learning rate decay, as listed in Table 6.5. Iteratively, $\boldsymbol{\theta}_{\mathrm{S}}^{*}$ is updated, and the optimal parameter set of the target model is obtained at the end of this stage.

**Figure 6.12** Schematic representation of the proposed parameter-transfer learning between source PINN and target PINN models.

Table 6.5 Hyperparameter settings of the source and target PINN models.

Parameters	Source PINN (flow)	Source PINN (heat)	Target PINN
No. neurons	256	256	256
No. layers	6	6	6
Learning rate schedule	Exponential decay	Exponential decay	Exponential decay
Learning rate	5 × 10⁻⁴	5 × 10⁻⁴	1 × 10⁻⁴
Decay step	15 000	8000	3000
No. iterations	1.5 × 10⁶	0.8 × 10⁶	0.3 × 10⁶
Activation function	Swish	Swish	Swish
Optimizer	Adam	Adam	Adam
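To make the schedules in Table 6.5 concrete, the sketch below evaluates an exponentially decaying learning rate of the common form lr = lr₀ × r^(step / decay step). The decay rate r is not given in the table, so the value 0.95 used here is purely an assumption, as is the continuous (non-staircase) form of the schedule.

```python
def exponential_decay(step, lr0, decay_step, decay_rate=0.95):
    """Learning rate after `step` iterations (decay_rate is an assumed value)."""
    return lr0 * decay_rate ** (step / decay_step)

# Initial learning rate, decay step, and iteration count from Table 6.5
settings = {
    "source flow": dict(lr0=5e-4, decay_step=15_000, iterations=1_500_000),
    "source heat": dict(lr0=5e-4, decay_step=8_000, iterations=800_000),
    "target": dict(lr0=1e-4, decay_step=3_000, iterations=300_000),
}

final_lr = {}
for name, s in settings.items():
    n_decays = s["iterations"] / s["decay_step"]
    final_lr[name] = exponential_decay(s["iterations"], s["lr0"], s["decay_step"])
    print(f"{name:12s} decay periods = {n_decays:.0f}, final lr = {final_lr[name]:.3e}")
```

With the tabulated values, every model goes through the same number of decay periods (100); the target models simply traverse them in far fewer iterations and start from a learning rate five times smaller.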

Although space decomposition and transfer learning make the multiphysics problem trainable, the final computational time may still far exceed what is acceptable for engineering design. In the PINN framework, Monte Carlo sampling of spatial coordinates is employed to generate residual points, which serve as the input dataset for the neural network. These residual points play a role similar to the meshes used in traditional numerical methods. Increasing the number of residual points can improve computational accuracy, but it also leads to longer training times and increased computational load. In transfer learning, the data size of the source domain is generally much larger than that of the target domain, |X_S| ≫ |X_T|. It can therefore be inferred that the number of residual points provided to the target model can be reduced to shorten the computational time. However, the appropriate number of residual points for training the target PINN models needs to be investigated. Herein, we propose a point density adjustment (PDA) strategy that can quickly identify an optimal ratio of the sample size in the domain to that on the boundary by comparing different sampling schemes. For example, the number of residual points on the boundary is kept the same as in the source model, while the proportion of interior residual points is gradually reduced.
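A minimal sketch of how such sampling schemes could be generated is given below. It assumes a rectangular channel with a single circular tube and samples only the tube wall as the boundary; the geometry, the point counts, and the helper names are all hypothetical and serve only to illustrate the idea of holding the boundary set fixed while thinning the interior set.

```python
import math
import random

def sample_interior(n, lx, ly, cx, cy, r, rng):
    """Uniform Monte Carlo points in [0, lx] x [0, ly] outside a circular tube."""
    pts = []
    while len(pts) < n:
        x, y = rng.uniform(0.0, lx), rng.uniform(0.0, ly)
        if math.hypot(x - cx, y - cy) > r:  # rejection: skip points inside the tube
            pts.append((x, y))
    return pts

def sample_tube_wall(n, cx, cy, r, rng):
    """Uniform Monte Carlo points on the tube wall (circle of radius r)."""
    pts = []
    for _ in range(n):
        a = rng.uniform(0.0, 2.0 * math.pi)
        pts.append((cx + r * math.cos(a), cy + r * math.sin(a)))
    return pts

rng = random.Random(0)
lx, ly, cx, cy, r = 0.12, 0.04, 0.03, 0.02, 0.01   # hypothetical geometry, m
n_interior_source, n_boundary = 4000, 500           # hypothetical source sizes

schemes = []
for ratio in (1.0, 0.5, 0.25, 0.125):
    interior = sample_interior(int(n_interior_source * ratio), lx, ly, cx, cy, r, rng)
    boundary = sample_tube_wall(n_boundary, cx, cy, r, rng)  # size kept constant
    schemes.append((ratio, interior, boundary))
    print(f"interior ratio {ratio:5.3f}: {len(interior)} interior, {len(boundary)} boundary")
```

Each scheme would then be used to fine-tune a target model, and the smallest interior set that still meets the accuracy requirement would be retained.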

Note that the presence of numerous pointwise operations in network learning can put heavy pressure on the memory subsystem of a GPU. To streamline the training process, we further propose the use of a domain-specific compiler called accelerated linear algebra (XLA) [67, 68] in conjunction with transfer learning. XLA enables kernel fusion and just-in-time compilation of TensorFlow graphs. With XLA, a batch of pointwise operations can be fused and executed in a single kernel, reducing the number of memory transfers from GPU memory to the computation units.

6.4.4 Results

The NMAPEs for the velocity, pressure, and temperature predicted by the PINN model with different network architectures are listed in Table 6.6. These results clearly show that both the FN and MFN architectures outperform the FCN architecture. For example, with the MFN architecture the y-velocity and temperature NMAPEs decrease significantly, from 32.99% to 4.60% and from 64.14% to 3.15%, respectively, compared with the FCN case. This reduction reflects that Fourier-derived architectures handle strongly nonlinear problems considerably better than the regular FCN architecture because they can effectively reduce the impact of spectral bias [69] and are able to capture steep gradient variation (i.e. near the tube wall). Compared with the FN architecture, the MFN architecture further improves the prediction accuracy, e.g. the NMAPEs of x-velocity, y-velocity, pressure, and temperature drop from 2.57% to 1.54%, 9.47% to 4.60%, 10.60% to 6.80%, and 6.22% to 3.15%, respectively. From these results, it is concluded that the PINN model with the MFN architecture describes the underlying behaviors of fluid flow and heat transfer inside the heat exchanger system under consideration more accurately.

The aforesaid best-performing PINN trained with the MFN architecture is used as the source model for implementing parameter-transfer learning. Herein, for each geometric design, a total of 12 groups of stochastic samples from the combined operating and uncertain spaces are generated according to the requirement for constructing high-fidelity surrogate models. From these samples, we randomly select only one as the source PINN model, while the rest are the target PINN models. As previously mentioned, the target PINN models are fine-tuned using parameter-transfer learning, as opposed to being trained from scratch. Taking the rotated square design as an example, the source PINN model is fully trained (full run) with a total of 2.3 × 10⁶ iterations (fluid flow 1.5 × 10⁶, heat transfer 0.8 × 10⁶), as shown in Figure 6.13. The training losses of the target PINN models in the transfer learning runs converge much faster, within 3 × 10⁵ iterations for both the fluid flow and heat transfer sub-networks, which reduces the training time by a factor of 3.73 (the average total training time of a single target PINN model is 20.35 hours).

Table 6.6 NMAPEs for the trained PINN models using different network architectures.

Outputs	FCN	FN	MFN
u (m s⁻¹)	6.67%	2.57%	1.54%
v (m s⁻¹)	32.99%	9.47%	4.60%
p (Pa)	16.95%	10.60%	6.80%
T (K)	64.14%	6.22%	3.15%

**Figure 6.13** Training losses at different air velocities and temperatures with/without transfer learning. (a) Fluid flow and (b) heat transfer. The model training was performed using an HP Precision T7920 workstation with the following basic configuration: NVIDIA 3090 GPU, Intel(R) Xeon E5-6230 CPU @ 2.30 GHz, 128 GB RAM, 4 TB SSD.

Although transfer learning performs well in improving convergence rates, the total training time is still unacceptable for a complex system since hundreds of target PINN models need to be trained. In this example, the decomposition strategy sequentially generates 30 geometric designs and 12 stochastic samples; the total training time for the decomposed PINN models is expected to be up to 75.82 hours × 30 × 12 = 27 295 hours (3.16 years). As shown in Figure 6.14b, by combining with parameter-transfer learning, the expected total training time can be preliminarily reduced by a factor of 3.12, from 27 295 to 8748 hours. On this basis, we can further cut the training time to 3133 hours by using the PDA and XLA approaches simultaneously in transfer learning, as shown in Figure 6.14c,d. Despite this significant reduction, the total computational cost is still prohibitive in practice. With the support of the NVIDIA TensorFlow container, parallel processing using a multi-GPU configuration has been introduced to the training process of this example. In Figure 6.14e, taking the use of two NVIDIA 3090 GPUs as an example, the training time is reduced by a factor of about 3.6, from 3133 to 871 hours. Furthermore, one great advantage of transfer learning is that the source and target models are independent of each other, so they can be deployed to run in parallel on multiple workstations. In Figure 6.14f, when these target models are evenly assigned to four nodes, the resulting training time is further reduced to 326 hours (13.6 days, without considering the overhead time between nodes).
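The reduction factors quoted above follow directly from the reported training times, as the short calculation below confirms. Only numbers stated in the text are used; the dictionary labels are informal names chosen for this sketch.

```python
hours_per_model = 75.82          # full training of one PINN model, hours
n_designs, n_samples = 30, 12

# Baseline: every decomposed PINN model is trained from scratch
baseline = hours_per_model * n_designs * n_samples

# Reported total training times after each acceleration step, hours
reported = {
    "PINN-TL": 8748.0,
    "PINN-TL + PDA + XLA": 3133.0,
    "multi-GPU": 871.0,
    "multi-workstation": 326.0,
}

speedups = {}
previous = baseline
for name, hours in reported.items():
    speedups[name] = previous / hours   # step-wise reduction factor
    print(f"{name:22s} {hours:7.0f} h  (x{speedups[name]:.2f} vs previous step)")
    previous = hours

overall = baseline / reported["multi-workstation"]
print(f"baseline = {baseline:.0f} h, overall reduction = x{overall:.1f}, "
      f"final = {reported['multi-workstation'] / 24:.1f} days")
```

The step-wise factors multiply to the overall reduction, which makes it easy to see how much each measure contributes.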

Figure 6.15a presents the optimal solutions regarding the expected values of Nu and Δp obtained from multi-objective stochastic optimization. By applying the TOPSIS method, the most preferred solution is 𝓰⁽²⁷⁾, where $\mathbb{E}(\overline{Nu}^{*})$ = 25.38 and $\mathbb{E}(\overline{\Delta p}^{*})$ = 2.81 Pa. For a better comparison, three representative topology designs (see Figure 6.9d) often used in actual engineering design and manufacturing are also evaluated, namely rotated square (𝓰^RS, S_T = 0.5S_L, S_C = 0.5S_L, ϕ = 45°), regular triangle (𝓰^RT, S_T = (√3/2)S_L, S_C = 0.5S_L, ϕ = 60°), and standard square (𝓰^SS, S_T = S_L, S_C = 0, ϕ = 90°). The expected Nusselt numbers for 𝓰^RS, 𝓰^RT, and 𝓰^SS are 26.92, 26.47, and 26.37, which are 6.07%, 4.29%, and 3.90% higher than that for 𝓰⁽²⁷⁾, respectively. Meanwhile, the expected pressure drop rises dramatically from 2.81 Pa for 𝓰⁽²⁷⁾ to 5.61 Pa for 𝓰^RS, 4.20 Pa for 𝓰^RT, and 3.51 Pa for 𝓰^SS. Figure 6.15b shows the relative closeness of each design point. Herein, a larger value of relative closeness means a shorter distance to the ideal solution and a longer distance from the nadir solution. 𝓰⁽²⁷⁾ has the largest relative closeness (0.91) among all geometric designs; therefore, it is recommended as the most preferred solution for the case study.
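The ranking logic can be illustrated with a compact TOPSIS sketch applied to the four designs whose expected objectives are quoted above. Equal weights and vector normalization are assumptions made for this example, and because only four alternatives are included, the resulting closeness values are not expected to reproduce those in Figure 6.15b, which are computed over all candidate designs.

```python
import math

def topsis(matrix, weights, benefit):
    """Relative closeness of each alternative (row) to the ideal solution."""
    n_crit = len(weights)
    # Vector-normalize each criterion column, then apply the weights
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_crit)]
    v = [[weights[j] * row[j] / norms[j] for j in range(n_crit)] for row in matrix]
    cols = list(zip(*v))
    # Ideal: best value per criterion; nadir: worst value per criterion
    ideal = [max(c) if benefit[j] else min(c) for j, c in enumerate(cols)]
    nadir = [min(c) if benefit[j] else max(c) for j, c in enumerate(cols)]
    closeness = []
    for row in v:
        d_pos = math.dist(row, ideal)
        d_neg = math.dist(row, nadir)
        closeness.append(d_neg / (d_pos + d_neg))
    return closeness

designs = ["g(27)", "RS", "RT", "SS"]
# Columns: expected Nusselt number (maximize), expected pressure drop in Pa (minimize)
matrix = [[25.38, 2.81], [26.92, 5.61], [26.47, 4.20], [26.37, 3.51]]

closeness = topsis(matrix, weights=[0.5, 0.5], benefit=[True, False])
best = designs[closeness.index(max(closeness))]
for name, c in zip(designs, closeness):
    print(f"{name:6s} relative closeness = {c:.3f}")
print("most preferred:", best)
```

Even in this reduced comparison, the small loss in Nusselt number is outweighed by the large gain in pressure drop, so the same design comes out on top.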

**Figure 6.14** Review of training time for PINN models of the heat exchanger system with (a) standard PINN with space decomposition, (b) PINN-TL without acceleration, (c) PINN-TL with PDA, (d) PINN-TL with PDA and XLA, (e) PINN-TL with PDA and XLA on multi-GPU configuration, (f) PINN-TL with PDA and XLA on multi-GPU and multi-workstation configuration, and (g) the corresponding total training time for the aforesaid methods. Note that the area of each square in (a) corresponds to a training time of 75.82 hours, while the training times for the squares in (a–f) are proportional to their areas.

**Figure 6.15** (a) The expected values of pressure drop and Nusselt number, (b) the relative closeness from TOPSIS analysis for each geometric design.

Figure 6.16 depicts the detailed geometry for 𝓰⁽²⁷⁾, 𝓰^RS, 𝓰^RT, and 𝓰^SS with respect to the optimal tube arrangement. As depicted, the transversal tube pitch in 𝓰⁽²⁷⁾ is S_T = 0.0369 m, which is greater than that in 𝓰^RS (S_T = 0.02 m), 𝓰^RT (S_T = 0.026 m), and 𝓰^SS (S_T = 0.03 m). Notably, 𝓰⁽²⁷⁾ has nearly the same longitudinal tube pitch (S_L = 0.0302 m) as 𝓰^RT (S_L = 0.03 m) and 𝓰^SS (S_L = 0.03 m). Among them, 𝓰^RS has the longest longitudinal tube pitch, S_L = 0.04 m. Different included angles between tubes ϕ can be formed by changing the cross tube pitch S_C. To be more specific, S_C = −0.0056 m in 𝓰⁽²⁷⁾, which forms an included angle of ϕ = 98.6°, greater than that in 𝓰^RS (S_C = 0.02 m, ϕ = 45°), 𝓰^RT (S_C = 0.015 m, ϕ = 60°), and 𝓰^SS (S_C = 0 m, ϕ = 90°).
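The pitches and angles listed above are consistent with the relation ϕ = atan2(S_T, S_C). This relation is inferred here from the reported values and is not stated explicitly in the text, so the sketch below should be read as a consistency check under that assumption.

```python
import math

def included_angle_deg(s_t, s_c):
    """Included angle in degrees, assuming phi = atan2(S_T, S_C)."""
    return math.degrees(math.atan2(s_t, s_c))

# (S_T, S_C) in metres, as reported for each design
pitches = {
    "g(27)": (0.0369, -0.0056),
    "RS": (0.02, 0.02),
    "RT": (0.026, 0.015),
    "SS": (0.03, 0.0),
}

angles = {name: included_angle_deg(s_t, s_c) for name, (s_t, s_c) in pitches.items()}
for name, phi in angles.items():
    print(f"{name:6s} phi = {phi:.1f} deg")
```

A negative cross tube pitch therefore corresponds to an included angle above 90°, which is what distinguishes the most preferred design from the three conventional layouts.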

In Figure 6.17a, we present the probability of occurrence for all 50 uncertain samples used in the stochastic optimization. As shown, the probabilities are widely scattered, ranging from 0.0012 to 0.0627. According to these samples, Figure 6.17b presents the optimal values of the inlet air velocity. The inlet air velocity under each geometric design shows a generally rising trend as the dry-bulb temperature of the inlet air increases. This is because the heat transfer driving force decreases when the air dry-bulb temperature increases, so the air velocity has to increase to meet the cooling target of the system. Besides, 𝓰⁽²⁷⁾, 𝓰^RS, 𝓰^RT, and 𝓰^SS have nearly the same optimal solutions, and the corresponding expected values are 1.67, 1.64, 1.66, and 1.65 m s⁻¹, respectively. It is seen from Figure 6.17c that the optimal pressure drops for all samples in 𝓰^RS, 𝓰^RT, and 𝓰^SS increase from 3.43 to 7.30, 2.49 to 5.34, and 2.07 to 4.49 Pa, respectively, which are higher than those of 𝓰⁽²⁷⁾, which grow from 1.79 to 3.52 Pa. This is mainly attributed to the fact that 𝓰⁽²⁷⁾ has a wider flow channel, leading to a smaller pressure drop. In Figure 6.17d, the optimal Nusselt numbers for all samples in 𝓰⁽²⁷⁾ increase from 20.01 to 30.03, which are slightly lower than those in 𝓰^RS (21.55–30.05), 𝓰^RT (20.82–30.58), and 𝓰^SS (20.61–30.44). This is because a larger S_T leads to larger fully developed regions with a smaller Nusselt number. It should be noted that the effect of S_T on the Nusselt number is smaller than that on the pressure drop.

**Figure 6.16** Optimal geometric details of (a) the most preferred solution (𝓰⁽²⁷⁾), (b) rotated square (𝓰^RS), (c) regular triangle (𝓰^RT), and (d) standard square (𝓰^SS).

**Figure 6.17** The optimal results for all uncertain samples in 𝓰⁽²⁷⁾, 𝓰^RS, 𝓰^RT, and 𝓰^SS. (a) The probability of occurrence, (b) inlet air velocity, (c) pressure drop, and (d) Nusselt number. All samples are rearranged in ascending order of the dry-bulb temperature of inlet air.

6.5 Conclusion

Geometry optimization is essential for heat exchanger design, but it is often limited by the high computational time and costs associated with traditional trial-and-error or empirical methods. By utilizing PINN to parameterize the geometric and operating inputs of a heat sink system, this study proposed a new inverse design method that starts with the desired objectives and works backward to find the optimal designs. For simulating heat and mass transfer processes, a specialized neural network structure was developed. It efficiently decomposes the standard PINN model into multiple interconnected subnetworks with identical architecture, which are functionally designed to approximate the latent solutions of the governing PDEs, i.e. Navier–Stokes, heat transfer in fluid, and heat transfer in solid, respectively. Based on this PINN model, a parametric surrogate model was developed according to the specific inputs and targets, which can be further coupled with multi-objective optimization and decision-making algorithms. The performance was verified by discovering the best-performing geometric design and the corresponding optimal operating conditions in the following illustrative examples.

A 3D finned heat sink system was used to demonstrate the proposed method’s ability to provide good quantitative agreement and sufficient confidence in describing the underlying behaviors of fluid flow and heat transfer inside the heat sink system. The method significantly accelerated the design optimization process, from more than 216 hours for the traditional methods to about 113.9 hours for the proposed method. For better physical inspection, three representative scenarios, namely high-performance design, equilibrium design, and low-cost design, were further considered according to the weighting factors allocated to each objective in decision-making.

We leveraged the capabilities of physics-informed deep learning and transfer learning to develop a novel computational framework that accelerates the design of a heat exchanger system. We developed a sequential space decomposition strategy that generated a batch of discrete geometric designs, followed by finite stochastic samples from the combined space of operating and uncertain variables. The total training time can be sharply reduced from 27 295 to 326 hours, indicating that the developed method is effective in promoting the application of the deep learning method in practical engineering design and optimization.

In the foreseeable future, the training costs for such PINN models can be further reduced to a more advantageous level, e.g. a few days or even hours, with advances in computational resources and acceleration algorithms or strategies. Some potential avenues for achieving this include: (i) developing new differentiation and optimization algorithms specifically designed for the loss functions, such as coupled automatic and numerical differentiation methods [70, 71]; (ii) exploring more efficient training techniques, like reinforcement learning, meta-learning, and multi-task learning; and (iii) devising next-generation computational paradigms by synergistically combining the strengths of classical numerical methods and PINN techniques, such as the random feature method [72].

References

1 Wu, Z., Lu, Z., Zhang, B. et al. (2022). Stochastic bi-objective optimization for closed wet cooling tower systems based on a simplified analytical model. Energy 250: 123703.
2 Khattak, Z. and Ali, H.M. (2019). Air cooled heat sink geometries subjected to forced flow: a critical review. International Journal of Heat and Mass Transfer 130: 141–161.
3 Alihosseini, Y., Zabetian Targhi, M., Heyhat, M.M., and Ghorbani, N. (2020). Effect of a micro heat sink geometric design on thermo-hydraulic performance: a review. Applied Thermal Engineering 170: 114974.
4 Zobeiry, N. and Humfeld, K.D. (2021). A physics-informed machine learning approach for solving heat transfer equation in advanced manufacturing and engineering applications. Engineering Applications of Artificial Intelligence 101: 104232.
5 Xie, G., Wang, Q., and Sunden, B. (2010). Application of a genetic algorithm for thermal design of fin-and-tube heat exchangers. Heat Transfer Engineering 29 (7): 597–607.
6 Cho, D.-H., Seo, S.-K., Lee, C.-J., and Lim, Y. (2017). Optimization of layer patterning on a plate fin heat exchanger considering abnormal operating conditions. Applied Thermal Engineering 127: 1036–1048.
7 Chu, W.-X., Tsai, M.-K., Jan, S.-Y. et al. (2020). CFD analysis and experimental verification on a new type of air-cooled heat sink for reducing maximum junction temperature. International Journal of Heat and Mass Transfer 148: 119094.
8 Wang, C.-C., Hung, C.-I., and Chen, W.-H. (2012). Design of heat sink for improving the performance of thermoelectric generator using two-stage optimization. Energy 39 (1): 236–245.
9 Nguyen, N.P., Maghsoudi, E., Roberts, S.N., and Kwon, B. (2023). Shape optimization of pin fin array in a cooling channel using genetic algorithm and machine learning. International Journal of Heat and Mass Transfer 202: 123769.
10 Karniadakis, G.E., Kevrekidis, I.G., Lu, L. et al. (2021). Physics-informed machine learning. Nature Reviews Physics 3 (6): 422–440.
11 Lu, L., Pestourie, R., Yao, W. et al. (2021). Physics-informed neural networks with hard constraints for inverse design. SIAM Journal on Scientific Computing 43 (6): B1105–B1132.
12 Raissi, M., Perdikaris, P., and Karniadakis, G.E. (2019). Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational Physics 378: 686–707.
13 Xie, X., Liu, H., He, C. et al. (2019). Deciphering the heat and mass transfer behaviors of staggered tube bundles in a closed wet cooling tower using a 3-D VOF model. Applied Thermal Engineering 161: 114202.
14 Biegler, L.T., Lang, Y.-D., and Lin, W. (2014). Multi-scale optimization for process systems engineering. Computers and Chemical Engineering 60: 17–30.
15 Lang, Y., Zitney, S.E., and Biegler, L.T. (2011). Optimization of IGCC processes with reduced order CFD models. Computers and Chemical Engineering 35 (9): 1705–1717.
16 Lemos, J.C., Costa, A.L.H., and Bagajewicz, M.J. (2018). Globally optimal linear approach to the design of heat exchangers using threshold fouling modeling. AIChE Journal 64 (6): 2089–2102.
17 Souza, P.A., Costa, A.L.H., and Bagajewicz, M.J. (2018). Globally optimal linear approach for the design of process equipment: the case of air coolers. AIChE Journal 64 (3): 886–903.
18 Doodman, A.R., Fesanghary, M., and Hosseini, R. (2009). A robust stochastic approach for design optimization of air cooled heat exchangers. Applied Energy 86 (7): 1240–1245.
19 Raissi, M., Yazdani, A., and Karniadakis, G.E. (2020). Hidden fluid mechanics: learning velocity and pressure fields from flow visualizations. Science 367 (6481): 1026–1030.
20 Cai, S., Mao, Z., Wang, Z. et al. (2021). Physics-informed neural networks (PINNs) for fluid mechanics: a review. Acta Mechanica Sinica 37 (12): 1727–1738.
21 Kharazmi, E., Cai, M., Zheng, X. et al. (2021). Identifiability and predictability of integer- and fractional-order epidemiological models using physics-informed neural networks. Nature Computational Science 1 (11): 744–753.
22 Yuan, L., Ni, Y.-Q., Deng, X.-Y., and Hao, S. (2022). A-PINN: auxiliary physics informed neural networks for forward and inverse problems of nonlinear integro-differential equations. Journal of Computational Physics 462: 111260.
23 Pang, G., Lu, L., and Karniadakis, G.E. (2019). fPINNs: fractional physics-informed neural networks. SIAM Journal on Scientific Computing 41 (4): A2603–A2626.
24 Yang, L., Zhang, D., and Karniadakis, G.E. (2020). Physics-informed generative adversarial networks for stochastic differential equations. SIAM Journal on Scientific Computing 42 (1): A292–A317.
25 Zhang, D., Lu, L., Guo, L., and Karniadakis, G.E. (2019). Quantifying total uncertainty in physics-informed neural networks for solving forward and inverse stochastic problems. Journal of Computational Physics 397: 108850.
26 Baydin, A.G., Pearlmutter, B.A., Radul, A.A., and Siskind, J.M. (2015). Automatic differentiation in machine learning: a survey. Journal of Machine Learning Research 18 (1): 5595–5637.
27 Cai, S., Mao, Z., Wang, Z. et al. (2021). Physics-informed neural networks (PINNs) for fluid mechanics: a review. Acta Mechanica Sinica 37 (12): 1727–1738.
28 He, Q., Barajas-Solano, D., Tartakovsky, G., and Tartakovsky, A.M. (2020). Physics-informed neural networks for multiphysics data assimilation with application to subsurface transport. Advances in Water Resources 141: 103610.
29 Cao, Y., Xu, R., and Jiang, P. (2023). Physics-informed machine learning based RANS turbulence modeling convection heat transfer of supercritical pressure fluid. International Journal of Heat and Mass Transfer 201: 123622.
30 Kissas, G., Yang, Y., Hwuang, E. et al. (2020). Machine learning in cardiovascular flows modeling: predicting arterial blood pressure from non-invasive 4D flow MRI data using physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering 358: 112623.
31 Yin, M., Zheng, X., Humphrey, J.D., and Karniadakis, G.E. (2021). Non-invasive inference of thrombus material properties with physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering 375: 113603.
32 Arzani, A., Wang, J.-X., and D’Souza, R.M. (2021). Uncovering near-wall blood flow from sparse data with physics-informed neural networks. Physics of Fluids 33 (7): 071905.
33 Kim, K.M., Hurley, P., and Duarte, J.P. (2022). Physics-informed machine learning-aided framework for prediction of minimum film boiling temperature. International Journal of Heat and Mass Transfer 191: 122839.
34 Lu, Z., Li, Y., He, C. et al. (2022). Integrating physics-informed neural networks with partitioned coupling strategy for modeling conjugate heat transfer. CIESC Journal 73 (12): 5483–5493.
35 Lu, Z., Qu, J., Liu, H. et al. (2021). Surrogate modeling for physical fields of heat transfer processes based on physics-informed neural network. CIESC Journal 72 (3): 1496–1503.
36 Xie, J., Chai, Z., Xu, L. et al. (2022). 3D temperature field prediction in direct energy deposition of metals using physics informed neural network. The International Journal of Advanced Manufacturing Technology 119 (5–6): 3449–3468.
37 Raabe, D., Mianroodi, J.R., and Neugebauer, J. (2023). Accelerating the design of compositionally complex materials via physics-informed artificial intelligence. Nature Computational Science 3 (3): 198–209.
38 Zhu, Q., Liu, Z., and Yan, J. (2021). Machine learning for metal additive manufacturing: predicting temperature and melt pool fluid dynamics using physics-informed neural networks. Computational Mechanics 67 (2): 619–635.
39 Chamoli, S., Lu, R., Chen, H. et al. (2019). Numerical optimization of design parameters for a modified double-layer microchannel heat sink. International Journal of Heat and Mass Transfer 138: 373–389.
40 Kumar, S., Sarkar, M., Singh, P.K., and Lee, P.S. (2019). Study of thermal and hydraulic performance of air cooled minichannel heatsink with novel geometries. International Communications in Heat and Mass Transfer 103: 31–42.
41 Rao, R.V. and Waghmare, G.G. (2015). Multi-objective design optimization of a plate-fin heat sink using a teaching-learning-based optimization algorithm. Applied Thermal Engineering 76: 521–529.
42 Lu, Z., Li, Y., He, C. et al. (2024). Multi-objective inverse design of finned heat sink system with physics-informed neural networks. Computers and Chemical Engineering 180: 108500.
43 Wu, Z., Zhang, B., Yu, H. et al. (2023). Accelerating heat exchanger design by combining physics-informed deep learning and transfer learning. Chemical Engineering Science 282: 119285.
44 Zhu, Y., Zabaras, N., Koutsourelakis, P.-S., and Perdikaris, P. (2019). Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data. Journal of Computational Physics 394: 56–81.
45 Lu, L., Meng, X., Mao, Z., and Karniadakis, G.E. (2021). DeepXDE: a deep learning library for solving differential equations. SIAM Review 63 (1): 208–228.
46 NVIDIA Corporation. (2021). Modulus user guide (release v21.06). https://developer.nvidia.com/modulus (accessed 03 April 2023).
47 Sukumar, N. and Srivastava, A. (2022). Exact imposition of boundary conditions with distance functions in physics-informed deep neural networks. Computer Methods in Applied Mechanics and Engineering 389: 114333.
48 Zhao, Z., Zhu, Q., and Yan, J. (2021). A thermal multi-phase flow model for directed energy deposition processes via a moving signed distance function. Computer Methods in Applied Mechanics and Engineering 373: 113518.
49 Hennigh, O., Narasimhan, S., Nabian, M.A. et al. (2021). NVIDIA SimNet™: an AI-accelerated multi-physics simulation framework. International Conference on Computational Science (16–18 June 2021). pp. 447–461. Krakow, Poland: Springer.
50 Laszczyk, M. and Myszkowski, P.B. (2019). Improved selection in evolutionary multi–objective optimization of multi–skill resource–constrained project scheduling problem. Information Sciences 481: 412–431.
51 Gutjahr, W.J. and Pichler, A. (2016). Stochastic multi-objective optimization: a survey on non-scalarizing methods. Annals of Operations Research 236 (2): 475–499.
52 Bonnel, H. and Collonge, J. (2014). Stochastic optimization over a Pareto set associated with a stochastic multi-objective optimization problem. Journal of Optimization Theory and Applications 162 (2): 405–427.
53 Giagkiozis, I. and Fleming, P.J. (2015). Methods for multi-objective optimization: an analysis. Information Sciences 293: 338–350.
54 Deb, K., Pratap, A., Agarwal, S., and Meyarivan, T. (2002). A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Transactions on Evolutionary Computation 6 (2): 182–197.
55 Lai, Y.J., Liu, T.Y., and Hwang, C.L. (1994). TOPSIS for MODM. European Journal of Operational Research 76 (3): 486–500.
56 Wang, T.H., Wu, H.C., Meng, J.H., and Yan, W.M. (2020). Optimization of a double-layered microchannel heat sink with semi-porous-ribs by multi-objective genetic algorithm. International Journal of Heat and Mass Transfer 149: 119217.
57 Electronic chip cooling, COMSOL Multiphysics Application Library, application ID: 47721, created in COMSOL Multiphysics 5.5. https://cn.comsol.com/model/electronic-chip-cooling-47721 (accessed 16 February 2022).
58 Xiao, H., Liu, Z., and Liu, W. (2021). Conjugate heat transfer enhancement in the mini-channel heat sink by realizing the optimized flow pattern. Applied Thermal Engineering 182: 116131.
59 Yang, X.-H., Tan, S.-C., Ding, Y.-J., and Liu, J. (2017). Flow and thermal modeling and optimization of micro/mini-channel heat sink. Applied Thermal Engineering 117: 289–296.
60 Ermagan, H. and Rafee, R. (2018). Geometric optimization of an enhanced microchannel heat sink with superhydrophobic walls. Applied Thermal Engineering 130: 384–394.
61 Li, J., Zhou, G., Tian, T., and Li, X. (2021). A new cooling strategy for edge computing servers using compact looped heat pipe. Applied Thermal Engineering 187: 116599.
62 Electronic chip cooling, application ID: 47721, created in COMSOL Multiphysics 5.5. https://cn.comsol.com/model/electronic-chip-cooling-47721 (accessed 03 April 2023).
63 Laubscher, R. (2021). Simulation of multi-species flow and heat transfer using physics-informed neural networks. Physics of Fluids 33 (8): 087101.
64 Kong, Y., Yang, L., Du, X., and Yang, Y. (2016). Air-side flow and heat transfer characteristics of flat and slotted finned tube bundles with various tube pitches. International Journal of Heat and Mass Transfer 99: 357–371.
65 Ramachandran, P., Zoph, B., and Le, Q.V. (2017). Searching for activation functions. arXiv preprint arXiv:1710.05941.
66 Pan, S.J. and Yang, Q. (2009). A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering 22 (10): 1345–1359.
67 Li, M., Liu, Y., Liu, X. et al. (2020). The deep learning compiler: a comprehensive survey. IEEE Transactions on Parallel and Distributed Systems 32 (3): 708–727.
68 Leary, C. and Wang, T. (2017). XLA: TensorFlow, compiled. TensorFlow Dev Summit 2 (3).
69 Tancik, M., Srinivasan, P., Mildenhall, B. et al. (2020). Fourier features let networks learn high frequency functions in low dimensional domains. Advances in Neural Information Processing Systems 33: 7537–7547.
70 Chiu, P.-H., Wong, J.C., Ooi, C. et al. (2022). CAN-PINN: a fast physics-informed neural network based on coupled-automatic–numerical differentiation method. Computer Methods in Applied Mechanics and Engineering 395: 114909.
71 Wu, Z., Wang, H., He, C. et al. (2023). The application of physics-informed machine learning in multiphysics modeling in chemical engineering. Industrial and Engineering Chemistry Research 62 (44): 18178–18204.
72 Chen, J., Chi, X., E, W., and Yang, Z. (2022). Bridging traditional and machine learning-based algorithms for solving PDEs: the random feature method. Journal of Machine Learning 1 (3): 268–298.

Tags: Applied AI Techniques in the Process Industry From Molecular Design to Process Design and Optimization
