Nomenclature
- DNN
-
Deep Neural Network
- ROM
-
Reduced Order Model
- ReLU
-
Rectified Linear Unit
- Adam
-
Adaptive Moment Estimation
- MAE
-
Mean Absolute Error
- MSE
-
Mean Squared Error
- RMSE
-
Root Mean Squared Error
1.0 Introduction
Vector nozzle technology can improve the short takeoff and landing capability [Reference Burcham, Ray, Conners and Walsh1], and promote the manoeuvering agility of fighter aircraft [Reference Wallace and Bowers2]. The axisymmetric vector nozzle is one of the main types of vector nozzles [Reference Nguyen and Gilert4]. It consists of complex multistage connecting rods with flexible components and a large number of joint clearances [Reference Changpeng, Yerong, Zheng and Zhang31]. These factors can lead to changes in the kinematic characteristics and driving force of the nozzle mechanism, affecting the stability of its operation [Reference Miao, Li, Liu, He and Zhu9], and pose significant challenges for the rapid modeling and evaluation of nozzle mechanism performance [Reference Xiulong, Yonghao, Yu and Qing10].
Various external factors such as machining, assembly errors and wear errors produce different sizes of joint clearance, which will significantly affect the motion accuracy and dynamic response of the axisymmetric vector nozzle [Reference Du and Li32]. By establishing a deep neural network (DNN) model of the axisymmetric vector nozzle, the effect of joint clearance parameters on the kinematic and dynamic characteristics of the mechanism can be efficiently investigated [Reference Zheng, Zhao, Zhang and Wang33]. The dynamic response of the nozzle mechanism is sensitive to the variation of the joint clearance parameters. Therefore, it is necessary to study the effect of joint clearance on the dynamic response for axisymmetric vector nozzle mechanisms with high accuracy requirements.
Machine learning and agent models are used to study the dynamic problems of vehicles, robots and railroads or to identify nonlinear control parameters [Reference Li, Wu, Tedrake, Tenenbaum and Torralba11], but very little research has been done in the field of aero-engine adjusting mechanisms. Kraft et al. used a black box modeling approach to simulate multibody dynamics of vehicle system, estimate the track geometry and analyse vehicle response to measured trajectories [Reference Kraft, Causse and Martinez16]. Ansari et al. established a neural network-based nonlinear tire model to simulate the dynamic response of a real tire, and speed up the calculation of the dynamic response of the vehicle model [Reference Ansari, Tupy, Datar and Negrut14]. Yu et al. proposed a DNN-based simulation model for rail vehicles to predict the acceleration response of axle boxes corresponding to different dimensions and vehicle speeds [Reference Yunguang, Shi, Krause and Hecht25]. Kurvinen et al. used a data-driven modeling approach to accelerate the design process and provide efficient models to address control challenges. Neural networks are able to solve multibody dynamics problems and predict the response quickly and reliably [Reference Choi, An, Han, Kim, Jung and Choi20, Reference Bernal22], provide strong means to identify nonlinear parameters in the control equations or estimating multibody dynamics in a probabilistic sense [Reference Jian, Xiaochuan, Jiayin, Jingmang and Ping17, Reference Byravan and Fox19]. Go et al. proposed that a date driven based reduced order model (ROM) can effectively improve the simulation efficiency of multibody dynamics [Reference Go, Han, Lim and Kim12]. Ting et al. attempted to apply Bayesian networks in multibody dynamics simulations, which were able to automatically detect relevant features of the model and reduce the effect of noise on the model [Reference Ting, Mistry, Peters, Schaal and Nakanishi23, Reference Kim and Joo24]. The above research papers show that the application of machine learning in multibody dynamics systems achieves excellent results. DNN is chosen in this paper because it offers significant advantages over other surrogate models like kriging [Reference Koziel, Çalık, Mahouti and Belen3]. For example, DNN has strong expressive power and flexibility, can handle complex nonlinear relationships, and automatically extract features, making them suitable for high-dimensional and large-scale problems [Reference Kirkwood, Economou, Pugeault and Odbert6]. The accuracy and effectiveness of DNN in the multibody dynamics field have been validated, and the axisymmetric vector nozzle is a typical multibody dynamics system, therefore DNN is selected as the surrogate model for analysing axisymmetric vector nozzles.
The axisymmetric vector nozzle mechanism is a typical multibody dynamics system [Reference Counts8], and considering the cited literature on the application of machine learning in multibody dynamics systems, the data-driven approach based on DNN is expected to build highly accurate and low-cost agent models for the axisymmetric vector nozzle mechanism. The development of machine learning provides a valuable opportunity for efficiently and accurately designing and analysing axisymmetric vector nozzles.
The axisymmetric vector nozzle mechanism is a typical multibody dynamics system. In recent years, machine learning has been extensively researched within the field of multibody dynamics. This study verifies the feasibility of using machine learning for multibody dynamics analysis. Furthermore, it offers the possibility of employing DNN to simulate the axisymmetric vector nozzle mechanism. Considering the complex structure and strong nonlinearity of the axisymmetric vector nozzle, a well-trained DNN is built to analyse the kinematics and dynamics of the axisymmetric vector nozzle system. The effect of the joint clearance on the dynamic response of the axisymmetric vector nozzle is investigated. DNN is proved to be an effective model capable of predicting the dynamic response of axisymmetric vector nozzles with high accuracy, low cost and high efficiency.
2.0 Multibody dynamics modeling approach
2.1 The dynamic equations for the axisymmetric vector nozzle mechanism
A typical axisymmetric vector nozzle mechanism mainly contains: casing, actuator, throat area adjusting ring, steering drive ring, tie rods, convergence adjusting piece and expansion adjusting piece [Reference Choi, An, Han, Kim, Jung and Choi20, Reference Choi, An, Kim, Jung and Choi21]. The axisymmetric vector nozzle model diagram is shown in Fig. 1.
The expansion blade, convergence blade and the related connecting structure present an axisymmetric distribution therefore a branch containing the expansion blade and convergence blade and connecting structures can also reflect the dynamics of the axisymmetric vector nozzle mechanism to a certain extent effectively. To reduce the complexity of the mechanism and computation burden, one single link branch is selected for dynamic modeling and analysis.
In axisymmetric vector nozzle systems, expansion blade F and convergence blade G are subjected to high-aerodynamic load, which will be transmitted to drag link E. To accurately describe the system’s dynamic characteristics, the force transmitting component E is described as a flexible component, and the rest components are described as rigid. The absolute node coordinate formulation is used to describe the flexible component. The radial contact force, tangential friction force, and wear at the clearance joints are calculated by Flores contact force model [Reference Hedrih40], modified Coulomb friction model [Reference Flamm, Deere, Mason, Berrier and Johnson7] and Archard wear law [Reference Rajesh and Doddamani5], respectively.
The Lagrange equation for the $i$ th body of nozzle mechanism system can be expressed as:
where, ${T^i}$ , $\phi _{{q^i}}^T$ and $\lambda $ are the kinetic energy, the Jacobi matrix and the Lagrange multiplier, respectively. ${\rho ^i}$ , ${V^i}$ and ${M^i}$ are the density, volume and mass of the $i$ th body, respectively.
Considering the constraints imposed on the system, the equation of motion of the multibody system is obtained, as follows:
where $Q$ is the generalised external force, $M$ is the assembled mass matrix, and $F$ is the assembled generalised elastic force. The generalised coordinates and Lagrange multipliers of the equations can be solved using numerical integration such as the Newmark-beta method [Reference Newmark27] and HHT-13 [Reference Negrut, Rampalli, Ottarsson and Sajdak28].
Equation of motion of nozzle mechanism system considering joint clearance, rigid and flexible coupling can be written as:
where, ${M}_{\sum }$ and $K$ are respectively the system mass matrix, damping matrix and stiffness matrix assembled from the individual components. ${F_c}$ is the generalised contact force, which includes the clearance collision force and friction force. ${Q_e}$ is the generalised external force. $\Phi $ is the constraint equation and ${\Phi _q}$ is the Jacobi matrix of the constraint equation [Reference Li, Wang and Huang29].
Considering that the rigid-flexible coupled system, the mass matrix and generalised coordinates can be written as follows:
where, ${M_f}$ and ${M_r}$ represent the mass matrices of the flexible and rigid parts, respectively, and ${q_f}$ , ${q_r}$ represent the generalised coordinates of the flexible and rigid parts, respectively.
3.0 Deep neural network models
DNN is an ML algorithm, which is mainly used to process large-scale complex data, such as images and natural language. DNN can learn highly abstract representations and patterns through multilevel nonlinear transformations.
The structure of DNN includes input, hidden, output layers and all the layers are connected through weights and biases [Reference Choi, An, Kim, Jung and Choi21]. An effective and reliable dataset is very important for the training of DNN, and the appropriate hyperparameters also significantly affect the performance of DNN, including activation function, loss function, number of hidden layers, number of epochs, number of neurons and optimiser selection.
Activation function is a nonlinear function that is responsible for introducing nonlinear properties to the model, allowing the neural network to learn and represent complex functional relationships [Reference Zheng, Saponara, Tian, Yu, Elhanashi and Yu36]. The batch size is the count of training data samples used in performing a single weight update. Due to limitations such as memory, it is generally not recommended to use all available data samples for training at once [Reference Zheng, Zhao, Wang, Elhanashi and Saponara34]. Increasing the batch size can reduce the computational cost required for training. The loss function measures the discrepancy between the model’s predictions and the actual labels. When training a neural network, the model parameters are optimised by minimising the loss function, allowing the model to make more accurate predictions. Stochastic optimisation methods are used to optimise neural networks [Reference Zheng, Zhao, Zhang and Wang33]. Usually, a small batch of data is obtained, which is treated as stochastic data and gradient descent is performed with this stochastic data [Reference Zheng, Tian, Yu, Ding, Elhanashi, Saponara and Kpalma37].
In deep learning, regularisation is used to prevent over fitting by adding constraints and thus improve the robustness of the model [Reference Zheng, Zhao, Zhang and Wang33, Reference Zheng, Tian, Yu, Wang, Elhanashi and Saponara35]. It involves saving and updating the best parameters during training, stopping when improvements cease and using the last best parameters. This technique limits the optimisation to a smaller parameter space [Reference Zheng, Zhao, Wang, Elhanashi and Saponara34, Reference Zheng, Saponara, Tian, Yu, Elhanashi and Yu36]. Early Stopping is chosen as the regularisation method in our DNN model and end the training when the performance of the trained model evaluated through the validation set is no longer increasing.
The selection of appropriate hyperparameters is very important to the performance of the DNN model. We choose the random search method [Reference He, Zhao and Chu38] to optimise hyperparameters of the DNN model and finally determine the optimal hyperparameters. Each hidden layer uses the ReLU (Rectified Linear Unit) as activation function, ReLU can prevent the unsaturation of the gradient, which has good results in the application of the multibody dynamics field, and the optimiser is Adam (adaptive moment estimation). It should be mentioned that although AdamW is an improved algorithm based on Adam and L2 regularisation, Adam outperforms AdamW in the DNN model of axisymmetric vector nozzle mechanism. The comparison results will be shown in Section 4. A very large learning rate might prevent convergence to the optimal value, so a learning rate of 0.0001 is chosen, with batch size of 8192. MSE is used as loss function to evaluate the average squared error between the predicted and actual values of the model, respectively. The smaller the value of loss function, the better the performance of the model. As the error decreases, the gradient also decreases, which facilitates convergence even when using a fixed learning rate [Reference Bernal22]. The formulas of loss function are given below:
where, $N,y,\hat y$ are the number of outputs, baseline values and predicted values in the dataset, respectively.
4.0 Numerical results
The DNN model of the axisymmetric vector nozzle is established by limited training data. The adjusted R2, MAE, MSE and RMSE are used as evaluation function to comprehensively evaluate and verify the accuracy of the DNN model. The dynamic response of the DNN at different clearance values was quantified and analysed.
The closer the adjusted R2 value is to 1, the more accurate the prediction results are, and the closer the MAE, MSE and RMSE values are to 0, the more accurate the prediction results are. The formulas for adjusted R2, MAE, MSE and RMSE as shown in the below equations (6)–(9):
where, $y,\hat y,\bar y$ are the baseline value, predicted value and average of the baseline values in the dataset, respectively, N is the number of samples and p is the number of features.
4.1 Deep neural network model for axisymmetric vector nozzle mechanism
The dynamic equations of axisymmetric vector nozzle mechanism considering joint clearance and rigid-flexible coupling are established and numerically solved, the relevant parameters are shown in Table 1.
The hyperparameters of DNN for the axisymmetric vector nozzle mechanism are summarized in Table 2.
The clearance value of joint in Fig. 1 varies from 0.00 cm to 0.50 cm with a step size of 0.01 cm, and the time varies from 0s to 1s with a step size of 0.125s. The dynamic response of each clearance value is recorded with 800 data points. These data points serve as the training and validation dataset for the neural network. In total, there are 40,800 (501 $ \times $ 800) data points, with 80% used for training and 20% for testing.
Figure 2 shows the structure of an axisymmetric vector nozzle DNN, including input, hidden and output layers.
Figure 3 shows the angle, angular velocity, displacement, angular acceleration of the expansion blade F and driving force of actuator B under a randomly disrupted test set based on the DNN model.
From the resultant plots of predicted data versus simulated data in Fig. 3, it is clearly shown that the kinematic parameters and driving forces of the axisymmetric vector nozzle remain highly accurate even with randomly test data. It is verified that with appropriate hyperparameters, DNN can predict the dynamic response of the axisymmetric vector nozzle with high accuracy after reasonable training.
In addition, the DNN model of axisymmetric vector nozzles is validated using data outside of the training and test datasets to confirm the robustness of the model that the model maintains a high level of accuracy despite reasonably variation of input parameters, where clearance values from 0.151 cm to 0.159 cm are used as the validation set.
From Fig. 4, it is indicated that the kinematic parameters and driving forces of the axisymmetric vector nozzle from prediction are coincide with numerical simulation results, which ensures high accuracy even with the validation data set other than the training and test sets. The results show that the DNN model of the axisymmetric vector nozzle can predict the dynamic response of the axisymmetric vector nozzle with high accuracy and robustness. Next, the performance of the DNN model is evaluated by adjusted R2, MAE, MSE and RMSE.
Table 3 shows that the adjusted R2 values of angle, angular velocity, position, angular acceleration and driving force are almost 1. The values of MAE, MSE and RMSE are much smaller compared with the target and predicted values. This indicates that although the presence of joint clearance and flexible deformations in the nozzle system increases the degree of nonlinearity, the neural network model is still able to make high accuracy prediction.
Figure 5 shows the Adjusted R 2 of the kinematic parameters and driving force of the DNN of axisymmetric vector nozzle with high accuracy.
To investigate the applicability of different optimiser, the DNN model is retrained with AdamW optimiser, while the rest parameters of DNN keep unchanged. Figures 6-8 show the comparison between the test and validation sets with AdamW optimiser. Table 4 shows the values of adjusted R2, MAE, MSE and RMSE with AdamW optimiser. Through comparison of the DNN prediction capability between Adam optimiser and AdamW optimiser shown in Figs 3-8, as well as the adjusted R2, MAE, MSE and RMSE values shown in Table 3 and Table 4, it is clearly reflected that Adam outperforms AdamW.
The DNN model in this paper is trained using GPU (NVIDIA GeForce GTX 1660 TI).
Table 5 shows the time comparison between traditional multibody computation methods and DNN, which may vary according to computer conditions. The time recorded in Table 4 can reflect the computational efficiency of DNN compared with traditional methods.
4.2 Dynamics analysis of axisymmetric vector nozzle based on DNN model
Using the trained DNN model, the effect of different clearance values on the dynamic response of the kinematic parameters of the axisymmetric vector nozzle expending blade is analysed.
Five different clearance values (0.00cm, 0.10cm, 0.15cm, 0.20cm and 0.25cm, respectively) are chosen to study the effect of clearance on the angle of rotation, angular velocity, position, angular acceleration and driving force of the axisymmetric vector nozzle.
The motion of the axisymmetric vector nozzle is a reciprocal process, 0-0.5s the expansion blade contracts and 0.5-1.0s expansion blade returns to its original position. The dynamic response of the DNN model indicates that the joint clearance has a certain effect on the angle of rotation, angular velocity and position. The influence of clearance on the expansion blade at 0s-0.2s and 0.8s-1.0s is more pronounced compared with 0.2s-0.6s, because the clearance compensates for the changes of the angle and position of the expansion blade to a certain extent, and the variation of angular velocity are caused by the changing of the contact state between the shaft and the bushing due to the clearance. The clearance has a significant effect on the angular acceleration and driving force, especially at the maximum and minimum values, because the clearance causes impacts and collisions between the shaft and the bushing, and leads to the increase of the relative velocity and contact force.
The partial enlargement view in Fig. 9 clearly shows the there is a crossover of the response of different parameters. During the reciprocating movement, the joint clearance leads to a change of contact state between the shaft and the bushing, and results in the opposite effect of the clearance on the parameters of expending blade.
Figure 10 shows the variation of angle, angular velocity, position, angular acceleration and driving force with different clearance values versus no clearance condition, respectively. Overall, larger clearance value results in more pronounced fluctuations of the differential response of the expanding blade. The clearance imposes significant nonlinearity on the dynamic response of the nozzle mechanism. Also with larger clearance value, the peak of the dynamic response appears earlier. With larger clearance, the shaft needs more time to contact with bushing and transmit the movement afterward and also possess higher impact energy due to the longer freely movement time.
From Table 6, it can be seen that the effect of the clearance value on the angular acceleration and driving force of the expansion blade is the most obvious, comes after angle and angular velocity, the impact on position is the minimum. The collision and impact between shaft and bushing due to clearance cause an abrupt change of the contact force, which affects the driving force and angular acceleration significantly.
The results in this section are obtained by the well-trained DNN model of vector nozzle mechanism, although these computational results can also be obtained by the traditional method, it takes more time, as shown in Table 6. Compared with the multibody dynamics simulation by traditional algorithms, DNN can get the prediction results of the mechanism dynamic response in millisecond, and the high accuracy of the dynamic response prediction results is guaranteed. Therefore, DNN as an agent model significantly reduces the computational burden associated with traditional multibody dynamics simulation and provides the possibility of real-time monitoring of the dynamic response.
5.0 Conclusion
The appropriately trained DNN model of the axisymmetric vector nozzle mechanism shows high accuracy and efficiency in the offline prediction and can be used to make real-time predictions of the dynamic response of the complex axisymmetric vector nozzle system saving huge computation costs. This study analyses the effects of joint clearance on the kinematics and dynamics of the axisymmetric vector nozzle mechanism using a DNN model. It has been demonstrated that joint clearance influences the angular acceleration of the expanding blade and the driving force predominantly, also exerting a significant effect on the angle, angular velocity and position of the expanding blade. The results indicate that an increase in joint clearance value leads to a notable intensification of the fluctuations in the dynamic response of the mechanism, which is due to the increase of the relative velocity and contact force between pin and bushing of the joints. For further analysis of complex axisymmetric vector nozzles mechanism with more variables, training high-precision DNN models require even larger amounts of data and the cost of training DNN models increases significantly. This is the limitation of neural network supervised learning itself, which requires model degradation and data-handling techniques. In addition, the clearance of joints is uncertain quantity essentially due to various external factors such as wear in services environment. It is necessary to study the dynamic response of the mechanism, taking into account the uncertainty of the joint clearance. Considering the large samplings during uncertainty analysis, DNN offers new possibilities for the study of joint clearance uncertainty.
Data availability
This manuscript does not report data generation or analysis.
Acknowledgements
This work is supported by the National Natural Science Foundation of China (52375086), Stable Support for Military Research Institutes-Research on High-Load, Wide-Range Aero-engine Adjusting Mechanism Technology, Joint Program on Science and Technology (Applied Basic Research Projects) of Liaoning Province(2023JH2/101700300), the Fundamental Research Funds for the Central Universities of China (2023GFYD14).
Conflicts of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.