1. Introduction
Typhoons are a type of tropical cyclone, formed by tropical oceanic atmospheric circulation systems, characterised by intense rotating storms. These storms are often accompanied by hazardous weather conditions such as strong winds, heavy rain, floods and storm surges, causing significant damage to areas along their path, including destruction of buildings, infrastructure and crops, as well as loss of life. For instance, in 2023, Typhoon Doksuri struck Fujian Province in China, affecting 725,000 people, causing power outages for 1.21 million households and resulting in direct economic losses of approximately 177 million RMB. As the second most powerful typhoon to make landfall in Fujian since 1949, Doksuri’s destructive force caused extensive damage to the communities along its path. This underscores the critical importance of accurate typhoon path prediction in safeguarding lives and property. Particularly for super typhoons, short-term forecasting is crucial for ships unable to return to port and for residents in affected areas to obtain timely information to avoid the typhoon and protect themselves. The nowcasting typhoon track prediction proposed in this study targets the prediction of typhoon movement within 0–12 h. This time frame is set based on relevant standards from the World Meteorological Organization. According to these standards, the ‘nowcasting’ of tropical cyclones typically refers to forecasts within 0–6 h. However, some studies extend this period to 12 h for the sake of operational continuity.
Traditional methods for predicting typhoon tracks primarily consist of numerical statistical methods and meteorological dynamics methods. Numerical statistics and meteorological physical models can also be combined. For instance, Chen and Duan (Reference Chen and Duan2018) employ a statistical-dynamic model to assess typhoon risk in southeastern China. This model comprises generation, movement and intensity models, improving traditional empirical footprint models through statistical-dynamic relationships. Hon (Reference Hon2020) uses the WRF model for typhoon path prediction. However, these prediction methods require extensive data support and high hardware demands. To address the shortcomings of traditional forecasting methods, many researchers have turned to machine learning technologies for predicting typhoon paths. Chen et al. (Reference Chen, Zhang and Wang2020) indicate that combining decision tree analysis, random forests and deep learning methods can enhance the accuracy of tropical cyclone path predictions. While these methods have improved prediction accuracy, challenges remain in feature extraction and response to external disturbances. Consequently, other researchers have opted for predictive models using recurrent neural networks (RNNs) and their advanced variant, long short-term memory (LSTM) networks (Pang et al., Reference Pang, Zhao, Yan and Liu2021; Gao et al., Reference Gao, Zhao, Pan, Li, Zhou, Xu, Zhong and Shi2018; Suo et al., Reference Suo, Chen, Claramunt and Yang2020; Jiang et al., Reference Jiang, Yu, Ma, Zhuang, Ye, Yuan and Su2024; Liang et al., Reference Liang, Yang, Sun, He, Dai, Su and Yu2023). Gao et al. (Reference Gao, Zhao, Pan, Li, Zhou, Xu, Zhong and Shi2018) trained an LSTM neural network using typhoon path observation data from 1949 to 2012, provided by the North China Sea Prediction Center. By comparing the predicted results with observed data, the feasibility of LSTM networks in typhoon path prediction was validated and the impact of different training dataset sizes on prediction accuracy was studied. Additionally, Song et al. (Reference Song, Li, Meng, Xie and Xu2022) used CNN layers to model and extract spatial relationships between meteorological variables and tropical cyclone positions, and GRU layers to mine deep features of time series, constructing an integrated deep learning model. Jabbar et al. (Reference Jabbar, Li and Omar2021) mentions using GAN models to predict typhoon paths, addressing issues in GAN models such as Nash equilibrium, internal covariate shift, mode collapse, gradient vanishing and lack of proper evaluation metrics. These models can quickly and accurately perform future predictions. Beyond the aforementioned models, some researchers have enhanced prediction model performance by diversifying information feature extraction (Jiang et al., Reference Jiang, Yu, Ma, Zhuang, Ye, Yuan and Su2024; Liang et al., Reference Liang, Yang, Sun, He, Dai, Su and Yu2023; Qin et al., Reference Qin, Tang, Lu and Lao2022; Fu et al., Reference Fu, Gu and Wang2022; Ren et al., Reference Ren, Xu and Cui2022; Lu et al., Reference Lu, Sun, Xu, Wang, Zheng, Xie and Wang2022a, 2022b, 2023, Li et al., Reference Li, Lai, Nie, Liu, Liang and Lv2024). Ren et al. (Reference Ren, Xu and Cui2022) combines convolutional neural networks (CNNs) and LSTM networks, proposing a deep learning model named DeepTyphoon for predicting typhoon paths. By labelling satellite images and combining CBAM’s and LSTM’s feature extraction and prediction methods, the model predicts typhoon paths using two datasets released by the Japan Meteorological Agency (JMA). Lu et al. (Reference Lu, Sun, Xu, Wang, Zheng, Xie and Wang2022) processes typhoon path data through first-order differencing and cointegration tests to ensure data stationarity and cointegration relationships among variables. Subsequently, a C-LSTM model, combining CNN and LSTM networks, was constructed for typhoon path prediction. The study experimentally verifies the advantages of the C-LSTM model in typhoon path prediction and compares it with traditional LSTM models. Lu et al. (Reference Lu, Sun, Xu, Wang, Zheng, Xie and Wang2022) employs a ConvLSTM model, comprising convolution operations and gating units, to better extract image features. By marking the typhoon centre’s positions before and after on the generated heatmap of physical variables, the model learns these key features. Single-layer and multi-layer ConvLSTM experiments were conducted to validate the performance improvement of the multi-layer model. Additionally, some researchers have combined transformer and time-series analysis methods to enhance model performance (Zhan et al., Reference Zhan, Lu, Jiang, Li, Zhong and Lv2023; Jiang et al., Reference Jiang, Shi, Li, Ma, Li and Shi2023; Jung et al., Reference Jung, Baek, Moon, Lee and Sohn2024). For instance, Zhan et al. (Reference Zhan, Lu, Jiang, Li, Zhong and Lv2023) construct a method based on TRAN, achieving prediction errors mostly below an average error of 40.93 km, with the error frequency below the average error being 62.74%. Although this method shows reliability, it still requires validation and improvement in practical applications. Issues such as model complexity, workload, forgetting problems and parameter adjustment complexity persist.
In recent years, the Large Language Model (LLM) has rapidly advanced, with explosive growth in various application modes across different fields (Chib, Reference Chib2024). Peng et al. (Reference Peng, Chen, Zhu, Chen, Yang, Wang and Wang2024) reported a novel interpretable lane change prediction model named LC-LLM, using LLM’s powerful reasoning and self-explanation capabilities, which has shown significant improvements in prediction accuracy and interpretability. Lan et al. (Reference Lan, Liu, Fan, Lv, Ren and Cui2025) propose a dynamic traffic trajectory prediction method based on LLM, named Traj-LLM, envisioning its application in autonomous driving. Traj-LLM achieves high-precision prediction of complex trajectories and the embedment of physical rules by encoding multimodal trajectory data into language sequences, using the long-range dependency modelling and prompt-controlled generation capabilities of large language models. This approach allows for encoding trajectory points, time, and environmental factors (such as wind speed and pressure) into natural language sequences, breaking through the data fragmentation bottleneck of traditional models to form unified multimodal modelling. By dynamically controlling the output with prompts (such as ‘predict the path under the westward extension of the subtropical high’), it supports real-time scenario simulations for interactive controlled generation. Chib (Reference Chib2024), Peng et al. (Reference Peng, Chen, Zhu, Chen, Yang, Wang and Wang2024) and Lan et al. (Reference Lan, Liu, Fan, Lv, Ren and Cui2025) all use large language models as the foundational models for prediction, processing the data used in the research to make it understandable by the large language models, and incorporating them into prediction scenarios across various research contexts. Human movement trajectories and vehicle movement trajectories provide insights into how large language models can be applied to typhoon trajectory prediction. Considering that typhoon movement is a manifestation of the complex interaction of weather systems, the study focuses on short-term intervals in near-term predictions. Additionally, given the characteristics of typhoon trajectory data, retrieval-enhanced generation technologies are employed to update the large language model in real-time, enabling interaction with the model to complete the prediction process. Based on the application experience of these large language models in other fields, the research on near-term prediction of typhoon trajectories using Retrieval-Augmented Generation (RAG) technology and LLM is feasible. Synthesising the latest advancements in trajectory prediction and previous work in typhoon path prediction, this study is the first to propose the application of combining LLM and RAG for typhoon path prediction.
2. Methodology
2.1. Related work
2.1.1. RAG
Retrieval-Augmented Generation combines retrieval and generation technologies to tackle complex natural language processing tasks (Munir and Sheraz Anjum, Reference Munir and Sheraz Anjum2018). In this study, the Gemini large language model is used as the foundation, optimised for the task of typhoon trajectory prediction, with RAG introduced to enhance performance.
The basic principles of RAG can be summarised in the following steps (as shown in Figure 1).
-
1. Data Retrieval: based on the input.
-
2. Information Integration: the retrieved information is integrated with the input query.
-
3. Generation: the integrated data are used to generate the final output.

Figure 1. Diagram of RAG structure principle.
The formulation of the probability distribution for the generated output is given by
where θ represents the parameters of the generator.
Additionally, RAG involves several supplementary technologies such as attention mechanisms and fusion strategies to better integrate the retrieved information with the input query and generate more accurate responses. While these technologies may introduce additional parameters and computational steps, the fundamental principles and equations remain based on the aforementioned retriever and generator.
In this study, the Retrieval-Augmented Generation question-answering (RAG QA) model was employed to enhance the performance of large language models. An augmented human–computer interaction interface was used to accomplish specific tasks. The detailed process is as follows.
The RAG QA approach was specifically selected, which combines retrieval and generation. This method retrieves information relevant to the questions from a large corpus and uses this information to assist in generating more accurate and targeted answers. The performance of the LLM, enhanced through RAG technology, was improved, enabling better understanding of user input and providing answers that meet expectations.
To implement this process, a human–computer interaction interface was constructed using the railway framework. Users can input questions through this interface and interact with the enhanced LLM. The interface transmits the user’s question to the RAG system, which retrieves relevant information from the corpus and inputs it along with the question into the LLM for processing. The LLM generates answers based on the retrieved information and the question, and then returns the answers to the user. This approach resulted in an efficient, accurate and user-friendly human–computer interaction system.
The specific process is illustrated in Figure 2: (1) the user poses a question through the interface; (2) the server processes the question (e.g. rephrasing), followed by retrieval services; (3, 4) retrieval calls local documents and databases for more efficient search; (5) generating relevant contextual data; (6, 7) the integrated prompt template is used to call the LLM; (8, 9) the LLM generates the answer, which is parsed and post-processed through the prompt; and (10) the final answer is presented to the user. This process seamlessly integrates retrieval and generation, providing users with a more intelligent and convenient service experience.

Figure 2. Question and answering based on RAG and LLM.
2.1.2. Lucas–Kanade
The Lucas–Kanade method (Yang, Reference Yang2018) is used to determine the motion vectors of each pixel in an image by minimising the sum of squared pixel residuals. For the specific task of image processing–analysing the relative motion between two input images, the following steps are proposed to estimate the motion of the camera or objects.
Selection of feature points within the typhoon region: initially, the typhoon region is identified from two satellite cloud images. Within this region, a series of feature points are selected, evenly distributed along the typhoon’s contour. These points possess significant characteristics, including the typhoon centre, corners or edge points.
Calculation of local image gradients: for each selected feature point, its precise location in the first cloud image is determined. Based on the pixel information surrounding this point, a local image gradient is computed to capture the variation trends in the image structure near the point.
Feature point matching and iterative optimisation: in the second cloud image, the position of the feature point from the first image is used as the initial position. Using the local image gradient calculated in the first step, an iterative search is conducted along the gradient direction, as described by equations (4) and (5), to find the optimal matching position that minimises the pixel residuals between the two images at the corresponding feature point locations. This iterative process continuously optimises the feature point positions until a predefined convergence condition is met.
$$\sum\limits_{\Delta {\rm{p }}{{\rm{ }}^{x,y}}}^{\min } {{{\left( {{I_1}\left( {x,y} \right) - {I_2}\left( {x + \Delta x,y + \Delta y} \right)} \right)}^2}}$$
In this context, Δp=[Δx,Δy] represents the displacement vector. By differentiating this equation, a system of linear equations can be obtained:

where
${I_x}$
and
${I_y}$
denote the gradients in the x and y directions, respectively.
Global motion vector calculation: the process of feature point matching and iterative optimisation is repeated until the corresponding motion vectors for all selected feature points are computed. These motion vectors reflect the displacement of each feature point in the cloud images.
2.2. Datasets
2.2.1. Cloud data of the typhoon
This study used real-time infrared cloud image data provided by the FY-4 meteorological satellite, collecting over 8,000 images spanning from July 28, 2023, to September 17, 2023. These images cover the East Asia region from the coast of China to the Western Pacific. It is worth noting that over 8,000 images are sourced from multiple typhoons, including Typhoons Khanun, Saola and Haikui.
High-quality datasets are crucial for monitoring typhoon dynamics and issuing potential meteorological disaster alerts affecting oceanic regions. The FY-4A meteorological satellite, as a geostationary satellite, provides high-quality medium-resolution images for real-time weather monitoring, making it particularly suitable for typhoon observation and early warning. By combining automatic contour segmentation with manual key-point selection, complex typhoon image data are simplified into a series of typhoon particles, facilitating the analysis of typhoon dynamics using mathematical models and computational methods. Analysing Typhoon Kanu’s data allows for the study of its movement trajectory and speed variation, which is essential for understanding and predicting typhoon behaviour. This analysis aids in better understanding typhoon development patterns, thereby improving the accuracy of typhoon path predictions.
The processed cloud image data include information such as the typhoon’s latitude and longitude coordinates, direction, and speed. By automatically delineating typhoon contours in the images and manually selecting 35 tracking points, a series of typhoon particles are formed. The main focus of this study is analysing Typhoon Kanu’s data, with each analysis conducted in 6-h time steps. Additionally, the FY-4A satellite, representing the top level of Chinese meteorological satellites, ensures the quality of cloud image data used in typhoon trajectory point tracking research through its high-resolution, high-timeliness, multi-spectral, stable and accurate images.
2.2.2. Data processing
In academic research, data acquisition and preprocessing are generally divided into two distinct yet interrelated processes. Initially, the collected images undergo a series of processing steps, which include reading the input images and converting them to the HSV colour space. This is followed by applying colour thresholding operations to create mask images for different colour regions and performing morphological operations (such as erosion and dilation) on the mask images to remove noise and fill gaps. The purpose of this step is to provide clear and denoised images for subsequent analysis.
The second step involves the selection of particles and the calculation of feature parameters from the preprocessed images. This includes finding contours in the processed mask images and generating the original images with boundaries by drawing these contours. Subsequently, optical flow analysis is conducted on consecutive image frames using the Lucas–Kanade method to track feature points. Finally, the displacement distance and direction of the feature points between consecutive frames are calculated. The aim of this step is to extract key information regarding cloud motion, including its position, speed and direction.
Overall, the combined application of technologies such as colour segmentation, morphological operations and optical flow analysis enables the detection and analysis of cloud motion. This process effectively visualises the speed and direction of clouds, as illustrated in Figure 3, which shows the processed cloud images. This methodological approach reveals the dynamic characteristics of cloud movement, providing crucial data support for further meteorological research and decision-making.

Figure 3. (a) Binary image. (b) Image after boundary processing.
2.2.3. Haversine
In cloud image analysis, not all initially selected points can maintain accuracy over extended tracking periods. As shown in Figure 4, which illustrates the initial annotation and subsequent changes of typhoon feature points over a 36-h period, some feature points deviate from the typhoon’s contour during continuous tracking. Therefore, this study aims to identify and select points that accurately represent the typhoon’s central movement trajectory, and to collect their latitude and longitude coordinates. To quantify the horizontal displacement of these selected feature points over a specific period, the study employs the Haversine formula to calculate the distance between the tracked points at different times. The distance between tracked points at different times is calculated using the Haversine formula (Mahmoud and Akkari, Reference Mahmoud and Akkari2016), where (R) is the Earth’s radius, valued at 6,371 km:

Figure 4. Changes in the tracked points marked during the typhoon within 36 h.
2.3. RAG establishment and prediction
2.3.1. Model establishment and improvement
In this study, the model leverages RAG technology to enhance the performance of the LLM in understanding and answering complex questions. By integrating RAG, the LLM can retrieve and incorporate extensive historical typhoon data, significantly improving its contextual understanding and accuracy in responses. This approach allows the model to combine retrieved information with the current context, resulting in deeper comprehension of the questions and the generation of more precise, detailed, and relevant texts and answers. The synergy of a broad knowledge base and targeted data retrieval ensures the model’s superior performance in handling complex queries, thereby improving the quality and relevance of its responses.
In a multi-layered system, the overall framework is illustrated in Figure 5. The first layer serves as the user interaction layer, primarily responsible for deploying user interfaces on Web UI or iOS and Android platforms. This layer aims to provide intuitive user experience and also offers API call services to meet specific prediction requirements and cater to user needs.

Figure 5. Deployment architecture of the prediction model.
The second layer, functioning as the local knowledge processing and vector retrieval layer, is primarily responsible for efficiently handling external input documents. This includes, but is not limited to, document parsing, information extraction, and necessary preprocessing tasks to ensure the accuracy and efficiency of subsequent analyses. The third layer, positioned at the system’s foundation, serves as the large language model layer. This layer provides robust language processing capabilities, enabling the invocation of third-party large language models or the use of locally deployed models as a basis for RAG services. Given that running large language models locally typically requires extremely high hardware configurations, which exceed the capabilities of most computers, API calls are employed when interacting with the aforementioned layers. This method ensures service reliability while reducing dependency on hardware resources.
This layered architecture design not only offers flexible user interaction methods but also ensures efficient data processing and accurate language comprehension, thereby providing robust support for various prediction scenarios.
The specific dialogue process for deploying local data into a large language model is illustrated in Figure 6. The local data are processed into JSON file format and embedded in this format, then saved into the OpenAI GPT-4o model via API calls. Subsequently, the data are loaded into the specific predictive dialogue using the pin function. This approach enables the application of local data in predictive methods.

Figure 6. Specific conversation process for embedding local files into an LLM.
Through the user interface, a prediction command is issued to the locally deployed LLM, as illustrated in Figure 7. Initially, the deployed local data are described. Subsequently, the specific time points for prediction are provided, instructing the LLM to use the relevant knowledge and the uploaded local data to forecast the future trajectory of the typhoon. The LLM then generates the prediction results.

Figure 7. Prediction results with direct querying method.
Evidently, the prediction results provided by the LLM in Figure 8 appear to follow only some basic mathematical patterns. As shown in Figure 8, various methods were employed in the study to query the LLM, and the prediction results from these different methods were plotted for comparison. The outcomes were far from satisfactory, with the trajectories primarily derived using simple linear extrapolation technologies. These suboptimal results were attributed to issues such as hallucinations in the large model, which led to errors in file loading and inadequate prompt phrasing during questioning.

Figure 8. Comparison of real and predicted coordinates using various methods.
To address this issue and enhance the final prediction outcomes, one solution is to limit the amount of local data embedded within the platform. Additionally, prior to making predictions, the LLM can be guided to display the already uploaded local datasets, as shown in Figure 9.

Figure 9. Prediction results with optimised querying method.
Incorporating more specialised terminology, such as ‘long-term patterns’ and ‘short-term trends’, and considering factors like volatility and atmospheric characteristics, can help mitigate the issue of insufficient prompt phrases. Using highly specialised vocabulary to constrain and direct the LLM can result in more accurate and effective predictions.
3. Result and analysis
3.1. Experimental environment
The experimental environment is shown in Table 1.
3.2. Evaluation indicators of method
The accuracy of the predicted location of the typhoon’s centre is evaluated by calculating the Mean Absolute Error (MAE) for distance, longitude and latitude, as well as the Root Mean Squared Error (RMSE) for distance. The absolute error is determined using the Haversine formula, and the Mean Absolute Error is then computed based on the sequence number (n) of the test set. Additionally, the Root Mean Squared Error is calculated based on the absolute error. The calculation formulae are as follows:
$$RMSE = \sqrt {{1 \over n}\sum\limits_{i = 1}^n {{D_i}^2} }$$
where
$la{t_{real}}$
represents the actual latitudinal coordinate of the current typhoon centre,
$la{t_{pred}}$
represents the predicted latitudinal coordinate of the current typhoon centre,
$lo{n_{real}}$
represents the actual longitudinal coordinate of the current typhoon centre and
$lo{n_{pred}}$
represents the predicted longitudinal coordinate of the current typhoon center.
3.3. Experimental result
In the experiment, the prediction of typhoon centre trajectories was conducted by embedding a small amount of pre-prediction trajectory coordinate data into the LLM. These data, consisting of approximately 100 trajectory points from the 12 h prior to the prediction time, had time intervals ranging from 4 to 15 min.
Table 2 provides the prediction error results for four time periods and indicates the specific times of the prediction trajectories. The predicted trajectories for four-time spans (2 h, 4 h, 7 h and 12 h) were consistent with the actual typhoon trajectory trends. Figure 10 compares the predicted and actual trajectories of Typhoon Khanun from July to August 2023, with the red dashed line representing the predicted path and the blue solid line representing the actual path. The predicted typhoon centre coordinates demonstrated excellent performance in terms of both latitude/longitude errors and spherical distance errors.
Table 1. Configuration of experimental platform

Table 2. Comparison of results for different prediction time intervals


Figure 10. Comparison of typhoon trajectory predictions and actual tracks across 2-h, 4-h, 7-h and 12-h intervals.
However, the proposed method has certain limitations. As the prediction time span increases, the errors tend to grow, as shown in Table 2. The prediction errors for 2-h, 4-h and 7-h spans increase with longer time spans. This indicates that for long-term predictions, the performance of the proposed method gradually deteriorates. Interestingly, at a 12-h time span, the error decreases and the prediction performance is even better than that of the 4-h span. This anomaly is likely related to the complexity of typhoon trajectory changes over certain time periods.
3.4. Comparison with other methods
Table 3 provides a detailed comparison of the prediction errors between the proposed method in this study and other existing methods. Our approach employs LLM in conjunction with RAG technology to predict typhoon trajectories, demonstrating superior accuracy in error metrics. The comparative data for our method are based on a 12-h prediction horizon. It is noteworthy that the prediction horizons of the models listed in the table are not uniform; for instance, the model of Lu et al. (Reference Lu, Sun, Xu, Wang, Zheng, Xie and Wang2022) is designed for longer-term forecasts.
Table 3. Comparison of the results with other methods

Maintaining consistency in prediction intervals, our method shows a reduction in
$MA{E_{lat}}$
and
$MA{E_{lon}}$
by an order of magnitude compared with those of Qin et al. (Reference Qin, Tang, Lu and Lao2022). Additionally, the MAE of our method is 58 km less than that reported by Ruttgers et al. (Reference Ruttgers, Jeon, Lee and You2022). Compared with Hao and Jin (Reference Hao and Jin2022) and Lian et al. (Reference Lian, Dong, Zhang, Pan and Liu2020), the reduction in error is even more significant. These results indicate that even with limited data support, our method effectively leverages LLM and RAG technologies to achieve more precise predictions.
This demonstrates that our method can provide high-quality predictions within shorter forecast intervals. Not only does this corroborate the efficiency of our approach in handling complex prediction problems, but it also underscores its reliability in practical applications. These findings further highlight the potential of LLM and RAG technologies in enhancing the accuracy of typhoon trajectory predictions, offering valuable insights for future research in related fields.
The main advantages and innovations of the LLM + RAG prediction method proposed in this paper, compared with various other methods, are as follows.
-
1. The most notable advantage is the significant reduction in dependency on data volume. For example, the model of Qin et al. (Reference Qin, Tang, Lu and Lao2022) applied data covering 76 typhoons affecting or about to affect the Korean Peninsula from 1993 to 2017. To overcome the lack of satellite image data, reanalysis data were introduced, expanding the dataset to 757 typhoons.
-
2. The most distinctive feature is the use of a large language model as the basis for prediction. From a deep learning perspective, the emergence of large language models represents profound exploration and application in this field. Large language models, with their rich data reserves and generalisation capabilities, are suitable for various scenarios. This means that large language models are deep learning models that have been extensively trained and possess a vast knowledge reserve. For instance, previous deep learning training methods achieved a functional leap from 0 to 1, and while various methods and explorations can make predictions more accurate and enhance the model’s generalisation ability, directly using large language models as the prediction foundation is like pursuing prediction accuracy from a height of 100.
3.5. Discussion and prospects
The experimental results indicate that employing LLM and RAG technologies for prediction can yield accurate results with minimal data and without the need for extensive modelling and parameter tuning. The LLM effectively learns the temporal coordinate data of typhoons, producing reliable predictions. However, despite its significant advantages, LLM may still encounter some unknown errors during the prediction process, such as hallucinations and minor information processing biases.
To enhance the accuracy and generalisability of predictions, future work should focus on fine-tuning the LLM. Fine-tuning typically involves making minor adjustments to the network architecture, hyperparameters or training process to optimise model performance. For the specific challenge of typhoon trajectory prediction, fine-tuning could improve the LLM’s ability to capture the patterns of typhoon movements, thereby enhancing prediction accuracy. This may involve adjusting network weights, optimising learning rates or introducing new data augmentation strategies.
Though accurately predicting typhoon trajectories remains a complex challenge, we remain optimistic about the potential of LLM technology as it continues to mature and become more tailored to address specific problems in this domain.
4. Conclusion
This paper introduces an innovative approach to typhoon path near-term prediction by integrating an LLM with RAG technology, marking the first application of this combination in the field. The method leverages the adaptability of LLMs and the retrieval capabilities of RAG to significantly enhance the accuracy and efficiency of predictions. The research uses FY-4 satellite imagery as the primary data source, which, after preprocessing and tracking with the Lucas–Kanade method, yields key dynamic change data of typhoon feature points. These data are meticulously organised into a dataset and embedded into the LLM as a vector database, providing essential input for the RAG process.
The RAG process plays a pivotal role in this study. It enhances the generative capabilities of the LLM by efficiently retrieving and using specific information from the vector database, leading to more precise predictions. Furthermore, an interactive question-and-answer format via the User Interface (UI) facilitates user interaction with the model to obtain immediate prediction results. During model deployment and optimisation, the parameters of the LLM were finely tuned and effective prompting words were introduced to further refine the prediction process.
The experimental results demonstrate the high accuracy of this method in short-term typhoon path prediction. This innovative approach not only offers a new perspective for typhoon path near-term prediction but also paves the way for new technological avenues in meteorological warning and disaster prevention, holding significant academic value and application prospects.
Funding statement
The project was supported by the National Key Research and Development Program of China (2022YFB4301400).



