Framework of Virtual Plantation Forest Modeling and Data Analysis for Digital Twin

1. Introduction
Based on the work in the experimental forest of Triploid Populus Tomentosa in Qingping, Shandong, this study has constructed a framework for virtual plantation forest modeling and data analysis oriented towards digital twin. The main contributions are: (1) proposing a digital twin-based virtual poplar plantation forest system architecture for forest management structures and systematically analyzing the main role of the architecture mechanism; (2) based on collected point cloud data from the forest, proposing a transition particle flow method combined with the AdTree method to model trees and establishing and optimizing virtual plantation forest scene on this basis; and (3) based on the experimental monitoring data, analyzing forest monitoring data and constructing a database, fitting tree growth formulas based on the measured tree diameter at breast height data, and thus predicting and simulating related physiological indicators. From this, a virtual poplar plantation system can be initially built based on virtual reality to support the realization of the virtual forest management model through the networked and digital application of forest land information.
2. Methods
The plantation forest in Qingping, Shandong Province, is a relatively gentle plain terrain with one Triploid Populus Tomentosa sapling planted at 2 m longitudinal and 3 m lateral intervals. A total of 2160 poplar trees seedlings were planted in this stand area.
2.1. Framework Overview
The physical world is the real experimental forest, which includes the actual forest management, forest operation, and various sensing and monitoring equipment; the digital world is a digital “twin” of the physical world, which can be primarily divided into virtual forest component, data analysis component, object interaction component, etc., based on its functional requirements; and the researchers are the actual operators of the architecture. In the perfect poplar plantation forest digital twin system, while the physical world conducts its own management, experiments and operations, the real information and data captured and collected, such as pictures and videos, are transmitted to the digital world in real time. The digital world focuses on the elements of forest ecological environment and employs integrated analysis and mixed modeling to decipher its complexity. Upon receiving information and data from the physical world, the digital world processes and performs intelligent calculations. After simulation and deduction, the digital world provides feedback and decisions to researchers and the physical world to support decision-making and optimization for forest experiments and management. In response to the feedback, the physical world implements intelligent instructions to maintain or promptly change controllable variables of the forest, thereby promoting the conduct of physical world experiments. Meanwhile, researchers collect feedback from both the physical and digital worlds and make corresponding adjustments, thus forming a large digital twin cycle.
Currently, the overall management and experiments of the physical world and digital world in the poplar plantation are operated by “researchers”. Due to the limitations of the experimental site, technology and equipment, the current physical world has not been able to achieve adaptive intelligent instruction control of variables. In addition, the digital world currently does not consider the complex effects among various factors, and direct information feedback to the physical world can only be achieved through manual intervention. On this basis, this paper mainly aims to discuss the framework of virtual forest modeling component and data analysis component in the digital world of the poplar plantation.
In the system architecture of the poplar plantation, the virtual forest of the digital world is the basic carrier of the whole system. It needs to simulate the main research objects and build a virtual plantation scene. The application data in the virtual forest mainly consists of images and point cloud information from the physical world. In terms of data analysis, its main task is to connect and present data from the physical world, including the display of sensor monitoring equipment data and the analysis and prediction of some data. The main input data for data analysis are sensor monitoring equipment data and some manually measured data collected periodically. In this way, key issues analysis and research of the components and functional implementations are carried out, laying a certain foundation for the future construction of the virtual forest digital twin system.
2.2. Virtual Forest Modeling
2.2.1. Point Cloud Data Pre-Processing
2.2.2. Tree Modeling
where the coordinates of the vertex n are (xn, yn).
Then the Dijkstra algorithm is used to search the shortest path of the point cloud with breadth first. The point set S is used to store the vertices on the path, and the point set U is used to store the non-path vertices, {S,U} ∈ P. We defined the array dis to store the weights from the origin to each vertex. We iterated through the weights from the origin to the vertices, added the vertices to the point set S when the weights were smallest, and updated dis. We kept cycling through this process until all the vertices were included in S, thus obtaining the initial skeleton representation of the tree MST.
where R is the fitted radius of the main branch, r is the radius of the sub-branch, and α is the ratio of the thickness of the main branch to the sum of the thicknesses of the side branches.
where ya takes a value equal to h. We performed the calculation of the downward point a″ similarly. The area with a height of 2h′ is the particle flow transition zone, and the part above the transition zone is the random particle flow zone, and the part below is the empty zone. Generally, the value of h′ is set between the main trunk and multiple branches, thus simulating the characteristics of real trees with few leaves at branches.
where Fleaf is the actual number of particles in the region, F is the initial number of emitted particles, RGBblack is the region blackness value, Ntree is the number of tree model vertices in the region, and k is the percentage of particles emitted under RGBblack. When k = 80% and RGBblack = 50% in the transition particle flow region, a more ideal state of the tree leaves emitting particles can be achieved. The particle emitting operation is performed on the overall trunk model to detect the vertex color marker, and the vertices marked as white will not show the emitting particles. The transition area has a higher value of whiteness, but still shows some particles. We set the emitted particles as leaf-shaped facets and the material as a poplar tree leaf map. We randomized the particle size and rotation angle, and emitted according to the markers to obtain the leaf particle stream. The higher the number of particles initially emitted, the higher the overall number of leaves generated. By controlling the amount of foliage to select the season and time of year of the aspen trees, the virtual scene is more selective.
2.2.3. Virtual Plantation Simulation
2.2.4. Scene Optimization
The virtual forest contains a large number of models with a high degree of redundancy. There are different sizes and levels of detail depending on how much detail is required. The simulation of the environment will also involve some specific dynamic scenes, which will inevitably lead to serious memory consumption, rendering pressure, and occasional lagging problems.
2.3. Data Analysis
By using tree growth equations and existing data, it is possible to predict the biomass of trees that are difficult to measure. Predictions involve assessing the future growth status of trees, and accurate predictions can facilitate guidance for field experiments. For future monitoring data, exponential smoothing algorithms can be used to simulate the past and current data, and the goodness of fit R2 can represent its prediction quality. Thus, by combining existing breast diameter data with data visualization methods, the trends in tree growth can be intuitively represented in the system, and the future trends and range of the data can also be clearly indicated.
Analyzing and predicting the data can provide more effective decision-making for experimental forest management and basic field work. In combination with the experimental results of researchers, corresponding solutions can be proposed for abnormal situations that may occur in the analysis of forest data, such as excessively high peaks. Targeted measures can be formed and feedback can be provided in a timely manner when such situations arise. This can help forestry researchers prevent events such as forest droughts and fires.
2.3.1. Database Construction
Add the relevant class libraries to connect to the SQL Server database in the system project. In the NET framework, the System.Data.SqlClient namespace is introduced to create objects that receive data variables and connect to the database through the SqlConnection object. Connection is equivalent to the connection channel between the system and the database, and the main access type is determined by the database. Command can execute commands and return results from the source data. We used SqlDataAdapter adapter to make database table calls and create DataSet for instantiation. DataAdapter acts as a connection between the instantiated object and the source data, it can fill the DataSet by retrieving the source data, and also modify the source data by Update. After instantiating the data into the DataSet object, it can directly call the data for analysis and simulation, etc., and also conduct the foundation for future data linkage in real time.
2.3.2. Data Visualization
Some of the continuous data from the DataSet were selected for visualization. In this paper, three parts of sensor monitoring data are selected for visualization and analysis: groundwater depth data, tension gauge data and stem diameter sensor data. Their main semantic fields are date, different processing methods of experiments, and different monitoring points. Combining the characteristics of the data and the needs of foresters, line charts is used to present the trend changes between the different data. Using the date of data monitoring as the x-axis and the data of different treatments as the y-axis, multiple line graphs are drawn by UGUI controls. If the data from the same device includes multiple monitoring points, they are marked using different legends and colors to facilitate the observation of status information of different tree collection points in the same time state.
2.3.3. Specific Data Analysis and Simulation
For stand growth status, predicting tree growth under forest management status from specific data or simulating future trends of impact factors can help researchers to develop subsequent management plans and disaster prevention. In this paper, we choose to simulate tree height, above-ground biomass, below-ground biomass, and total biomass as physiological indicators through regular manual measurements of tree diameter at breast height (DBH).
where H is the tree height, D is the tree DBH, α1 takes the value of 8.586 and β1 takes the value of 9.1749. The relationship between DBH and total biomass BS for a given range is shown in Equation (6):
where BS1 includes trunk, branch, root stump, thick root and thin root parts, α2 takes the value of 1.3121 and β2 takes the value of 0.2907; BS2 increased both leaves and deciduous leaves compared to BS1, with α3 taking the value of 1.7956 and β3 taking the value of 0.2708. The relationship between DBH and aboveground biomass BA for a given range is shown in Equation (7):
where BA1 represents the above-ground biomass including only the trunk and branch parts, with values of 0.0319 for α4 and 2.8303 for β4; BA2 represents the above-ground biomass with four parts: trunk, branch, leaf and deciduous leaf, and the value of α5 is 0.0673 and β5 is 2.5651. The relationship between DBH and belowground biomass BU for a given range is shown in Equation (8):
where BU mainly includes two parts, root stump and thick root, and the value of α6 is taken as 3.3678 and the value of β6 is taken as 4.0467.
where R2 represents the percentage of the regression sum of squares in the total sum of squares of variance for object y, yi is the sample observation, and
is the point on the regression line as shown in Equation (10). From Equation (9), we know that the range of R2 is (0, 1), and the better fit is when R2 is closer to 1. For Equations (5)–(8), the R2 ranges from 74% to 95% as can be seen from Table 2. When the amount of data n ≥ 100, R2 ≥ 87%. The physiological data obtained from real-time calculations based on DBH and tree growth equations are continuous. However, the tree model presents the same visual changes in tree height following the update cycle of Table 1, in which the measurement period of DBH still has an impact on it. In addition, other monitoring data, such as transpiration rate, stomatal conductance, and leaf water potential, which are related to stem sap flow, can also be simulated by this method.
Comments are closed.