本文排版定稿已在中国知网网络首发,如需阅读全文请打开知网首页,并搜索该论文题目即可查看。
基于无人机激光雷达数据的杉木人工林生物量混合效应模型研究
Study on Mixed-Effects Models for Biomass Estimation of Cunninghamia lanceolata Plantations Based on ULS Data
-
摘要: 基于广东省6个县(市)105个样地实测生物量与无人机激光雷达扫描(ULS)点云特征,对46项高度变量进行主成分分析并以PC1构建林分高度主成分(H_PCA),结合Elastic Net从候选点云特征变量中筛选关键因子,分别构建GLM、GLMM、GAM与GAMM(GLMM/GAMM引入县域随机效应),采用全数据拟合与五折交叉验证评价模型性能,并以组内相关系数(ICC)量化县域尺度差异贡献。结果表明:PCA与Elastic Net组合可有效筛选出高度主成分(H_PCA)、叶面积指数(LAI)和密度变量(D1)等关键因子,模型稳定性和解释性显著提升。在所有模型中,GAMM表现最优,其全数据拟合精度(R2=0.889,RMSE=21.362 t/hm2,MAE=16.068 t/hm2)及五折交叉验证结果(R2=0.831,RMSE=23.818 t/hm2,MAE=17.683 t/hm2)均优于其他模型。GLMM模型的ICC为0.243,表明约24.3%的生物量变异可归因于县域层间差异,引入县域分层能够显著提升模型的解释能力与预测精度。主成分分析与Elastic Net相结合的变量筛选策略能够有效简化点云特征体系;同时,将县域作为随机效应引入混合效应模型,有助于刻画县域尺度综合差异并显著提升区域尺度杉木人工林生物量估算的精度与稳健性。Abstract: Based on field-measured biomass data from 105 sample plots across six counties/county-level cities in Guangdong Province and point-cloud metrics derived from unmanned aerial vehicle laser scanning (ULS), principal component analysis (PCA) was first performed on 46 height-related variables, and the first principal component (PC1) was used to construct a stand-height principal component (H_PCA). Elastic Net was then applied to select key predictors from candidate point-cloud metrics. Four models, including a generalized linear model (GLM), generalized linear mixed-effects model (GLMM), generalized additive model (GAM), and generalized additive mixed model (GAMM), were established, with county-level random effects incorporated into the GLMM and GAMM. Model performance was evaluated using full-data fitting and five-fold cross-validation, and the intra-class correlation coefficient (ICC) was used to quantify the contribution of county-level variation. The results showed that the combination of PCA and Elastic Net effectively selected key predictors, including the stand-height principal component (H_PCA), leaf area index (LAI), and density metric D1, thereby improving model stability and interpretability. Among all models, the GAMM performed best, with full-data fitting accuracy of R2 = 0.889, RMSE = 21.362 t/hm2, and MAE = 16.068 t/hm2, and five-fold cross-validation accuracy of R2 = 0.831, RMSE = 23.818 t/hm2, and MAE = 17.683 t/hm2, outperforming the other models. The ICC of the GLMM was 0.243, indicating that approximately 24.3% of the variation in biomass could be attributed to inter-county differences, and that introducing county-level stratification could substantially improve the explanatory power and predictive accuracy of the model. Overall, the integration of PCA and Elastic Net provides an effective strategy for simplifying the point-cloud feature system. Moreover, incorporating county as a random effect in mixed-effects models helps characterize county-level composite differences and improves the accuracy and robustness of regional-scale biomass estimation for Chinese fir plantations.
下载: