Today,approximately 56% of the world's population—4.4 billion people—live in cities. Urban trees play a significant role in mitigating global climate change[
Journal of Infrared and Millimeter Waves, Volume. 44, Issue 2, 197(2025)
Urban tree
Urban tree species provide various essential ecosystem services in cities, such as regulating urban temperatures, reducing noise, capturing carbon, and mitigating the urban heat island effect. The quality of these services is influenced by species diversity, tree health, and the distribution and composition of trees. Traditionally, data on urban trees has been collected through field surveys and manual interpretation of remote sensing images. In this study, we evaluated the effectiveness of multispectral airborne laser scanning (ALS) data in classifying 24 common urban roadside tree species in Espoo, Finland. Tree crown structure information, intensity features, and spectral data were used for classification. Eight different machine learning algorithms were tested, with the extra trees (ET) algorithm performing the best, achieving an overall accuracy of 71.7% using multispectral LiDAR data. This result highlights that integrating structural and spectral information within a single framework can improve classification accuracy. Future research will focus on identifying the most important features for species classification and developing algorithms with greater efficiency and accuracy.
Introduction
Today,approximately 56% of the world's population—4.4 billion people—live in cities. Urban trees play a significant role in mitigating global climate change[
Airborne laser scanning(ALS)is effective for extracting biophysical variables and revising forest inventory maps. The successful use of ALS data has been demonstrated for various applications. For example,ALS has been used to estimate tree height[
Previous studies have also revealed that combining multispectral information with 3D ALS data can improve the accuracy of tree extraction and tree species classification,as we can take advantage of both datasets. However,challenging factors limit the effective operational use of the fused datasets[
Given the limitations of traditional optical remote sensing in capturing three-dimensional forest structures,it is essential to explore the potential of multispectral laser scanning for urban tree inventories,particularly for species classification. This study aims to assess the feasibility of using multispectral ALS data for urban tree species classification and to analyze the information content of features derived from point clouds and intensity data.
1 Materials and methods
1.1 Study area and establishment of sample plots
The MLS datasets used in this study were acquired in a suburban area in Espoolahti,southern Finland(60°9′18″N,24°38´24″E)in the southern Boreal Forest Zone. We choose around 822 trees in this area as our field dataset. The land area is approximately 5 km2. In our research,we concentrated solely on the vegetated areas,excluding the sea using a water mask created from topographic map data. The area included a diverse range of boreal tree species.
The points were updated through visual interpretation of Titan data and open datasets from the City of Espoo,the National Land Survey of Finland,Google Maps,and Google Street View. Field checks validated the analysis and resolved uncertainties. The reference points' attributes included species,geographic location,living conditions,tree height,and planting date for each tree.
Figure 1.Map of the study area and tree samples in the research area.
1.2 Multispectral ALS data
Multispectral Optech Titan data(Teledyne Optech,Toronto,ON,Canada)for the study area were collected in May and June 2016 in collaboration with TerraTec Oy(Helsinki,Finland)from a 650 m flight height. The data acquisition was carried out using a fixed-wing aircraft flying at a constant altitude. The sensor comprises three Titan channels:green(532 nm),near-infrared(1 064 nm),and shortwave infrared(1 550 nm). Each channel provided separate point clouds. In our preprocessed dataset,the point densities over land areas were approximately 9 points/m² for Channel 1,9 points/m² for Channel 2,and 8 points/m² for Channel 3.
TerraScan(TerraSolid Oy,Helsinki,Finland)was used to preprocess the ALS data and differentiate between ground and nonground points using a standardized procedure. This procedure involved removing noise,such as points detected below the ground level or above the canopy. Subsequently,the point clouds were height-normalized. Ground elevation was subtracted from the point cloud height measurements using a digital terrain model created from the classified ground points of the three channels to eliminate potential discrepancies.
Radiometric calibration of ALS intensity is crucial to ensure successful classification. Therefore,in this study,we implemented relative radiometric calibration. We observed that the intensity values were higher in the middle of the flight path compared to other areas and decreased with scanning height. A range correction was applied to mitigate such effects.
where
1.3 Creating canopy height model and single tree detection
Individual trees were detected using a minimum curvature-based algorithm,which started with creating a canopy height model(CHM). According to our field dataset of each tree coordination,we set the potential crown area within 5 m2. A local maximum filtering algorithm was used to find the treetops in this area. Subsequently,the watershed segmentation method was used to delineate tree crown boundaries without setting a flow threshold in the CHM. Eventually,the point cloud of each tree from the multispectral ALS dataset was created. In the segmentation process,the shape and position of individual tree crowns were identified using the segment boundaries and the location of the highest point within each segment. In this study,first return points from all three channels were utilized to generate CHM.
1.4 Multispectral ALS data feature extraction
In this experiment,the features were primarily divided into two types:intensity features and geometric features. The maximum height(Hmax)of each tree was calculated from the highest point of all point cloud in each tree segment.
Simultaneously,we got 137 features in each channel from the multispectral ALS data.
|
1.5 Tree species classification and accuracy evaluation
In this study,we use 8 machine learning algorithms to compare the classification of tree species.:extra trees(ET),random forest(RF),K-nearest neighbour(KNN),logistic regression(LR),linear discriminant analysis(LDA),classification and regression tree(CART),naive bayes(NB),support vector machine(SVM). Tree species were estimated based on prediction models by 8 machine learning algorithms using tree features as predictors and tree species as a response for correctly detected trees.
2 Results
2.1 Accuracy of classification
As presented in
Figure 2.Titan intensity image of Study area in Espoolahti(Red:Channel 1;Green:Channel 2;Blue:Channel 3).
Figure 3.The comparison of classification accuracy of 24 tree species:ET,RF,KNN,LR,LDA,CART,NB,SVM
The confusion matrix analysis reveals a model that performs well for most classes but struggles with a few,particularly Quercus and Sorbus according to
|
Figure 4.The confusion matrix of classification with geometric and intensity features for each species.
2.2 Feature importance analysis
We also investigated which input features and channels are most relevant for tree species classification based on the measure provided by the RF algorithm for assessing feature importance. If a feature influences the prediction,permuting its values should affect the model error. If a feature is not influential,then permuting its values should have little or no effect on the model error.
|
3 Conclusions
Multispectral LiDAR data improved the classification accuracy by approximately 5% to 10% for all channels compared to each channel. This proves our hypothesis about the ability of mALS features in classification. For example,the overall accuracy of 71.7% was obtained in multispectral LiDAR all-channel data,while accuracies of 65.7%,68.3%,and 64.8% were achieved when using only Channel 1,Channel 2,and Channel 3,respectively. Our findings demonstrated the advantage of combining multichannel features over single-channel data in classifying urban trees. However,the sample size of each tree species in this experiment was uneven,which may have affected the model's accuracy. Consequently,a larger and more representative sample will be used in future research. The imbalance in measurement samples reduced classification accuracy to some extent. Addressing this limitation will be a key focus in subsequent studies.
In this study,eight machine learning algorithms were evaluated for their classification performance,each demonstrating distinct strengths and limitations. The selection of an appropriate classification algorithm depends on the specific characteristics of the dataset,including size,dimensionality,and the underlying relationship between features and class labels. Extra trees(ET)and random forests(RF)proved effective in our study due to their ability to handle large,high-dimensional datasets and their robustness against overfitting,which suited the conditions of our dataset. Naive Bayes(NB)was efficient and scalable,especially for high-dimensional data,but its assumption of feature independence limited its applicability in cases with high feature correlation.
It is also important to note that overall accuracy(OA)is influenced by factors such as species composition,stand structure,age,and the methods used to select the best features,which vary among studies. In this research,however,the intensity of laser returns was not calibrated. This limitation can be addressed in future studies. First,we can investigate whether calibrated intensity affects classification results. Second,the use of MCI features in this study mitigated potential variations in intensity.
In conclusion,the ability of mALS compared to single-channel ALS(SCI-Ch)data to characterize tree species in urban areas was assessed in this study. Our classification results indicate that mALS data provided more accurate results than single-channel ALS data for urban tree species classification.
Get Citation
Copy Citation Text
Pei-Lun HU, Yu-Wei CHEN, Mohammad IMANGHOLILOO, Markus HOLOPAINEN, Yi-Cheng WANG, Juha HYYPPÄ. Urban tree
Category: Infrared Spectroscopy and Remote Sensing Technology
Received: Jun. 26, 2024
Accepted: --
Published Online: Mar. 14, 2025
The Author Email: CHEN Yu-Wei (chinaway.fgi@gmail.com), WANG Yi-Cheng (skl_wyc@163.com)