Aerial and Optical Images-Based Plant Species Segmentation Using Enhancing Nested Downsampling Features

Lin, Chih-Wei; Lin, Mengxiang; Hong, Yu

Title:	Aerial and Optical Images-Based Plant Species Segmentation Using Enhancing Nested Downsampling Features
Authors:	Lin, Chih-Wei Lin, Mengxiang Hong, Yu
Keywords:	deep learning;plant species;semantic segmentation;features enhancing;SEMANTIC SEGMENTATION;CLASSIFICATION;IDENTIFICATION;VEGETATION
Issue Date:	Dec-2021
Publisher:	MDPI
Journal Volume:	12
Journal Issue:	12
Source:	Forests
Abstract:	Plant species, structural combination, and spatial distribution in different regions should be adapted to local conditions, and the reasonable arrangement can bring the best ecological effect. Therefore, it is essential to understand the classification and distribution of plant species. This paper proposed an end-to-end network with Enhancing Nested Downsampling features (END-Net) to solve complex and challenging plant species segmentation tasks. There are two meaningful operations in the proposed network: (1) A compact and complete encoder-decoder structure nests in the down-sampling process; it makes each downsampling block obtain the equal feature size of input and output to get more in-depth plant species information. (2) The downsampling process of the encoder-decoder framework adopts a novel pixel-based enhance module. The enhanced module adaptively enhances each pixel's features with the designed learnable variable map, which is as large as the corresponding feature map and has nxn variables; it can capture and enhance each pixel's information flexibly effectively. In the experiments, our END-Net compared with eleven state-of-the-art semantic segmentation architectures on the self-collected dataset, it has the best PA (Pixel Accuracy) score and FWloU (Frequency Weighted Intersection over Union) accuracy and achieves 84.52% and 74.96%, respectively. END-Net is a lightweight model with excellent performance; it is practical in complex vegetation distribution with aerial and optical images. END-Net has the following merits: (1) The proposed enhancing module utilizes the learnable variable map to enhance features of each pixel adaptively. (2) We nest a tiny encoder-decoder module into the downsampling block to obtain the in-depth plant species features with the same scale in- and out-features. (3) We embed the enhancing module into the nested model to enhance and extract distinct plant species features. (4) We construct a specific plant dataset that collects the optical images-based plant picture captured by drone with sixteen species.
URI:	http://scholars.ntou.edu.tw/handle/123456789/23872
ISSN:	1999-4907
DOI:	10.3390/f12121695
Appears in Collections:	電機工程學系

Show full item record

Page view(s)

142

checked on Jun 30, 2025

Google Scholar^TM

Check

DSpace CRIS

Page view(s)

Google Scholar^TM

Altmetric

Altmetric

Page view(s)

Google ScholarTM

Altmetric

Altmetric

Google Scholar^TM