nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo journalinfonormal searchdiv searchzone qikanlogo popupnotification paper paperNew
2025, 06, v.24 610-617+663
结合注意力机制的HRNet高分辨率遥感影像光伏分割研究
基金项目(Foundation): 浙江省生态环境科研和成果推广项目(2022HT0031)
邮箱(Email): huzunying@zjemc.org.cn;
DOI: 10.19926/j.cnki.issn.1674-232X.2024.04.302
摘要:

为准确提取高分辨率遥感影像中的光伏空间分布信息,解决因背景信息复杂导致的分类精度低和边界分割模糊等问题,提出了EMAHRNet模型.该模型使用HRNet(high-resolution net)作为主干网络,采用高、低分辨率并行融合的方式学习图像特征信息并引入可训练的融合权重,优化融合过程.同时,引入OCR(object-contextual representations)模块,根据图像的像素特征和目标对象的区域特征形成图像上下文信息,建立像素与区域的相关性,增强模型对光伏边缘信息的提取能力.此外,使用多尺度注意力模块EMA(efficient multi-scale attention)捕捉不同尺度区域特征,有效提升模型在复杂空间背景下的分割能力.基于浙江省2022年2 m分辨率的遥感影像自建光伏数据集,实验结果表明,EMAHRNet模型的F1分数、交并比和准确率分别达到95.1%、84.8%和97.7%,相较于HRNet模型,分别提升1.8、2.4和0.8百分点,且在建筑、水面、耕地、林地和工业用地等复杂背景下均取得较好的分割效果.

Abstract:

Accurate extraction of photovoltaic(PV) spatial distribution from high-resolution remote sensing imagery is often challenged by complex backgrounds, leading to low classification accuracy and blurred boundaries. To address these issues, this paper proposed the EMAHRNet model. The model employed HRNet(high-resolution net) as its backbone, utilizing a parallel architecture that integrated high-and low-resolution features for learning image characteristics, and introduced trainable weights to optimize the feature fusion process. An OCR(object-contextual representations) module was incorporated to establish contextual relationships between pixels and object regions, enhancing the extraction of PV edge information. Furthermore, the efficient multi-scale attention(EMA) module was applied to capture regional features at different scales, significantly improving the model's segmentation capability in complex spatial contexts. Experimental results on a self-built PV dataset comprising 2-meter resolution remote sensing images from Zhejiang Province in 2022 demonstrated that the EMAHRNet model achieved a F1-score of 95.1%, an intersection of union of 84.8%, and an accuracy of 97.7%. Compared to HRNet model, these metrics represented improvements of 1.8,2.4 and 0.8 percentage points, respectively. The proposed model also exhibited superior segmentation performance across diverse and complex backgrounds, including buildings, water bodies, farmland, forested areas, and industrial land.

参考文献

[1] 魏景东,赵增海,郭雁珩,等.2021年中国光伏发电发展现状与展望[J].水力发电,2022,48(10):4-8.

[2] 刘宏哲,陶相如,徐成,等.一种优化多尺度特征融合的人体姿态估计方法[J].机械工程学报,2024,60(16):306-313.

[3] 王胜利.机器学习方法在光伏电站遥感提取中的应用[D].徐州:江苏师范大学,2018.

[4] 李彦甫.基于高分辨率影像的光伏电场信息提取鲁棒性分析及其热环境效应研究[D].兰州:兰州大学,2016.

[5] 刘芸,杨娟,黄林峰,等.基于高分卫星影像的威宁县光伏电站提取及分布特征分析[J].能源与环保,2024,46(2):204-208.

[6] 吴永静,吴锦超,林超,等.基于深度学习的高分辨率遥感影像光伏用地提取[J].测绘通报,2021(5):96-101.

[7] RONNEBERGER O,FISCHER P,BROX T.U-Net:convolutional networks for biomedical image segmentation[C]//International Conference on Medical Image Computing and Computer-assisted Intervention.Cham:Springer,2015:234-241.

[8] HE K M,ZHANG X Y,REN S Q,et al.Deep residual learning for image recognition[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE,2016:770-778.

[9] ZHAO H S,SHI J P,QI X J,et al.Pyramid scene parsing network[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE,2017:6230-6239.

[10] QI X Q,LI K Q,LIU P K,et al.Deep attention and multi-scale networks for accurate remote sensing image segmentation[J].IEEE Access,2020,8:146627-146639.

[11] WANG J D,SUN K,CHENG T H,et al.Deep high-resolution representation learning for visual recognition[J].IEEE Trans Pattern Anal Mach Intell,2020,43(10):3349-3364.

[12] MORADI SIZKOUHI A M,AGHAEI M,ESMAILIFAR S M,et al.Automatic boundary extraction of large-scale photovoltaic plants using a fully convolutional network on aerial imagery[J].IEEE J Photovolt,2020,10(4):1061-1067.

[13] PéREZ-GONZáLEZ A,JARAMILLO-DUQUE á,CANO-QUINTERO J B.Automatic boundary extraction for photovoltaic plants using the deep learning U-Net model[J].Appl Sci,2021,11(14):6524.

[14] 李青,李海涛,李辉,等.注意力机制和全局卷积在光伏板分割中的应用[J].计算机工程与应用,2024,60(4):237-248.

[15] 刘桂生,丁鑫,祝锐,等.结合HDC和Attention的高分遥感影像光伏板提取研究[J].计算机工程与应用,2024,60(14):357-366.

[16] OUYANG D L,HE S,ZHANG G Z,et al.Efficient multi-scale attention module with cross-spatial learning[C]//2023 IEEE International Conference on Acoustics,Speech and Signal Processing.Rhodes Island:IEEE,2023:1-5.

[17] YUAN Y H,CHEN X L,WANG J D.Object-contextual representations for semantic segmentation[C]//16th European Conference on Computer Vision.Cham:Springer,2020:173-190.

基本信息:

DOI:10.19926/j.cnki.issn.1674-232X.2024.04.302

中图分类号:TP751;TM615

引用信息:

[1]陈耀忠,沈家晓,王嘉芃,等.结合注意力机制的HRNet高分辨率遥感影像光伏分割研究[J].杭州师范大学学报(自然科学版),2025,24(06):610-617+663.DOI:10.19926/j.cnki.issn.1674-232X.2024.04.302.

基金信息:

浙江省生态环境科研和成果推广项目(2022HT0031)

投稿时间:

2024-04-30

投稿日期(年):

2024

终审时间:

2025-10-28

终审日期(年):

2025

审稿周期(年):

2

发布时间:

2025-11-30

出版时间:

2025-11-30

检 索 高级检索

引用

GB/T 7714-2015 格式引文
MLA格式引文
APA格式引文