Cong-Zhong Wu, Jun Sun, Jing Wang, Liang-Feng Xu, Shu Zhan. Encoding-decoding Network With Pyramid Self-attention Module For Retinal Vessel Segmentation. International Journal of Automation and Computing.
Citation: Cong-Zhong Wu, Jun Sun, Jing Wang, Liang-Feng Xu, Shu Zhan.

Encoding-decoding Network With Pyramid Self-attention Module For Retinal Vessel Segmentation

. International Journal of Automation and Computing.

Encoding-decoding Network With Pyramid Self-attention Module For Retinal Vessel Segmentation

doi: 10.1007/s11633-020-1277-0
More Information
  • Author Bio:

    Cong-Zhong Wu received the M. Sc. degree in microwave specialty from Institute of Plasma Physics, Chinese Academy of Sciences, China in 1990. He is now an associate professor of School of Computer and Information Engineering, Hefei University of Technology, China. His research interests include wireless communication, DSP technology and application, embedded application system development. E-mail: 17368836384@163.comORCID iD: 0000-0001-7261-5053

    Jun Sun received the B.Sc. degree in electronic information engineering from Jiangsu University, China in 2018. Now, he is a graduate student in Hefei University of technology, China. His research interests include deep learning and computer vision. E-mail: iD: 0000-0002-5568-496X

    Jing Wang received the M. D. degree in ophthalmic optometry from Anhui Medical University, China in 2008. She is now an attending physician in the Second Affiliated Hospital of Anhui Medical University, China. Her research interests include second laser treatment of myopia, high myopia comprehensive treatment, myopia and hyperopia correction.E-mail:

    Liang-Feng Xu received the B. Sc. degree in electronic information engineering from Hefei University of technology, China in 1993, and received the M.Eng. degree in signal and information processing from School of Computer and Information Technology, Hefei University of Technology, China in 2003. Since 1993, he has been engaged in teaching and scientific research in School of Computer and Information, Hefei University of Technology, China. Her research interests include wireless communication, image processing and facial expression recognition. E-mail:

    Shu Zhan received Ph.D. degree in information and communication system from University of Science and Technology of China, China in 2000. He has been a postdoctoral researcher in School of Information, Tokyo University of Japan from 2002 to 2004. He has been a professor in School of Computer Science and Information Engineering, Hefei University of Technology, China since 1993. His research interests include digital image analysis, medical image analysis and pattern recognition. E-mail: (Corresponding author) ORCID iD: 0000-0002-5568-0264

  • Received Date: 2020-09-20
  • Accepted Date: 2021-01-06
  • Retina vessel segmentation is a vital step in diagnosing ophthalmologic diseases. Traditionally, ophthalmologists segment retina vessels by hand, which is time-consuming and error-prone. Thus, more and more researchers are committed to the research of automatic segmentation algorithms. With the development of convolution neural networks (CNNs), many tasks can be solved by CNNs. In this paper, we propose an encoding-decoding network with a pyramid self-attention module (PSAM) to segment retinal vessels. The network follows a U shape structure, and it comprises stacked feature selection blocks (FSB) and a PSAM. The proposed FSB consists of two convolution blocks with the same weight and a channel-wise attention block. At the head of the network, we apply a PSAM consisting of three parallel self-attention modules to capture long-range dependence of different scales. Due to the power of PSAM and FSB, the performance of the network improves. We have evaluated our model on two public datasets: DRIVE and CHASE_DB1. The results show the performance of our model is better than other methods. The F1, Accuracy, and area under curve (AUC) are 82.21%/80.57%, 95.65%/97.02%, and 98.16%/98.46% on DRIVE and CHASE_DB1, respectively.


  • loading
  • [1]
    C. Kirbas, F. Quek. A review of vessel extraction techniques and algorithms. ACM Computing Surveys, vol. 36, no. 2, pp. 81–121, 2004. DOI: 10.1145/1031120.1031121.
    M. M. Fraz, P. Remagnino, A. Hoppe, B. Uyyanonvara, A. R. Rudnicka, C. G. Owen, S. A. Barman. Blood vessel segmentation methodologies in retinal images–a survey. Computer Methods and Programs in Biomedicine, vol. 108, no. 1, pp. 407–433, 2012. DOI: 10.1016/j.cmpb.2012.03.009.
    B. Al-Diri, A. Hunter, D. Steel. An active contour model for segmenting and measuring retinal vessels. IEEE Transactions on Medical Imaging, vol. 28, no. 9, pp. 1488–1497, 2009. DOI: 10.1109/TMI.2009.2017941.
    G. Azzopardi, N. Petkov. Automatic detection of vascular bifurcations in segmented retinal images using trainable COSFIRE filters. Pattern Recognition Letters, vol. 34, no. 8, pp. 922–933, 2013. DOI: 10.1016/j.patrec.2012.11.002.
    S. Aslani, H. Sarnel. A new supervised retinal vessel segmentation method based on robust hybrid features. Biomedical Signal Processing and Control, vol. 30, pp. 1–12, 2016. DOI: 10.1016/j.bspc.2016.05.006.
    L. Wang, A. Bhalerao, R. Wilson. Analysis of retinal vasculature using a multiresolution hermite model. IEEE Transactions on Medical Imaging, vol. 26, no. 2, pp. 137–152, 2007. DOI: 10.1109/TMI.2006.889732.
    Y. Yin, M. Adel, S. Bourennane. Retinal vessel segmentation using a probabilistic tracking method. Pattern Recognition, vol. 45, no. 4, pp. 1235–1244, 2012. DOI: 10.1016/j.patcog.2011.09.019.
    Q. Qi, Q. D. Li, Y. Q. Cheng, Q. Q. Hong. Skeleton marching-based parallel vascular geometry reconstruction using implicit functions. International Journal of Automation and Computing, vol. 17, no. 1, pp. 30–43, 2020. DOI: 10.1007/s11633-019-1189-4.
    W. J. Bai, M. Sinclair, G. Tarroni, O. Oktay, M. Rajchl, G. Vaillant, A. M. Lee, N. Aung, E. Lukaschuk, M. M. Sanghvi, F. Zemrak, K. Fung, J. M. Paiva, V. Carapella, Y. J. Kim, H. Suzuki, B. Kainz, P. M. Matthews, S. E. Petersen, S. K. Piechnik, S. Neubauer, B. Glocker, D. Rueckert. Human-level CMR image analysis with deep fully convolutional networks. [Online], Available:, 2017.
    Z. W. He, L. Zhang, F. Y. Liu. DiscoStyle: Multi-level logistic ranking for personalized image style preference inference. International Journal of Automation and Computing, vol. 17, no. 5, pp. 637–651, 2020. DOI: 10.1007/s11633-020-1244-1.
    Y. R. Liang, Z. Y. Xiao. Image encryption algorithm based on compressive sensing and fractional DCT via polynomial interpolation. International Journal of Automation and Computing, vol. 17, no. 2, pp. 292–304, 2020. DOI: 10.1007/s11633-018-1159-2.
    F. Z. Liao, M. Liang, Z. Li, X. L. Hu, S. Song. Evaluate the malignancy of pulmonary nodules using the 3-D deep leaky noisy-OR network. IEEE Transactions on Neural Networks and Learning Systems, vol. 30, no. 11, pp. 3484–3495, 2019. DOI: 10.1109/TNNLS.2019.2892409.
    P. Moeskops, M. Veta, M. W. Lafarge, K. A. J. Eppenhof, J. P. W. Pluim. Adversarial training and dilated convolutions for brain MRI segmentation. In Proceedings of the 3rd International Workshop on Deep Learning in Medical Image Analysis and the 7th International Workshop on Multimodal Learning for Clinical Decision Support, Springer, Quebec City, Canada, pp. 56−64, 2017. DOI: 10.1007/978-3-319-67558-9_7.
    S. Wang, M. Zhou, Z. Y. Liu, Z. Y. Liu, D. S. Gu, Y. L. Zang, D. Dong, O. Gevaert, J. Tian. Central focused convolutional neural networks: Developing a data-driven model for lung nodule segmentation. Medical Image Analysis, vol. 40, pp. 172–183, 2017. DOI: 10.1016/
    Z. Y. Liu. Retinal vessel segmentation based on fully convolutional networks. [Online], Available:, 2019.
    Z. W. Gu, J. Cheng, H. Z. Fu, K. Zhou, H. Y. Hao, Y. T. Zhao, T. Y. Zhang, S. H. Gao, J. Liu. CE-Net: Context encoder network for 2D medical image segmentation. IEEE Transactions on Medical Imaging, vol. 38, no. 10, pp. 2281–2292, 2019. DOI: 10.1109/TMI.2019.2903562.
    L. Luo, D. Y. Xue, X. L. Feng. HybridNetSeg: A compact hybrid network for retinal vessel segmentation. [Online], Available:, 2019.
    J. Son, S. J. Park, K. H. Jung. Retinal vessel segmentation in fundoscopic images with generative adversarial networks. [Online], Available:, 2017.
    J. Hu, L. Shen, S. Albanie, G. Sun, E. H. Wu. Squeeze-and-excitation networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 8, pp. 2011–2023, 2020. DOI: 10.1109/TPAMI.2019.2913372.
    J. Fu, J. Liu, H. J. Tian, Y. Li, Y. J. Bao, Z. W. Fang, H. Q. Lu. Dual attention network for scene segmentation. In Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, Long Beach, USA, pp. 3141−3149, 2019. DOI: 10.1109/CVPR.2019.00326.
    E. Shelhamer, J. Long, T. Darrell. Fully convolutional networks for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 4, pp. 640–651, 2017. DOI: 10.1109/TPAMI.2016.2572683.
    O. Ronneberger, P. Fischer, T. Brox. U-Net: Convolutional networks for biomedical image segmentation. In Proceedings of the 18th International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, Munich, Germany, pp. 234-241, 2015. DOI: 10.1007/978-3-319-24574-4_28.
    N. Abraham, N. M. Khan. A novel focal Tversky loss function with improved attention U-Net for lesion segmentation. [Online], Available:, 2018.
    M. Z. Alom, M. Hasan, C. Yakopcic, T. M. Taha, V. K. Asari. Recurrent residual convolutional neural network based on U-Net (R2U-Net) for medical image segmentation. [Online], Available:, 2018.
    H. Z. Fu, J. Cheng, Y. W. Xu, D. W. K. Wong, J. Liu, X. C. Cao. Joint optic disc and cup segmentation based on multi-label deep network and polar transformation. IEEE Transactions on Medical Imaging, vol. 37, no. 7, pp. 1597–1605, 2018. DOI: 10.1109/TMI.2018.2791488.
    L. C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, A. L. Yuille. DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 4, pp. 834–848, 2018. DOI: 10.1109/TPAMI.2017.2699184.
    Q. Zhang, Z. P. Cui, X. G. Niu, S. J. Geng, Y. Qiao. Image segmentation with pyramid dilated convolution based on ResNet and U-Net. In Proceedings of the 24th International Conference International Conference on Neural Information Processing, Springer, Guangzhou, China, pp. 364−372, 2017. DOI: 10.1007/978-3-319-70096-0_38.
    H. Y. Xia, W. F. Sun, S. X. Song, X. W. Mou. Md-Net: Multi-scale dilated convolution network for CT images segmentation. Neural Processing Letters, vol. 51, no. 3, pp. 2915–2927, 2020. DOI: 10.1007/s11063-020-10230-x.
    D. G. Xiao, P. Zhong. Image semantic segmentation using deep convolutional nets, fully connected conditional random fields, and dilated convolution. In Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; the 17th IEEE International Conference on Smart City; the 5th IEEE International Conference on Data Science and Systems, IEEE, Zhangjiajie, China, pp. 1872−1877, 2019. DOI: 10.1109/HPCC/SmartCity/DSS.2019.00258.
    X. L. Wang, R. Girshick, A. Gupta, K. M. He. Non-local neural networks. [Online], Available:, 2017.
    H. Tang, H. Liu, D. Xu, P. H. S. Torr, N. Sebe. AttentionGAN: Unpaired image-to-image translation using attention-guided generative adversarial networks. [Online], Available:, 2019.
    B. W. Li, X. J. Qi, T. Lukasiewicz, P. H. S. Torr. Controllable text-to-image generation. [Online], Available:, 2019.
    O. Oktay, J. Schlemper, L. L. Folgoc, M. Lee, M. Heinrich, K. Misawa, K. Mori, S. McDonagh, N. Y. Hammerla, B. Kainz, B. Glocker, D. Rueckert. Attention U-Net: Learning where to look for the pancreas. [Online], Available:, 2018.
    A. D. Hoover, V. Kouznetsova, M. Goldbaum. Locating blood vessels in retinal images by piecewise threshold probing of a matched filter response. IEEE Transactions on Medical Imaging, vol. 19, no. 3, pp. 203–210, 2000. DOI: 10.1109/42.845178.
    M. Niemeijer, J. Staal, B. Van Ginneken, M. Loog, M. D. Abràmoff. Comparative study of retinal vessel segmentation methods on a new publicly available database. In Proceedings of SPIE 5370, Medical Imaging 2004: Image Processing, San Diego, USA, pp. 648-656, 2004. DOI: 10.1117/12.535349.
    S. Roychowdhury, D. D. Koozekanani, K. K. Parhi. Blood vessel segmentation of fundus images by major vessel extraction and subimage classification. IEEE Journal of Biomedical and Health Informatics, vol. 19, no. 3, pp. 1118–1128, 2015. DOI: 10.1109/JBHI.2014.2335617.
    Q. L. Li, B. W. Feng, L. P. Xie, P. Liang, H. S. Zhang, T. F. Wang. A cross-modality learning approach for vessel segmentation in retinal images. IEEE Transactions on Medical Imaging, vol. 35, no. 1, pp. 109–118, 2016. DOI: 10.1109/TMI.2015.2457891.
    M. Melinscak, P. Prentasic, S. Loncaric. Retinal vessel segmentation using deep neural networks. In Proceedings of the 10th International Conference on Computer Vision Theory and Applications, Berlin, Germany, pp. 577−582, 2015.
    P. Liskowski, K. Krawiec. Segmenting retinal blood vessels with deep neural networks. IEEE Transactions on Medical Imaging, vol. 35, no. 11, pp. 2369–2380, 2016. DOI: 10.1109/TMI.2016.2546227.
    J. T. Zhuang J. LadderNet: Multi-path networks based on U-Net for medical image segmentation. [Online], Available:, 2018.
  • 加载中


    通讯作者: 陈斌,
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(7)  / Tables(3)

    Article Metrics

    Article views (80) PDF downloads(13) Cited by()
    Proportional views


    DownLoad:  Full-Size Img  PowerPoint