TY - JOUR
T1 - Performance analysis of semantic segmentation algorithms for finely annotated new UAV aerial video dataset (manipaluavid)
AU - Girisha, S.
AU - Pai, Manohara M.M.
AU - Verma, Ujjwal
AU - Pai, Radhika M.
PY - 2019/1/1
Y1 - 2019/1/1
N2 - Semantic segmentation of videos helps in scene understanding, thereby assisting in other automated video processing techniques like anomaly detection, object detection, event detection, etc. However, there has been limited study on semantic segmentation of videos acquired using Unmanned Aerial Vehicles (UAV), primarily due to the absence of standard dataset. In this paper, a new UAV aerial video dataset (ManipalUAVid) for semantic segmentation is presented. The videos have been acquired in a closed university campus, and fine annotation is provided for four background classes viz. constructions, greeneries, roads, and waterbodies. Also, the performance of four semantic segmentation approaches: Conditional Random Field (CRF), U-Net, Fully Convolutional Network (FCN) and DeepLabV3+ are analysed on ManipalUAVid dataset. It is seen that these algorithms perform competitively on UAV aerial video dataset and achieves an mIoU of 0.86, 0.86, 0.86 and 0.83 respectively.
AB - Semantic segmentation of videos helps in scene understanding, thereby assisting in other automated video processing techniques like anomaly detection, object detection, event detection, etc. However, there has been limited study on semantic segmentation of videos acquired using Unmanned Aerial Vehicles (UAV), primarily due to the absence of standard dataset. In this paper, a new UAV aerial video dataset (ManipalUAVid) for semantic segmentation is presented. The videos have been acquired in a closed university campus, and fine annotation is provided for four background classes viz. constructions, greeneries, roads, and waterbodies. Also, the performance of four semantic segmentation approaches: Conditional Random Field (CRF), U-Net, Fully Convolutional Network (FCN) and DeepLabV3+ are analysed on ManipalUAVid dataset. It is seen that these algorithms perform competitively on UAV aerial video dataset and achieves an mIoU of 0.86, 0.86, 0.86 and 0.83 respectively.
UR - http://www.scopus.com/inward/record.url?scp=85074587883&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074587883&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2019.2941026
DO - 10.1109/ACCESS.2019.2941026
M3 - Article
AN - SCOPUS:85074587883
SN - 2169-3536
VL - 7
SP - 136239
EP - 136253
JO - IEEE Access
JF - IEEE Access
M1 - 2941026
ER -