计算机科学与探索
計算機科學與探索
계산궤과학여탐색
JOURNAL OF FRONTIERS OF COMPUTER SCIENCE & TECHNOLOGY
2014年
3期
305-312
,共8页
李晓龙%张兆翔%王蕴红%刘庆杰
李曉龍%張兆翔%王蘊紅%劉慶傑
리효룡%장조상%왕온홍%류경걸
航拍%场景分类%视觉词袋%深度学习%高分辨率
航拍%場景分類%視覺詞袋%深度學習%高分辨率
항박%장경분류%시각사대%심도학습%고분변솔
aerial image%scene classification%bags of feature%deep learning%high resolution
最近几十年来,航拍图片和视频在城市规划、沿海地区监视、军事任务等方面都得到了广泛的运用。因而了解航拍图片中所包含的内容,研究航拍视频所拍摄的场景类型就显得异常重要。目前流行的场景分类算法大多是针对自然场景的,很少有针对高分辨率航拍场景分类的算法。针对高分辨率航拍图片的场景分类给出了一种分层式算法。该算法首先用尺度不变特征转换(scale-invariant feature transform,SIFT)算法提取鲁棒的块局部特征,然后在视觉词袋的基础上,用经局限型波兹曼模型(restricted Boltzmann machine,RBM)初始化的深层信念网络(deep belief network,DBN)来表示低层特征与高层视频特征之间的关系;同时深层信念网络也起到了分类器的作用。实验结果表明,该算法在处理高分辨率航拍图片场景分类问题时都要略好于目前主流算法。
最近幾十年來,航拍圖片和視頻在城市規劃、沿海地區鑑視、軍事任務等方麵都得到瞭廣汎的運用。因而瞭解航拍圖片中所包含的內容,研究航拍視頻所拍攝的場景類型就顯得異常重要。目前流行的場景分類算法大多是針對自然場景的,很少有針對高分辨率航拍場景分類的算法。針對高分辨率航拍圖片的場景分類給齣瞭一種分層式算法。該算法首先用呎度不變特徵轉換(scale-invariant feature transform,SIFT)算法提取魯棒的塊跼部特徵,然後在視覺詞袋的基礎上,用經跼限型波玆曼模型(restricted Boltzmann machine,RBM)初始化的深層信唸網絡(deep belief network,DBN)來錶示低層特徵與高層視頻特徵之間的關繫;同時深層信唸網絡也起到瞭分類器的作用。實驗結果錶明,該算法在處理高分辨率航拍圖片場景分類問題時都要略好于目前主流算法。
최근궤십년래,항박도편화시빈재성시규화、연해지구감시、군사임무등방면도득도료엄범적운용。인이료해항박도편중소포함적내용,연구항박시빈소박섭적장경류형취현득이상중요。목전류행적장경분류산법대다시침대자연장경적,흔소유침대고분변솔항박장경분류적산법。침대고분변솔항박도편적장경분류급출료일충분층식산법。해산법수선용척도불변특정전환(scale-invariant feature transform,SIFT)산법제취로봉적괴국부특정,연후재시각사대적기출상,용경국한형파자만모형(restricted Boltzmann machine,RBM)초시화적심층신념망락(deep belief network,DBN)래표시저층특정여고층시빈특정지간적관계;동시심층신념망락야기도료분류기적작용。실험결과표명,해산법재처리고분변솔항박도편장경분류문제시도요략호우목전주류산법。
In recent decades, aerial image/video processing has been widely studied for urban planning, coastal mon-itoring and military tasks. Therefore, understanding the contents contained in aerial images and studying the scene classification of aerial videos are very important. However, currently most popular scene classification algorithms are mainly for natural scenes, rarely for high resolution aerial scene classification. This paper proposes a hierarchical scene classification model for aerial videos/images. Firstly, the scale-invariant feature transform (SIFT) vector is extracted as the patch feature. Then, on the basis of utilizing bag of words, the deep belief network (DBN) initialized by restricted Boltzmann machine (RBM) is used to obtain the latent variables which describe the relationship between low-level region features and high-level global features. The DBN also plays as a classifier. The proposed method achieves promising performance compared with the state of art scene classification methods.