Detection of vehicles and buildings in drone aerial images

Amante, Rita Filipa dos Santos

Please use this identifier to cite or link to this item: http://hdl.handle.net/10773/34992

Title:	Detection of vehicles and buildings in drone aerial images
Other Titles:	Deteção de veículos e edifícios em imagens aéreas obtidas por drone
Author:	Amante, Rita Filipa dos Santos
Advisor:	Neves, António José Ribeiro Silva, José Silvestre Serra da
Keywords:	Artificial intelligence Machine learning Deep learning Transfer learning Computer vision Object detection UAV
Defense Date:	21-Jul-2022
Abstract:	The need to develop software for aerial image analysis, captured by Unmanned Aerial Vehicles, has increased over the years because their use has become more prevalent in different day-to-day scenarios. Object detection, a Computer Vision technique, is one of the most explored problems in this area and consists of identifying and locating objects in images or videos, with the help of Artificial Intelligence technologies. The aim of this dissertation is to analyze the performance of Deep Learning algorithms for detecting vehicles and buildings in aerial images. Two of the main algorithms described in literature, Faster R-CNN and YOLO, the latter in the third and fifth versions, were chosen to verify which one is capable of better performance. The dataset provided by the Portuguese Military Academy, which was annotated and pre-processed, was used for the training of each algorithm and the performance of tests. The results obtained in the abovementioned dataset demonstrate that there is a considerable discrepancy between the two algorithms, both in terms of performance and speed. Faster R-CNN only proved to be superior to the two versions of YOLO in terms of training speed, as it was the algorithm that required less time for training. Among the versions of YOLO, the fifth version showed the best results. A necessidade de desenvolver software para a análise de imagem aérea, capturada por Veículos Aéreos Não Tripulados, tem vindo a aumentar ao longo dos anos devido ao facto de serem cada vez mais utilizadas em diversos cenários do dia-a-dia. A deteção de objetos, técnica da Visão Computacional, é um dos problemas mais explorados nesta área e consiste na identificação e localização de objetos em imagens ou vídeos, com o auxílio de tecnologias de Inteligência Artificial. Pretende-se com esta dissertação analisar o desempenho de algoritmos de Aprendizagem Profunda, para a deteção de veículos e edifícios em imagens aéreas. Foram escolhidos dois dos principais algoritmos descritos na literatura, Faster R-CNN e YOLO, este último na terceira e quinta versão, por forma a verificar qual apresenta melhor desempenho. Para o treino de cada algoritmo e realização de testes foi utilizado um conjunto de dados fornecido pela Academia Militar Portuguesa, o qual foi anotado e pré-processado. Os resultados obtidos, no referido conjunto de dados, demonstraram que existe uma discrepância considerável entre os dois algoritmos, tanto a nível do desempenho como do tempo de deteção. O Faster R-CNN apenas se mostrou superior em relação às duas versões do YOLO no tempo de treino, pois foi o algoritmo que precisou de menos tempo. Entre as versões do YOLO, a quinta versão foi a que apresentou melhores resultados.
URI:	http://hdl.handle.net/10773/34992
Appears in Collections:	UA - Dissertações de mestrado DETI - Dissertações de mestrado

Files in This Item:

File	Description	Size	Format
Documento_Rita_Amante.pdf		6.29 MB	Adobe PDF	View/Open

Show full item record