Automated abundance analysis of underwater video using artificial intelligence techniques
Fearn, RC (2010) Automated abundance analysis of underwater video using artificial intelligence techniques. PhD thesis, University of Tasmania.
This thesis discusses research into the various factors associated with the detection, classification and tracking of commercial scallops in underwater video (from a
moving camera), for the purpose of computing abundance analysis statistics. Such statistics are sought by the Tasmanian Aquaculture and Fisheries Institute (TAFI) to
support the sustainable management of Tasmanian commercial scallop fisheries. The use of video is preferable in terms of environmental impact to a traditional approach
such as dredging, but the partially buried nature of the scallops makes the task very challenging.
The research led to the development of a multi-stage video analysis system using a range of existing artificial intelligence techniques from the fields of computer vision
and machine learning. The system comprises five main stages: instance detection, feature extraction, instance classification, motion estimation and temporal instance
This system may be required to analyse many hours of video footage. Therefore we have explored many different computer-vision-related techniques and performed numerous comparisons on these techniques in an effort to maximise system throughput without compromising the overall accuracy achieved. This includes using the University of Southern California’s iLab Neuromorphic Vision C++ Toolkit
(iNVT) [iLab, 2010], during the initial stages of processing, to quickly reduce the overall search space of our system down to the analysis of conspicuous or salient
regions within the footage. In addition to this we also investigated solutions that allow our system to skip frames during analysis. This decreases the overall
processing time as fewer frames are presented to the system, but can also adversely affect the accuracy of motion estimation. To overcome this we then use a simple and
efficient outlier detection method capable of smoothing inconsistencies within the data prior to predicting instance locations in future frames.
The performance of each stage of our system has a direct impact on the performance of the remaining stages within the system. However of particular interest is the final
stage which uses clustering to achieve temporal instance tracking, as it has become evident that this stage is not only capable of tracking instances through time but is
also capable of performing a second round of classification based on cluster density that plays a vital role in the elimination of false positive instances introduced in
earlier stages. As a result our system performs well, being robust enough to overcome significant inconsistencies within the existing video footage provided by
|Item Type:||Thesis (PhD)|
|Additional Information:||© 2010 the author|
|Deposited By:||INVALID USER|
|Deposited On:||24 Dec 2010 10:57|
|Last Modified:||05 Jul 2011 14:59|
|ePrint Statistics:||View statistics for this ePrint|
Repository Staff Only: item control page