Zero-shot classification of salmon lice images by siamese neural networks

Lian, Kristian Mølbach

dc.contributor.author	Lian, Kristian Mølbach
dc.date.accessioned	2023-08-17T23:40:03Z
dc.date.available	2023-08-17T23:40:03Z
dc.date.issued	2023-06-22
dc.date.submitted	2023-08-15T22:00:27Z
dc.identifier.uri	https://hdl.handle.net/11250/3084674
dc.description	Revised version: some errors corrected.
dc.description.abstract	Deep learning models, such as neural networks and its variations, have proven exceptionally useful in the current state of society. However, facilitating competitive performances requires large amounts of data for the models to train on, which is especially true in the problem of classification. Addressing this issue for the scarce image datasets containing salmon lice images used in this thesis, can be done by recasting the problem of "which class does this image belong to?", to rather be a question of image similarity, i.e. "is image i similar to j?". In regards to this thesis, siamese neural networks are employed to distinguish images, rather than to explicitly classify them, which has the effect of producing more data points for training. Exactly how many data points for training is readily developed in this thesis (specifically triplet cardinality). Furthermore, the thesis extensively compares the performance measures of F1-score and TAR@FAR(p) in regards to siamese neural networks, and finds that they differ in terms of prediction strictness and what elements of the confusion matrix they focus on. Specifically, TAR@FAR is designed to be more strict because a bound can be set on the allowance of percentage p of false accepts, whereas F1-score also considers false rejects. Moving on, the thesis is the first work to cover the procedure of cylindrical convolution in siamese neural networks, and shows that they in fact contribute in addressing the problem of rotated images. Additionally, cylindrical convolution seemingly solves the problem of inconsistent distribution of data. Conclusively, the best model at predicting image similarity on the synthetic dataset was Siamese_LeNet5_var with cylindrical convolutions. On this dataset augmented 100 times, it performed a testing F1-score of 72.5 ± 2.6% and a testing TAR of 72.8 ± 3.0% (mean ± std). In terms of the real dataset, testing performances could not be calculated due to dataset scarcity. Regardless, the model that performed the best on the validation dataset was also Siamese_LeNet5_var with cylindrical convolutions. On this dataset augmented 100 times, it performed a median validation F1-score of 60.9% and a median TAR@FAR(0.01) of 46.7\%.
dc.language.iso	eng
dc.publisher	The University of Bergen
dc.rights	Copyright the Author. All rights reserved
dc.subject	siamese neural networks
dc.subject	salmon lice
dc.subject	cylindrical convolutions
dc.subject	zero-shot classification
dc.subject	deep learning
dc.title	Zero-shot classification of salmon lice images by siamese neural networks
dc.type	Master thesis
dc.date.updated	2023-08-15T22:00:27Z
dc.rights.holder	Copyright the Author. All rights reserved
dc.description.degree	Masteroppgave i anvendt og beregningsorientert matematikk
dc.description.localcode	MAB399
dc.description.localcode	MAMN-MAB
dc.subject.nus	753109
fs.subjectcode	MAB399
fs.unitcode	12-11-0

Tilhørende fil(er)

Filnavn:: Masteroppgave-Revisjon1_lian.pdf
Størrelse:: 8.739Mb
Format:: PDF
Beskrivelse:: master thesis

Åpne

Denne innførselen finnes i følgende samling(er)

Master theses [115]

Vis enkel innførsel