LVQ Treatment for Zero-Shot Learning

dc.authorid: https://orcid.org/0000-0002-6680-7291
dc.contributor.author: İsmailoğlu, Fırat
dc.date.accessioned: 2024-03-04T13:25:14Z
dc.date.available: 2024-03-04T13:25:14Z
dc.date.issued: 23.01.2023
dc.department: Mühendislik Fakültesi (Faculty of Engineering)
dc.description.abstract: In image classification, there are no labeled training instances for some classes, which are therefore called unseen classes or test classes. To classify these classes, zero-shot learning (ZSL) was developed, which typically attempts to learn a mapping from the (visual) feature space to the semantic space, in which the classes are represented by a list of semantically meaningful attributes. However, the fact that this mapping is learned without using instances of the test classes degrades the performance of ZSL; this is known as the domain shift problem. In this study, we propose to apply the learning vector quantization (LVQ) algorithm in the semantic space once the mapping is determined. First and foremost, this allows us to refine the prototypes of the test classes with respect to the learned mapping, which reduces the effects of the domain shift problem. Secondly, the LVQ algorithm increases the margin of the 1-NN classifier used in ZSL, resulting in better classification. Moreover, for this work, we considered a range of LVQ algorithms, from initial to advanced variants, applied them to a number of state-of-the-art ZSL methods, and thereby obtained their LVQ extensions. Experiments on five ZSL benchmark datasets showed that the LVQ-empowered extensions of the ZSL methods are superior to their original counterparts in almost all settings.
dc.identifier.doi: 10.55730/1300-0632.3980
dc.identifier.issue: 1
dc.identifier.scopus: 2-s2.0-85151502548
dc.identifier.scopusquality: N/A
dc.identifier.startpage:

Key words: Zero-shot learning, learning vector quantization, image classification, prototype learning, large margin classifiers

1. Introduction

When dealing with the problem of image classification/visual recognition, we usually assume that a number of labeled training instances are available for each class of interest. In practice, however, this may not be feasible, since collecting and annotating instances for each class incurs a huge cost. Moreover, after a classifier has been trained, new unseen classes may emerge dynamically [1]. In fact, new plant and animal species are constantly being discovered, making the classification of such target classes a challenge for image classification [1–3].

To address the above problem, zero-shot learning (ZSL) was developed, inspired by the ability of humans to identify novel cases/classes given a high-level description of them [3, 6]. In the context of ZSL, such descriptions are generally given as a list of semantically meaningful properties called attributes. These can be continuous word vectors [4] or binary vectors of visual properties, such as "has tail" or "is red" [3]. Using these attributes, ZSL aims to classify test/target classes for which no labeled training instances are available. ZSL achieves this as follows: it assumes that the training classes, i.e. the classes whose labeled instances are available during training, are also represented by the same set of attributes. This yields a space known as the semantic (embedding) space, which contains the representations, i.e. the prototypes, of both the test classes and the training classes.
dc.identifier.trdizinid: 1159774
dc.identifier.uri: https://hdl.handle.net/20.500.12418/14624
dc.identifier.volume: 31
dc.identifier.wos: WOS:001032153500001
dc.identifier.wosquality: Q4
dc.indekslendigikaynak: Web of Science
dc.indekslendigikaynak: Scopus
dc.indekslendigikaynak: TR-Dizin
dc.language.iso: en
dc.publisher: Tubitak Academic Journals
dc.relation.ispartof: Turkish Journal of Electrical Engineering and Computer Sciences
dc.relation.publicationcategory: Ulusal Hakemli Dergide Makale - Kurum Öğretim Elemanı (Article in a National Peer-Reviewed Journal - Institutional Faculty Member)
dc.rights: info:eu-repo/semantics/openAccess
dc.subject: Zero-shot learning
dc.subject: Vector quantization
dc.subject: Image classification
dc.subject: Prototype learning
dc.subject: Large margin classifiers
dc.title: LVQ Treatment for Zero-Shot Learning
dc.type: Article
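The abstract describes ZSL's standard decision rule: an image's visual features are mapped into the semantic space and assigned to the nearest class prototype (attribute vector) with a 1-NN classifier. A minimal sketch of that rule, assuming a learned linear map `W` and an attribute matrix as illustrative placeholders (the paper's ZSL methods learn the mapping in various ways):

```python
import numpy as np

def zsl_predict(x_visual, W, class_attributes):
    """Classify an image by projecting its visual features into the
    semantic space and returning the nearest class prototype (1-NN).

    x_visual:         (d,) visual feature vector
    W:                (d, a) learned visual-to-semantic mapping
    class_attributes: (C, a) attribute vectors (prototypes) of the
                      candidate (e.g. unseen) classes
    """
    s = x_visual @ W  # project into the semantic space
    dists = np.linalg.norm(class_attributes - s, axis=1)
    return int(np.argmin(dists))  # index of the nearest prototype
```

With an identity map and attribute vectors `[1, 0]` and `[0, 1]`, a feature vector close to `[1, 0]` is assigned to the first class.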
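The prototype refinement the abstract proposes can be sketched with the basic LVQ1 update rule: each mapped instance attracts the nearest prototype when labels agree and repels it otherwise. This is a simplified illustration only; function names, the learning-rate schedule, and the assumption that mapped instances with (pseudo-)labels are available in the semantic space are ours, not the paper's exact formulation, and the paper also covers more advanced LVQ variants.

```python
import numpy as np

def lvq1_refine(prototypes, labels, X, y, lr=0.05, epochs=10):
    """Refine class prototypes in the semantic space with LVQ1.

    prototypes: (C, a) initial class prototypes (attribute vectors)
    labels:     (C,) class label of each prototype
    X, y:       (n, a) instances mapped into the semantic space and
                their (pseudo-)labels
    """
    P = prototypes.astype(float).copy()
    for _ in range(epochs):
        for x, t in zip(X, y):
            # find the prototype nearest to the mapped instance
            j = np.argmin(np.linalg.norm(P - x, axis=1))
            # attract on a label match, repel on a mismatch (LVQ1)
            sign = 1.0 if labels[j] == t else -1.0
            P[j] += sign * lr * (x - P[j])
    return P
```

Because matching instances pull their nearest prototype toward them, the refined prototypes drift toward where the mapping actually places each class, which is the mechanism the abstract credits for mitigating domain shift and widening the 1-NN margin.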

Files

Original bundle
Name: LVQ Treatment for Zero-Shot Learning.pdf
Size: 506.68 KB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 1.44 KB
Description: Item-specific license agreed upon to submission