Discovering specific semantic relations among words using neural network methods
Sezerer, Erhan, author.

Discovering specific semantic relations among words using neural network methods

Sezerer, Erhan, author.

Yazar Ek Girişi
Sezerer, Erhan, author.

Fiziksel Tanımlama
viii, 99 leaves: charts;+ 1 computer laser optical disc.

Human-level language understanding is one of the oldest challenges in computer science. Many scientific work has been dedicated to finding good representations for semantic units (words, morphemes, characters) in languages. Recently, contextual language models, such as BERT and its variants, showed great success in downstream natural language processing tasks with the use of masked language modelling and transformer structures. Although these methods solve many problems in this domain and are proved to be useful, they still lack one crucial aspect of the language acquisition in humans: Experiential (visual) information. Over the last few years, there has been an increase in the studies that consider experiential information by building multi-modal language models and representations. It is shown by several studies that language acquisition in humans start with learning concrete concepts through images and then continue with learning abstract ideas through text. In this work, the curriculum learning method is used to teach the model concrete/abstract concepts through the use of images and corresponding captions to accomplish the task of multi-modal language modeling/representation. BERT and Resnet-152 model is used on each modality with attentive pooling mechanism on the newly constructed dataset, collected from the Wikimedia Commons. To show the performance of the proposed model, downstream tasks and ablation studies are performed. Contribution of thiswork is two-fold: a newdataset is constructed fromWikimedia Commons and a new multi-modal pre-training approach that is based on curriculum learning is proposed. Results show that the proposed multi-modal pre-training approach increases the success of the model.

Konu Başlığı
Neural networks (Computer science)
Semantic computing.

Yazar Ek Girişi
Tekir, Selma,

Tüzel Kişi Ek Girişi
İzmir Institute of Technology. Computer Engineering.

Tek Biçim Eser Adı
Thesis (Doctoral)--İzmir Institute of Technology:Computer Engineering.
İzmir Institute of Technology: Energy Engineering --Thesis (Doctoral).

Elektronik Erişim
Access to Electronic Versiyon.

LibraryMateryal TürüDemirbaş NumarasıYer NumarasıDurumu/İade Tarihi
IYTE LibraryTezT002395QA76.87 S522 2021Tez Koleksiyonu