Discovering specific semantic relations among words using neural network methods
by
Sezerer, Erhan, author.
Title
:
Discovering specific semantic relations among words using neural network methods
Author
:
Sezerer, Erhan, author.
Personal Author
:
Sezerer, Erhan, author.
Physical Description
:
viii, 99 leaves: charts;+ 1 computer laser optical disc.
Abstract
:
Human-level language understanding is one of the oldest challenges in computer science. Many scientific work has been dedicated to finding good representations for semantic units (words, morphemes, characters) in languages. Recently, contextual language models, such as BERT and its variants, showed great success in downstream natural language processing tasks with the use of masked language modelling and transformer structures. Although these methods solve many problems in this domain and are proved to be useful, they still lack one crucial aspect of the language acquisition in humans: Experiential (visual) information. Over the last few years, there has been an increase in the studies that consider experiential information by building multi-modal language models and representations. It is shown by several studies that language acquisition in humans start with learning concrete concepts through images and then continue with learning abstract ideas through text. In this work, the curriculum learning method is used to teach the model concrete/abstract concepts through the use of images and corresponding captions to accomplish the task of multi-modal language modeling/representation. BERT and Resnet-152 model is used on each modality with attentive pooling mechanism on the newly constructed dataset, collected from the Wikimedia Commons. To show the performance of the proposed model, downstream tasks and ablation studies are performed. Contribution of thiswork is two-fold: a newdataset is constructed fromWikimedia Commons and a new multi-modal pre-training approach that is based on curriculum learning is proposed. Results show that the proposed multi-modal pre-training approach increases the success of the model.
Subject Term
:
Neural networks (Computer science)
Semantic computing.
Added Author
:
Tekir, Selma,
Added Corporate Author
:
İzmir Institute of Technology. Computer Engineering.
Added Uniform Title
:
Thesis (Doctoral)--İzmir Institute of Technology:Computer Engineering.
İzmir Institute of Technology: Energy Engineering --Thesis (Doctoral).
Electronic Access
:
Library | Material Type | Item Barcode | Shelf Number | Status |
---|
IYTE Library | Thesis | T002395 | QA76.87 S522 2021 | Tez Koleksiyonu |
IYTE Library | Supplementary CD-ROM | ROM3545 | QA76.87 S522 2021 EK.1 | Tez Koleksiyonu |