Cover image for Discovering specific semantic relations among words using neural network methods
Discovering specific semantic relations among words using neural network methods
Title:
Discovering specific semantic relations among words using neural network methods
Author:
Sezerer, Erhan, author.
Personal Author:
Physical Description:
viii, 99 leaves: charts;+ 1 computer laser optical disc.
Abstract:
Human-level language understanding is one of the oldest challenges in computer science. Many scientific work has been dedicated to finding good representations for semantic units (words, morphemes, characters) in languages. Recently, contextual language models, such as BERT and its variants, showed great success in downstream natural language processing tasks with the use of masked language modelling and transformer structures. Although these methods solve many problems in this domain and are proved to be useful, they still lack one crucial aspect of the language acquisition in humans: Experiential (visual) information. Over the last few years, there has been an increase in the studies that consider experiential information by building multi-modal language models and representations. It is shown by several studies that language acquisition in humans start with learning concrete concepts through images and then continue with learning abstract ideas through text. In this work, the curriculum learning method is used to teach the model concrete/abstract concepts through the use of images and corresponding captions to accomplish the task of multi-modal language modeling/representation. BERT and Resnet-152 model is used on each modality with attentive pooling mechanism on the newly constructed dataset, collected from the Wikimedia Commons. To show the performance of the proposed model, downstream tasks and ablation studies are performed. Contribution of thiswork is two-fold: a newdataset is constructed fromWikimedia Commons and a new multi-modal pre-training approach that is based on curriculum learning is proposed. Results show that the proposed multi-modal pre-training approach increases the success of the model.
Added Author:
Added Uniform Title:
Thesis (Doctoral)--İzmir Institute of Technology:Computer Engineering.

İzmir Institute of Technology: Energy Engineering --Thesis (Doctoral).
Electronic Access:
Access to Electronic Versiyon.
Holds: Copies: