Skip to:ContentBottom
Discovering specific semantic relations among words using neural network methods için kapak resmi
Discovering specific semantic relations among words using neural network methods
Başlık:
Discovering specific semantic relations among words using neural network methods
Yazar:
Sezerer, Erhan, author.
Yazar Ek Girişi:
Fiziksel Tanımlama:
viii, 99 leaves: charts;+ 1 computer laser optical disc.
Özet:
Human-level language understanding is one of the oldest challenges in computer science. Many scientific work has been dedicated to finding good representations for semantic units (words, morphemes, characters) in languages. Recently, contextual language models, such as BERT and its variants, showed great success in downstream natural language processing tasks with the use of masked language modelling and transformer structures. Although these methods solve many problems in this domain and are proved to be useful, they still lack one crucial aspect of the language acquisition in humans: Experiential (visual) information. Over the last few years, there has been an increase in the studies that consider experiential information by building multi-modal language models and representations. It is shown by several studies that language acquisition in humans start with learning concrete concepts through images and then continue with learning abstract ideas through text. In this work, the curriculum learning method is used to teach the model concrete/abstract concepts through the use of images and corresponding captions to accomplish the task of multi-modal language modeling/representation. BERT and Resnet-152 model is used on each modality with attentive pooling mechanism on the newly constructed dataset, collected from the Wikimedia Commons. To show the performance of the proposed model, downstream tasks and ablation studies are performed. Contribution of thiswork is two-fold: a newdataset is constructed fromWikimedia Commons and a new multi-modal pre-training approach that is based on curriculum learning is proposed. Results show that the proposed multi-modal pre-training approach increases the success of the model.
Yazar Ek Girişi:
Tek Biçim Eser Adı:
Thesis (Doctoral)--İzmir Institute of Technology:Computer Engineering.

İzmir Institute of Technology: Energy Engineering --Thesis (Doctoral).
Elektronik Erişim:
Access to Electronic Versiyon.
Ayırtma: Copies: