Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/993
Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.advisor | Majumder, Prasenjit | |
| dc.contributor.author | Bhat, Shripad Anant | |
| dc.date.accessioned | 06-05-2022T06:11:53Z | |
| dc.date.available | 2023-02-18T06:11:53Z | |
| dc.date.issued | 2021 | |
| dc.identifier.citation | Bhat, Shripad Anant (2021). Compounding-aware Word Embedding for Improved Semantic Representation. Dhirubhai Ambani Institute of Information and Communication Technology. viii, 43 p. (Acc.No: T00932) | |
| dc.identifier.uri | http://drsr.daiict.ac.in//handle/123456789/993 | |
| dc.description.abstract | Existing word embedding approaches may not adequately capture the inherent complexities of a language, e.g. the word compounding phenomenon. While a class of data-driven approaches has been shown to be effective in embedding words of languages with relatively simple inflection and compounding characteristics (e.g. English), an open area of investigation is how to integrate language-specific characteristics within the framework of an embedding model. In this work, we explore how words in a highly agglutinative language, e.g. German, can be embedded more effectively by additionally taking into account the contexts around the constituents of a compound word. We propose a word-transformation-based generalization of the skip-gram algorithm to address these relationships between a compound word and its constituents. Our experiments on standard German word-pair similarity datasets and polarity classification of German compounds confirm our hypothesis that modeling contextual relationships between a compound word and its constituents can improve word representations. | |
| dc.publisher | Dhirubhai Ambani Institute of Information and Communication Technology | |
| dc.subject | word embedding | |
| dc.subject | compound words | |
| dc.classification.ddc | 025.04 BHA | |
| dc.title | Compounding-aware Word Embedding for Improved Semantic Representation | |
| dc.type | Dissertation | |
| dc.degree | M. Tech | |
| dc.student.id | 201911003 | |
| dc.accession.number | T00932 | |
Appears in Collections: M Tech Dissertations

Files in This Item:
| File | Description | Size | Format |
|---|---|---|---|
| 201911003_SHRIPAD_ANANT_BHAT_MTECH_THESIS.pdf (Restricted Access) | | 2.86 MB | Adobe PDF |


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.