Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/689
Title: Distant supervision for relation extraction from text
Authors: Jat, P. M.
Sadhwani, Jay Dilipkumar
Keywords: Fabrication
Logic device
Memory controller
Static random access memory
Single electron transistor
Issue Date: 2017
Publisher: Dhirubhai Ambani Institute of Information and Communication Technology
Citation: Jay Dilipkumar Sadhwani (2017). Distant Supervision for Relation Extraction from Text. Dhirubhai Ambani Institute of Information and Communication Technology. viii, 29 p. (Acc. No: T00653)
Abstract: "Relation Extraction (RE) is an important part of Information Extraction (IE) that helps to extract facts from unstructured textual data. Supervised relation extraction is challenged by domain dependence and high labeling cost. Supervised approaches also do not scale to the huge amount of textual data currently available on the web. In response to these issues, there is an evolving trend of using alternative approaches: semi-supervised approaches and distant supervision. Distant supervision automatically labels a corpus using freely available knowledge bases. The intuition is that if there is a relation between two entities in the knowledge base, then a sentence containing both these entities would indicate the relation between them. However, there are two issues with distant supervision. First, not all sentences containing two entities express the relation between them. This results in false positives, as a sentence is labeled with a relation that it does not actually express. Second, some or all relations between entities may be missing from the knowledge base, so a sentence would be labeled with no relation in spite of expressing some relation. This increases false negatives. Most recent works have neglected the issue of false negatives. Some have addressed it concurrently with the issue of false positives. Since the two issues, as already stated, are independent, we believe that dealing with them independently should improve performance. We propose a strategy where we first address false positives and then false negatives. Our results using this intuition show improvement in terms of precision and recall. We also introduce some improvements to the feature set for relation extraction, which resulted in noticeable gains."
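The labeling intuition described in the abstract, and the two kinds of noise it produces, can be illustrated with a minimal sketch. This is not the dissertation's implementation: the knowledge base, entity names, sentences, relation names, and the function `distant_label` are all hypothetical, and entity matching is reduced to plain substring search.

```python
# Toy knowledge base: (head entity, tail entity) -> relation.
# All facts here are made up for illustration.
KB = {
    ("Alice Smith", "Springfield"): "born_in",
}

def distant_label(sentence, head, tail, kb):
    """Label a sentence for an entity pair using the distant supervision heuristic.

    Returns None if the sentence does not mention both entities,
    the knowledge-base relation if the pair is in the knowledge base,
    and "NA" (no relation) otherwise.
    """
    if head not in sentence or tail not in sentence:
        return None
    return kb.get((head, tail), "NA")

examples = [
    # Expresses the knowledge-base relation: labeled correctly.
    ("Alice Smith was born in Springfield.", "Alice Smith", "Springfield"),
    # Mentions both entities but not the relation: a false positive.
    ("Alice Smith visited Springfield last week.", "Alice Smith", "Springfield"),
    # Expresses a relation the knowledge base lacks: a false negative.
    ("Alice Smith works for Acme Corp.", "Alice Smith", "Acme Corp"),
]

for sentence, head, tail in examples:
    print(distant_label(sentence, head, tail, KB), "|", sentence)
```

In this sketch the second sentence receives the label `born_in` although it only describes a visit, and the third receives `NA` although it states an employment relation. These are the false positive and false negative cases that the abstract proposes to handle in two separate steps.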
URI: http://drsr.daiict.ac.in//handle/123456789/689
Appears in Collections: M Tech Dissertations

Files in This Item:
File: 201511040.pdf (Restricted Access)
Description: 201511040
Size: 242.59 kB
Format: Adobe PDF

