Show simple item record

dc.contributor.advisorJat, PM
dc.contributor.authorDoshi, Prarthana
dc.date.accessioned2019-03-19T09:30:53Z
dc.date.available2019-03-19T09:30:53Z
dc.date.issued2018
dc.identifier.citationDoshi, Prarthana (2018). Distant Supervision for Relation Extraction. Dhirubhai Ambani Institute of Information and Communication Technology, vii, 26 p. (Acc. No: T00710)
dc.identifier.urihttp://drsr.daiict.ac.in//handle/123456789/744
dc.description.abstractRelation Extraction(RE) is one of important task of Information Extraction. InformationExtraction is used to get data from natural language text. Relation extractionis done using different methods. Most techniques found in the area ofrelation extraction uses labelled data. The downside of using labelled data is thatit is very costly to generate the labelled data as it requires human labour to understandeach sentence and entities and label it accordingly. There is a big amount ofnatural language data available and it is increasing day by day. So, the supervisedtechniques may not scale and adapt well with real time dynamic data.The issue of human annotations is addressed by recent approach of distant supervision.Distant supervision is a task that attempts automatic labelling of data.This is realized by extracting facts from publicly available knowledge bases likeWikidata, DBPedia, etc. Most of the knowledge bases are freely available. Theassumption of distant supervision is that if there is a relation between entitiesin knowledge base, then a sentence, in which those entities are present together,represents that relation. But there are some problems associated with distant supervisionlike incomplete knowledge base or wrong label problem.Most techniques in the area of relation extraction used available NLP toolsfor the feature extraction. These tools themselves have errors. In this work, weexplore convolutional neural network for the task which does not require NLPbased preprocessing.To avoid the wrong label problem, we have used selective attention over instances.It considers the problem as the multi-instance problem and we have concludedthat it gives better result. We have also used CNN with context modelwhere the input of the model is divided in three parts based on the entity position.This helps model to understand the sentence representation and the modelperforms well as compared to basic CNN model.
dc.publisherDhirubhai Ambani Institute of Information and Communication Technology
dc.subjectRelation extraction
dc.subjectInformation extraction
dc.subjectCNN model
dc.subjectDistant supervision
dc.subjectNatural Language Processing
dc.subjectPrecision- recall curve
dc.subjectInformation retrieval
dc.classification.ddc006.35 DOS
dc.titleDistant supervision for relation extraction
dc.typeDissertation
dc.degreeM. Tech
dc.student.id201611019
dc.accession.numberT00710


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record