Distant supervision for relation extraction

Doshi, Prarthana

dc.contributor.advisor	Jat, PM
dc.contributor.author	Doshi, Prarthana
dc.date.accessioned	2019-03-19T09:30:53Z
dc.date.available	2019-03-19T09:30:53Z
dc.date.issued	2018
dc.identifier.citation	Doshi, Prarthana (2018). Distant Supervision for Relation Extraction. Dhirubhai Ambani Institute of Information and Communication Technology, vii, 26 p. (Acc. No: T00710)
dc.identifier.uri	http://drsr.daiict.ac.in//handle/123456789/744
dc.description.abstract	Relation extraction (RE) is one of the important tasks of information extraction, which is used to obtain data from natural language text. Relation extraction can be done using different methods, but most techniques in the area use labelled data. The downside of labelled data is that it is very costly to generate, since it requires human labour to understand each sentence and its entities and to label it accordingly. A large amount of natural language data is available, and it is increasing day by day, so supervised techniques may not scale or adapt well to real-time, dynamic data. The issue of human annotation is addressed by the recent approach of distant supervision, which attempts to label data automatically. This is realized by extracting facts from publicly available knowledge bases such as Wikidata and DBpedia, most of which are freely available. The assumption of distant supervision is that if a relation between two entities exists in the knowledge base, then a sentence in which those entities appear together expresses that relation. However, distant supervision has problems of its own, such as incomplete knowledge bases and the wrong label problem. Most techniques in the area of relation extraction have used available NLP tools for feature extraction, and these tools themselves introduce errors. In this work, we explore a convolutional neural network (CNN) for the task, which does not require NLP-based preprocessing. To avoid the wrong label problem, we use selective attention over instances, which treats the task as a multi-instance problem, and we conclude that it gives better results. We also use a CNN with a context model, in which the input is divided into three parts based on the entity positions. This helps the model learn the sentence representation, and it performs well compared to the basic CNN model.
dc.publisher	Dhirubhai Ambani Institute of Information and Communication Technology
dc.subject	Relation extraction
dc.subject	Information extraction
dc.subject	CNN model
dc.subject	Distant supervision
dc.subject	Natural Language Processing
dc.subject	Precision-recall curve
dc.subject	Information retrieval
dc.classification.ddc	006.35 DOS
dc.title	Distant supervision for relation extraction
dc.type	Dissertation
dc.degree	M. Tech
dc.student.id	201611019
dc.accession.number	T00710
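
The distant supervision assumption stated in the abstract can be illustrated with a minimal sketch. It is not taken from the dissertation: the toy knowledge base, the example sentences, and the function name `distant_label` are all hypothetical, and entity matching is done by plain substring search purely for illustration.

```python
from collections import defaultdict

# Hypothetical toy knowledge base: (head entity, tail entity) -> relation.
KB = {
    ("Barack Obama", "Honolulu"): "place_of_birth",
    ("Paris", "France"): "capital_of",
}


def distant_label(sentences, kb):
    """Label sentences automatically using the distant supervision assumption.

    Any sentence that mentions both entities of a knowledge-base fact is
    assumed to express that fact's relation. Sentences are grouped into
    bags keyed by (head, tail, relation), the usual multi-instance view.
    """
    bags = defaultdict(list)
    for sentence in sentences:
        for (head, tail), relation in kb.items():
            # Naive substring matching stands in for real entity linking.
            if head in sentence and tail in sentence:
                bags[(head, tail, relation)].append(sentence)
    return dict(bags)


sentences = [
    "Barack Obama was born in Honolulu.",
    "Barack Obama visited Honolulu last week.",  # matched, but not a birth statement
    "Paris is the capital of France.",
    "Berlin is a large city.",  # no knowledge-base pair, so left unlabelled
]

bags = distant_label(sentences, KB)
for key, bag in bags.items():
    print(key, len(bag))
```

The second sentence lands in the `place_of_birth` bag even though it does not express that relation, which is an instance of the wrong label problem the abstract mentions. Grouping sentences into bags per entity pair is what allows a multi-instance method, such as attention over the instances in a bag, to down-weight such sentences.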

Files in this item

Name:: 201611019_Prarthana Doshi.pdf
Size:: 281.1Kb
Format:: PDF
Description:: 201611019
