Show simple item record

dc.contributor.advisorJat, P.M.
dc.contributor.authorGohel, Prashant
dc.date.accessioned2019-03-19T09:31:01Z
dc.date.available2019-03-19T09:31:01Z
dc.date.issued2018
dc.identifier.citationGohel, Prashant (2018). Crime Information Extraction from News Articles. Dhirubhai Ambani Institute of Information and Communication Technology, viii, 26 p. (Acc. No: T00750)
dc.identifier.urihttp://drsr.daiict.ac.in//handle/123456789/782
dc.description.abstractIn the modern era all news reportings are available in digital form. Most newsagencies put it on their website and are freely available. This motivates us totry extracting some information from online news reporting. While understandingnatural language text for information extraction is a complex task,we hopethat extracting information like crime type, crime location, and some profile informationof accused and victim should be feasible. In this work we pulled about1000 crime news articles from NDTV and Indian Express websites. Hand taggingwas done for crime location and crime types of all articles. Through this workwe show that a combination of LSTM and CNN based solution can be effectivelyused for extracting crime location. Using this technique we get 95.58 % precisionand 94.54 % recall. Further, determination of crime type, we found relatively easier.Through simple key word based classification approach we get 95% precision.We also tried out topic modeling for crime type extraction we do not gain any improvement,and we get 79 % precision. Keywords: crime related named entities,deep learning, neural network, LSTM, CNN, NER, NLP
dc.publisherDhirubhai Ambani Institute of Information and Communication Technology
dc.subjectNatural Language Processing
dc.subjectInformation Extraction
dc.subjectMachine Learning
dc.subjectCrime
dc.subjectClassification Techniques
dc.subjectNeural Network
dc.subjectSemantic Analysis
dc.classification.ddc006.35 GOH
dc.titleCrime information extraction from news articles
dc.typeDissertation
dc.degreeM. Tech
dc.student.id201611041
dc.accession.numberT00750


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record