dc.contributor.advisor | Majumder, Prasenjit | |
dc.contributor.author | Iyer, Ganesh R | |
dc.date.accessioned | 2017-06-10T14:42:59Z | |
dc.date.available | 2017-06-10T14:42:59Z | |
dc.date.issued | 2015 | |
dc.identifier.citation | Iyer, Ganesh R (2015). Summarizing medical texts for effective retrieval. Dhirubhai Ambani Institute of Information and Communication Technology, vii, 31 p. (Acc.No: T00514) | |
dc.identifier.uri | http://drsr.daiict.ac.in/handle/123456789/551 | |
dc.description.abstract | User centered health information retrieval is a challenging and important problem
in information retrieval. In this work, we apply medical resources to bridge the
vocabulary mismatch between lay-users and medical documents. We also applied
text summarization techniques to reduce the document to relevant information
while pruning irrelevant information. We provide a survey of medical resources
and application of text summarization in information retrieval. The primary research
goals were to investigate the use of medical resources in query expansion
and text summarization in indexing. The experiments were performed as a part
of a CLEF eHealth Task, overview of which is provided. From our experiments
we observed that a summarized index can be used to replace a full collection index.
Also a compression rate of 40-80% outperformed the baseline indicating that
retrieval on the summarized collection can indeed improve performance. Using
MeSH(Medical Subject Headings) as a thesaurus to supplement the query terms
improved retrieval for certain queries. We obtained the best MAP score of 0.415,
for all teams, using query expansion with discharge summaries. | |
dc.publisher | Dhirubhai Ambani Institute of Information and Communication Technology | |
dc.subject | Information Retrieval | |
dc.subject | Techniques | |
dc.subject | Medical Resources | |
dc.classification.ddc | 025.0661 IYE | |
dc.title | Summarizing medical texts for effective retrieval | |
dc.type | Dissertation | |
dc.degree | M. Tech | |
dc.student.id | 201311019 | |
dc.accession.number | T00514 | |