Understanding user intent in community question answering

Shah, Harsh Kaushikbhai

dc.contributor.advisor	Majumder, Prasenjit
dc.contributor.author	Shah, Harsh Kaushikbhai
dc.date.accessioned	2017-06-10T14:41:36Z
dc.date.available	2017-06-10T14:41:36Z
dc.date.issued	2014
dc.identifier.citation	Shah, Harsh Kaushikbhai (2014). Understanding user intent in community question answering. Dhirubhai Ambani Institute of Information and Communication Technology, vii, 41 p. (Acc.No: T00451)
dc.identifier.uri	http://drsr.daiict.ac.in/handle/123456789/488
dc.description.abstract	Yahoo! Answers, Quora like Community Question Answering (CQA) services are mainly created to remove the limitation of Web search engines by helping users to get information from a community. This CQA system has the so many questions in its memory with possible number of answer. And number of times the questions are repeated. So, if the CQA system understand the user intent of question it helps it to recognize similar kind of questions, find relevant answers and hence, recommend potential answers more effectively and effectively. So, thesis approach is to classify the CQA questions, according to user intent, into three categories: objective, subjective, and social. So, to understand the user intent of questions, we first find the text features and metadata features and then through the machine learning algorithms we build a predictive model that classify the questions into above three categories. This one is supervised learning model. We have a very limited number of labeled questions and large number of unlabeled questions. So, to improve the question classification we also use the co-training, a semi supervised learning algorithm, which uses a small set of labeled questions plus a large number of unlabeled questions for classification. Our results shows that the co-training approach that regards text features and metadata features as two views works better than the supervised learning approach that simply applying these two types of features together. This is because co-training, as a semi-supervised learning method, can make use of a large amount of unlabelled questions in addition to the small set of labeled questions.
dc.publisher	Dhirubhai Ambani Institute of Information and Communication Technology
dc.subject	Undrstanding User Indent
dc.subject	Information Retrieval
dc.subject	Community Question Answering
dc.classification.ddc	004 SHA
dc.title	Understanding user intent in community question answering
dc.type	Dissertation
dc.degree	M. Tech
dc.student.id	201211016
dc.accession.number	T00451

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

M Tech Dissertations [923]

Show simple item record