论文标题

提高帖子在堆栈溢出中的答案集的质量

Improving Quality of a Post's Set of Answers in Stack Overflow

论文作者

Tavakoli, Mohammadrezar, Izadi, Maliheh, Heydarnoori, Abbas

论文摘要

社区问题回答堆栈溢出等平台有助于广泛的用户在线解决他们的挑战。随着这些社区多年来的普及,成员和职位的数量都在升级。同样,由于用户的不同背景,技能,专业知识和观点,每个问题可能会获得多个答案。因此,重点是生产具有对整个社区更有价值的帖子的帖子,而不仅仅是一个旨在仅满足问答者的接受者。与每个通用社区相同,堆栈溢出上的大量低质量帖子需要改进。我们称这些帖子不足,并将其定义为帖子,其中尚无答案,或者其他问题可以改进。在本文中,我们提出了一种自动化此类帖子的识别过程的方法,并利用相关专家的帮助来提高其答案集。在60名参与者的帮助下,我们培训了一个分类模型,通过研究堆栈溢出上发布的3075个问题的特征之间的关系与他们对更好答案的需求之间的关系。然后,我们开发了一个名为SOPI的Eclipse插件,并将插件中的预测模型集成在一起,以将这些缺陷的帖子与相关开发人员联系起来,并帮助他们改善答案集。我们分别评估了插件的功能以及在10和15个专家工业开发人员的帮助下提交堆栈溢出的答案的影响。我们的结果表明,决策树,尤其是J48,预测一个缺乏问题的问题要比其他具有0.945精度和0.903召回的方法更好。我们得出的结论是,我们的插件不仅可以帮助程序员更容易为堆叠溢出做出贡献,还可以提高答案的质量。

Community Question Answering platforms such as Stack Overflow help a wide range of users solve their challenges online. As the popularity of these communities has grown over the years, both the number of members and posts have escalated. Also, due to the diverse backgrounds, skills, expertise, and viewpoints of users, each question may obtain more than one answers. Therefore, the focus has changed toward producing posts that have a set of answers more valuable for the community as a whole, not just one accepted-answer aimed at satisfying only the question-asker. Same as every universal community, a large number of low-quality posts on Stack Overflow require improvement. We call these posts deficient and define them as posts with questions that either have no answer yet or can be improved by other ones. In this paper, we propose an approach to automate the identification process of such posts and boost their set of answers, utilizing the help of related experts. With the help of 60 participants, we trained a classification model to identify deficient posts by investigating the relationship between characteristics of 3075 questions posted on Stack Overflow and their need for better answers set. Then, we developed an Eclipse plugin named SOPI and integrated the prediction model in the plugin to link these deficient posts to related developers and help them improve the answer set. We evaluated both the functionality of our plugin and the impact of answers submitted to Stack Overflow with the help of 10 and 15 expert industrial developers, respectively. Our results indicate that decision trees, specifically J48, predicts a deficient question better than the other methods with 0.945 precision and 0.903 recall. We conclude that not only our plugin helps programmers contribute more easily to Stack Overflow, but also it improves the quality of answers.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源