论文标题
评论viz:协助开发人员对移动应用程序的能源消耗相关评论进行实证研究
ReviewViz: Assisting Developers Perform Empirical Study on Energy Consumption Related Reviews for Mobile Applications
论文作者
论文摘要
提高移动应用程序的能源效率是一个最近引起了很多关注的主题。它已经以多种方式来解决,例如识别能量错误和开发能量模式的目录。先前的工作表明,用户在评论中讨论了应用程序与电池相关的问题(效率低下或能源消耗)。但是,没有工作能够从用户反馈中自动提取与电池相关的问题的自动提取。在本文中,我们报告了一种可视化工具,该工具旨在通过经验研究机器学习算法和文本功能,以自动以最高精度识别能量消耗的特定评论。除了通用的机器学习算法外,我们还利用具有不同单词嵌入的深度学习模型来比较结果。此外,为了帮助开发人员提取评论中讨论的主要主题,采用了两种最新的主题建模算法。主题的可视化代表了每个主题提取的关键字,以及与字符串匹配的结果进行比较。开发的基于Web浏览器的交互式可视化工具是一个开发的新型框架,目的是使应用程序开发人员了解机器学习和深度学习模型的运行时间和准确性以及提取的主题。该工具使开发人员更容易通过文本分类和主题建模算法生成的广泛结果集。用于工具的动态数据结构存储讨论方法的基线分子,并在新数据集上应用时会更新。该工具是开源的,以复制研究结果。
Improving the energy efficiency of mobile applications is a topic that has gained a lot of attention recently. It has been addressed in a number of ways such as identifying energy bugs and developing a catalog of energy patterns. Previous work shows that users discuss the battery-related issues (energy inefficiency or energy consumption) of the apps in their reviews. However, there is no work that addresses the automatic extraction of battery-related issues from users' feedback. In this paper, we report on a visualization tool that is developed to empirically study machine learning algorithms and text features to automatically identify the energy consumption specific reviews with the highest accuracy. Other than the common machine learning algorithms, we utilize deep learning models with different word embeddings to compare the results. Furthermore, to help the developers extract the main topics that are discussed in the reviews, two states of the art topic modeling algorithms are applied. The visualizations of the topics represent the keywords that are extracted for each topic along with a comparison with the results of string matching. The developed web-browser based interactive visualization tool is a novel framework developed with the intention of giving the app developers insights about running time and accuracy of machine learning and deep learning models as well as extracted topics. The tool makes it easier for the developers to traverse through the extensive result set generated by the text classification and topic modeling algorithms. The dynamic-data structure used for the tool stores the baseline-results of the discussed approaches and is updated when applied on new datasets. The tool is open-sourced to replicate the research results.