Paper Title
On-Device Information Extraction from Screenshots in the Form of Tags
Authors
Abstract
We propose a method to make mobile screenshots easily searchable. In this paper, we present a workflow in which we: 1) preprocessed a collection of screenshots, 2) identified the script present in each image, 3) extracted unstructured text from the images, 4) identified the language of the extracted text, 5) extracted keywords from the text, 6) identified tags based on image features, 7) expanded the tag set by identifying related keywords, and 8) ranked the tags, attached them to the relevant images, and indexed them so that the screenshots are searchable on device. The pipeline supports multiple languages and runs on-device, which addresses privacy concerns. We developed novel architectures for the components in the pipeline and optimized performance and memory for on-device computation. Our experiments, whose results are published, show that the solution can reduce overall user effort and improve the end-user experience while searching.
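The tail of the workflow above (keyword extraction, tag expansion, and indexing for search) can be illustrated with a minimal sketch. It is not the authors' implementation: every name in it (`extract_keywords`, `expand_tags`, `TagIndex`, `tag_screenshot`, the stopword list, and the `related` lookup table) is hypothetical. Frequency counting stands in for the paper's keyword extractor, and script identification, OCR, and language identification (steps 2 to 4) are hidden behind a caller-supplied `ocr` function. Image-feature tags (step 6) are omitted.

```python
import re
from collections import Counter, defaultdict
from typing import Callable, Dict, List, Set

# Hypothetical stopword list; a real system would use per-language resources.
STOPWORDS = {"the", "a", "an", "to", "of", "and", "in", "on", "for", "is", "at"}


def extract_keywords(text: str, top_k: int = 5) -> List[str]:
    """Stand-in for step 5: rank lowercase word tokens by frequency."""
    tokens = [
        t for t in re.findall(r"[a-z]+", text.lower())
        if t not in STOPWORDS and len(t) > 2
    ]
    return [word for word, _ in Counter(tokens).most_common(top_k)]


def expand_tags(tags: List[str], related: Dict[str, List[str]]) -> List[str]:
    """Stand-in for step 7: append related keywords from a lookup table."""
    expanded = list(tags)
    for tag in tags:
        for extra in related.get(tag, []):
            if extra not in expanded:
                expanded.append(extra)
    return expanded


class TagIndex:
    """Stand-in for step 8: an in-memory inverted index from tag to image ids."""

    def __init__(self) -> None:
        self._index: Dict[str, Set[str]] = defaultdict(set)

    def add(self, image_id: str, tags: List[str]) -> None:
        for tag in tags:
            self._index[tag].add(image_id)

    def search(self, query: str) -> List[str]:
        # .get avoids creating empty entries for unknown queries.
        return sorted(self._index.get(query.lower(), set()))


def tag_screenshot(
    image_id: str,
    ocr: Callable[[str], str],
    index: TagIndex,
    related: Dict[str, List[str]],
) -> List[str]:
    """Run the sketched stages on one screenshot and index the result."""
    text = ocr(image_id)  # assumed to cover script ID, OCR, and language ID
    tags = expand_tags(extract_keywords(text), related)
    index.add(image_id, tags)
    return tags
```

In use, a caller would pass its own text extractor as `ocr`, call `tag_screenshot` once per image, and then query `TagIndex.search` with a single tag. Keeping the index as a plain tag-to-image mapping keeps lookups cheap, which suits the on-device setting the abstract describes.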