论文标题
图像理解和网络
Image understanding and the web
论文作者
论文摘要
研究了Web图像的上下文信息,以解决用语义描述符表征其内容的问题,因此弥合语义差距,即其自动化的低级表示在颜色,纹理,形状方面的差距。 。 。以及他们的语义解释。这种表征可以理解图像内容,并且在重要的基于Web的任务(例如图像索引和检索)中至关重要。尽管我们受到网络上丰富知识的可用性以及商业搜索引擎在使用网页中的上下文信息自动表征图像内容所取得的相对成功的激励,但我们知道,上下文信息的不可预测质量是主要限制因素。解释难以利用图像上下文信息的原因的原因,有些问题与此信息的表征和提取有关。的确,第一个问题是缺乏大规模研究来突出显示图像的相关上下文信息,它位于网页中,以及它是否在不同类型,内容布局和域的网页之间保持一致。同样,与此上下文信息的提取有关的问题是主题的,因为最先进的自动提取工具无法处理异质网络。就上下文信息的处理而言,与文本组件的句法和语义特征相关的问题对于解决语义差距很重要。此外,将出现与这些文本组件组织成相干结构有关的问题,这些结构应在图像索引和检索框架中使用。
The contextual information of Web images is investigated to address the issue of characterizing their content with semantic descriptors and therefore bridge the semantic gap, i.e. the gap between their automated low-level representation in terms of colors, textures, shapes. . . and their semantic interpretation. Such characterization allows for understanding the image content and is crucial in important Web-based tasks such as image indexing and retrieval. Although we are highly motivated by the availability of rich knowledge on the Web and the relative success achieved by commercial search engines in automatically characterizing the image content using contextual information in Web pages, we are aware that the unpredictable quality of the contextual information is a major limiting factor. Among the reasons explaining the difficulty to leverage on the image contextual information, some problems are related to the characterization and extraction of this information. Indeed, the first issue is the lack of large-scale studies to highlight what is considered the relevant contextual information of an image, where it is located in a Web page and whether it is consistent across Web pages of different types, content layouts and domains. Also, the matter related to the extraction of this contextual information is topical as state-of-the-art automated extraction tools are unable to handle the heterogeneous Web. As far as the processing of the contextual information is concerned, problems linked to the syntactic and semantic characterizations of the textual components are important to address in order to tackle the semantic gap. Furthermore, questions pertaining to the organization of these textual components into coherent structures that are usable in image indexing and retrieval frameworks shall arise.