论文标题
通过预先存在的扭曲来预测压缩视频的质量
Predicting the Quality of Compressed Videos with Pre-Existing Distortions
论文作者
论文摘要
在过去的十年中,在线视频行业大大扩展了通过Internet流式传输和共享的视觉数据量。此外,由于视频捕捉的易于越来越轻松,数百万消费者创建和上传了大量的用户生成 - 内容(UGC)视频。与专业摄影师和电影摄影师制作的流媒体电视或电影内容不同,UGC视频最常见的是,幼稚的用户技能和不完美的技术,通常受到高度多样化和混合的捕捉扭曲的困扰。然后,这些UGC视频经常上载以共享到云服务器上,它们进一步压缩以进行存储和传输。我们的论文解决了预测压缩视频质量的高度实用问题(也许是在压缩过程中,以帮助指导它),仅(可能严重)扭曲的UGC视频作为参考文献。为了解决这个问题,我们开发了一种新颖的视频质量评估(VQA)框架,我们称之为1StepVQA(将其与我们讨论的两步方法区分开来)。 1STEPVQA通过利用自然视频和扭曲视频的统计规律来克服全参考,减少参考和无参考VQA模型的限制。我们表明,1STEPVQA能够更准确地预测压缩视频的质量,给定不完美的参考视频。我们还描述了一个新的专用视频数据库,其中包括(通常是扭曲的)UGC参考视频以及其中大量的压缩版本。我们表明,在这种情况下,1STEPVQA模型优于其他VQA模型。我们正在https://live.ece.utexas.edu/research/onestep/index.html免费提供专用的新数据库
Over the past decade, the online video industry has greatly expanded the volume of visual data that is streamed and shared over the Internet. Moreover, because of the increasing ease of video capture, many millions of consumers create and upload large volumes of User-Generated-Content (UGC) videos. Unlike streaming television or cinematic content produced by professional videographers and cinemagraphers, UGC videos are most commonly captured by naive users having limited skills and imperfect technique, and often are afflicted by highly diverse and mixed in-capture distortions. These UGC videos are then often uploaded for sharing onto cloud servers, where they further compressed for storage and transmission. Our paper tackles the highly practical problem of predicting the quality of compressed videos (perhaps during the process of compression, to help guide it), with only (possibly severely) distorted UGC videos as references. To address this problem, we have developed a novel Video Quality Assessment (VQA) framework that we call 1stepVQA (to distinguish it from two-step methods that we discuss). 1stepVQA overcomes limitations of Full-Reference, Reduced-Reference and No-Reference VQA models by exploiting the statistical regularities of both natural videos and distorted videos. We show that 1stepVQA is able to more accurately predict the quality of compressed videos, given imperfect reference videos. We also describe a new dedicated video database which includes (typically distorted) UGC reference videos, and a large number of compressed versions of them. We show that the 1stepVQA model outperforms other VQA models in this scenario. We are providing the dedicated new database free of charge at https://live.ece.utexas.edu/research/onestep/index.html