Title
Constraints on future analysis metadata systems in High Energy Physics
Authors
Abstract
In High Energy Physics (HEP), analysis metadata comes in many forms -- from theoretical cross-sections, to calibration corrections, to details about file processing. Correctly applying metadata is a crucial and often time-consuming step in an analysis, but the design of analysis metadata systems has historically received little direct attention. Among other considerations, an ideal metadata tool should be easy for new analysers to use, should scale to large data volumes and diverse processing paradigms, and should enable future analysis reinterpretation. This document, which is the product of community discussions organised by the HEP Software Foundation, categorises types of metadata by scope and format and gives examples of current metadata solutions. Important design considerations for metadata systems, including sociological factors, analysis preservation efforts, and technical factors, are discussed. A list of best practices and technical requirements for future analysis metadata systems is presented. These best practices could guide the development of a future cross-experimental effort for analysis metadata tools.