论文标题
可靠性校正是可靠的开普勒出现率的关键
Reliability Correction is Key for Robust Kepler Occurrence Rates
论文作者
论文摘要
开普勒DR25行星候选目录是使用基于各种测试的行星候选识别方法生产的。对这些测试进行了调整,以在目录完整性和可靠性之间获得合理但任意的平衡。我们通过改变这些测试来生产具有不同余额和可靠性平衡的新目录,并研究了这些替代目录对发生率的影响。我们发现,如果没有校正可靠性,则不同的目录在统计上不一致的发生率,而如果我们纠正完整性和可靠性,我们将获得统计上一致的发生率。这有很大的迹象表明,对完整性和可靠性的纠正对于准确计算发生率至关重要。此外,我们发现这种结果是使用贝叶斯泊松可能性MCMC或近似贝叶斯计算方法的结果。我们还研究了将Robovetter置换评分切成可靠性校正的替代方法的使用,并发现降低分数确实提高了目录的可靠性,但它并不像执行完整的可靠性校正那样准确。在进行有或没有分数的可靠性校正时,我们会获得相同的结果。因此,去除低分行星可以删除数据而无需提供任何优势,应在可能的情况下避免。我们将替代目录公开可用,并建议将这些目录用作发生率方法的测试,并要求方法应为所有这些目录提供统计上一致的发生率。
The Kepler DR25 planet candidate catalog was produced using an automated method of planet candidate identification based on various tests. These tests were tuned to obtain a reasonable but arbitrary balance between catalog completeness and reliability. We produce new catalogs with differing balances of completeness and reliability by varying these tests, and study the impact of these alternative catalogs on occurrence rates. We find that if there is no correction for reliability, different catalogs give statistically inconsistent occurrence rates, while if we correct for both completeness and reliability, we get statistically consistent occurrence rates. This is a strong indication that correction for completeness and reliability is critical for the accurate computation of occurrence rates. Additionally, we find that this result is the same whether using Bayesian Poisson likelihood MCMC or Approximate Bayesian Computation methods. We also examine the use of a Robovetter disposition score cut as an alternative to reliability correction, and find that while a score cut does increase the reliability of the catalog, it is not as accurate as performing a full reliability correction. We get the same result when performing a reliability correction with and without a score cut. Therefore removing low-score planets removes data without providing any advantage, and should be avoided when possible. We make our alternative catalogs publicly available, and propose that these should be used as a test of occurrence rate methods, with the requirement that a method should provide statistically consistent occurrence rates for all these catalogs.