论文标题
一种非参数方法,用于对消费者选择进行建模的边际
A Nonparametric Approach with Marginals for Modeling Consumer Choice
论文作者
论文摘要
给定有关消费者对不同报价集的选择的数据,一个关键的挑战是开发简约的模型来描述和预测消费者选择行为,同时可以符合规定性任务,例如定价和分类优化。边际分布模型(MDM)就是这样一种模型,它仅需要随机实用程序边际分布的规范。本文旨在建立必要和充分的条件,以使给定的选择数据与MDM假设一致,这是受到随机效用模型的相似特征的有用性(RUM)的启发。这项工作导致了MDM可以代表的选择概率集的精确表征。使用此表征验证选择数据的一致性等于求解多项式线性程序。由于朗姆酒的类似验证任务在计算上是棘手的,而这两个模型都不包含另一个模型,因此MDM有助于达到障碍性和表示能力之间的平衡。然后将表征用于强大的优化,以对新看不见的分类进行数据驱动的销售和收入预测。当选择数据与MDM假设缺乏一致性时,发现最合适的MDM选择概率降低了解决混合整数凸面程序。使用现实世界数据和合成数据的数值结果表明,与朗姆酒和参数模型相比,MDM表现出竞争性的代表性和预测性能,同时在计算中比朗姆酒要快得多。
Given data on the choices made by consumers for different offer sets, a key challenge is to develop parsimonious models that describe and predict consumer choice behavior while being amenable to prescriptive tasks such as pricing and assortment optimization. The marginal distribution model (MDM) is one such model, which requires only the specification of marginal distributions of the random utilities. This paper aims to establish necessary and sufficient conditions for given choice data to be consistent with the MDM hypothesis, inspired by the usefulness of similar characterizations for the random utility model (RUM). This endeavor leads to an exact characterization of the set of choice probabilities that the MDM can represent. Verifying the consistency of choice data with this characterization is equivalent to solving a polynomial-sized linear program. Since the analogous verification task for RUM is computationally intractable and neither of these models subsumes the other, MDM is helpful in striking a balance between tractability and representational power. The characterization is then used with robust optimization for making data-driven sales and revenue predictions for new unseen assortments. When the choice data lacks consistency with the MDM hypothesis, finding the best-fitting MDM choice probabilities reduces to solving a mixed integer convex program. Numerical results using real world data and synthetic data demonstrate that MDM exhibits competitive representational power and prediction performance compared to RUM and parametric models while being significantly faster in computation than RUM.