论文标题
通过二进制约束的在线凸优化
Online Convex Optimization with Binary Constraints
论文作者
论文摘要
我们考虑使用二进制决策变量和凸出损失功能的在线优化。我们设计了一种新的算法,二进制在线梯度下降(BOGD),并约束了预期的动态遗憾。我们提供了一个遗憾的界限,可以在任何时间范围内持有,并在有限的时间范围内进行专门界限。首先,我们表示遗憾,因为放松,连续的圆形最佳跟踪误差和更新的舍入误差的总和,在某些条件下,前者无症状会随时间减少。然后,我们得出了有限的时间结合,该结合时间是sublinear的,并在松弛的连续圆形优势的累积变化中线性。我们将BOGD应用于恒温控制载荷,要求响应,其中二进制约束模型在/关闭设置中离散。我们还对不确定性和不同负载的可用性进行建模,这取决于温度死条,冷却单元的锁定以及手动覆盖。我们根据需求响应在几个模拟中测试BOGD的性能。模拟证实,在BOGD中随机化的使用不会显着降低性能,同时使问题更加易于处理。
We consider online optimization with binary decision variables and convex loss functions. We design a new algorithm, binary online gradient descent (bOGD) and bound its expected dynamic regret. We provide a regret bound that holds for any time horizon and a specialized bound for finite time horizons. First, we present the regret as the sum of the relaxed, continuous round optimum tracking error and the rounding error of our update in which the former asymptomatically decreases with time under certain conditions. Then, we derive a finite-time bound that is sublinear in time and linear in the cumulative variation of the relaxed, continuous round optima. We apply bOGD to demand response with thermostatically controlled loads, in which binary constraints model discrete on/off settings. We also model uncertainty and varying load availability, which depend on temperature deadbands, lockout of cooling units and manual overrides. We test the performance of bOGD in several simulations based on demand response. The simulations corroborate that the use of randomization in bOGD does not significantly degrade performance while making the problem more tractable.