Communication-efficient distributed online prediction by dynamic model synchronization

Kamp, Michael; Boley, Mario; Keren, D.; Schuster, A.; Sharfman, I.

doi:10.1007/978-3-662-44848-9_40

2014

Conference Paper

Abstract

We present the first protocol for distributed online prediction that aims to minimize online prediction loss and network communication at the same time. This protocol can be applied wherever a prediction-based service must be provided timely for each data point of a multitude of high frequency data streams, each of which is observed at a local node of some distributed system. Exemplary applications include social content recommendation and algorithmic trading. The challenge is to balance the joint predictive performance of the nodes by exchanging information between them, while not letting communication overhead deteriorate the responsiveness of the service. Technically, the proposed protocol is based on controlling the variance of the local models in a decentralized way. This approach retains the asymptotic optimal regret of previous algorithms. At the same time, it allows to substantially reduce network communication, and, in contrast to previous approaches, it remain s applicable when the data is non-stationary and shows rapid concept drift. We demonstrate empirically that the protocol is able to hold up a high predictive performance using only a fraction of the communication required by benchmark methods.

Author(s)

Kamp, Michael

Boley, Mario

Keren, D.

Schuster, A.

Sharfman, I.

Mainwork

Machine learning and knowledge discovery in databases : European conference, ECML PKDD 2014. Pt.1

Conference

European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD) 2014

International Conference on Inductive Logic Programming (ILP) 2014

Options

Communication-efficient distributed online prediction by dynamic model synchronization