Scaling Distributed Machine Learning with the Parameter Server
by James Long, Vanja Josifovski, Amr Ahmed, Alexander Smola, Jun W Park, David Andersen, Mu Li
show details
Details
url: | http://www.cs.cmu.edu/~15712/papers/li14.pdf | abstract: | We propose a parameter server framework for distributed machine learning problems. Both data and workloads are distributed over worker nodes, while the server nodes maintain globally shared parameters, represented as dense or sparse vectors and matrices. The framework manages asynchronous data communication between nodes, and supports flexible consistency models, elastic scalability, and continuous fault tolerance.
To demonstrate the scalability of the proposed frame- work, we show experimental results on petabytes of real data with billions of examples and parameters on prob- lems ranging from Sparse Logistic Regression to Latent Dirichlet Allocation and Distributed Sketching.
|
|
|
You need to log in to add tags and post comments.