PLANC: Parallel Low-rank Approximation with Nonnegativity Constraints
- Georgia Institute of Technology, Atlanta, GA (United States)
- Wake Forest Univ., Winston-Salem, NC (United States)
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
In this work, we consider the problem of low-rank approximation of massive dense nonnegative tensor data, for example, to discover latent patterns in video and imaging applications. As the size of data sets grows, single workstations are hitting bottlenecks in both computation time and available memory. We propose a distributed-memory parallel computing solution to handle massive data sets, loading the input data across the memories of multiple nodes, and performing efficient and scalable parallel algorithms to compute the low-rank approximation. We present a software package called Parallel Low-rank Approximation with Nonnegativity Constraints, which implements our solution and allows for extension in terms of data (dense or sparse, matrices or tensors of any order), algorithm (e.g., from multiplicative updating techniques to alternating direction method of multipliers), and architecture (we exploit GPUs to accelerate the computation in this work). We describe our parallel distributions and algorithms, which are careful to avoid unnecessary communication and computation, show how to extend the software to include new algorithms and/or constraints, and report efficiency and scalability results for both synthetic and real-world data sets.
- Research Organization:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC); National Science Foundation (NSF)
- Grant/Contract Number:
- AC05-00OR22725; OAC-1642385; OAC-1642410; AC02-05CH11231; SC0020347
- OSTI ID:
- 1820808
- Journal Information:
- ACM Transactions on Mathematical Software, Vol. 47, Issue 3; ISSN 0098-3500
- Publisher:
- Association for Computing MachineryCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Parallel Nonnegative CP Decomposition of Dense Tensors
TuckerMPI: A Parallel C++/MPI Software Package for Large-scale Data Compression via the Tucker Tensor Decomposition