File tree Expand file tree Collapse file tree 1 file changed +9
-0
lines changed Expand file tree Collapse file tree 1 file changed +9
-0
lines changed Original file line number Diff line number Diff line change @@ -282,6 +282,15 @@ and distributed matrices. Converting a distributed matrix to a different format
282
282
global shuffle, which is quite expensive. We implemented three types of distributed matrices in
283
283
this release and will add more types in the future.
284
284
285
+ The basic type is called ` RowMatrix ` . A ` RowMatrix ` is a row-oriented distributed
286
+ matrix without meaningful row indices, e.g., a collection of feature vectors.
287
+ It is backed by an RDD of its rows, where each row is a local vector.
288
+ We assume that the number of columns is not huge for a ` RowMatrix ` .
289
+ An ` IndexedRowMatrix ` is similar to a ` RowMatrix ` but with row indices,
290
+ which can be used for identifying rows and joins.
291
+ A ` CoordinateMatrix ` is a distributed matrix stored in [ coordinate list (COO)] ( https://en.wikipedia.org/wiki/Sparse_matrix ) format,
292
+ backed by an RDD of its entries.
293
+
285
294
*** Note***
286
295
287
296
The underlying RDDs of a distributed matrix must be deterministic, because we cache the matrix size.
You can’t perform that action at this time.
0 commit comments