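MLlib is a machine learning library built on top of Spark. For example, to run k-means clustering:

    from pyspark.mllib.clustering import KMeans
    model = KMeans.train(rdd, k)

where you pass MLlib a PySpark RDD of feature vectors and the number of clusters k.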
