layout: global type: “page singular” title: MLlib description: MLlib is Apache Spark's scalable machine learning library, with APIs in Java, Scala, Python, and R. subproject: MLlib

<div style="margin-top: 15px; text-align: left; display: inline-block;">
  <div class="code">
    data = spark.read.format(<span class="string">"libsvm"</span>)\<br/>
    &nbsp;&nbsp;.load(<span class="string">"hdfs://..."</span>)<br/>
    <br/>
    model = <span class="sparkop">KMeans</span>(k=10).fit(data)
  </div>
  <div class="caption">Calling MLlib in Python</div>
</div>