./devel/py-pyspark, Apache Spark Python API

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 3.0.1, Package name: py310-spark-3.0.1, Maintainer: kamel.derouiche

Spark is a unified analytics engine for large-scale data
processing. It provides high-level APIs in Scala, Java,
Python, and R, and an optimized engine that supports
general computation graphs for data analysis. It also
supports a rich set of higher-level tools including Spark
SQL for SQL and DataFrames, MLlib for machine learning,
GraphX for graph processing, and Structured Streaming
for stream processing.


Master sites:

RMD160: d7c1cb855f861a2100ed948d1b68d7ab64c95672
Filesize: 199454.602 KB

Version history: (Expand)