Package pyspark
[frames] | no frames]

Package pyspark

source code

PySpark is the Python API for Spark.

Public classes:

Submodules

Classes
  SparkContext
Main entry point for Spark functionality.
  RDD
A Resilient Distributed Dataset (RDD), the basic abstraction in Spark.
  SparkFiles
Resolves paths to files added through SparkContext.addFile().
  StorageLevel
Flags for controlling the storage of an RDD.