PySpark Algorithms
by Mahmoud Parsian
English | 2019 | ISBN: B07WQHTVCJ | 682 Pages | EPUB | 16 MB
by Mahmoud Parsian
English | 2019 | ISBN: B07WQHTVCJ | 682 Pages | EPUB | 16 MB
This book is about PySpark: Python API for Spark.
Apache Spark is an analytics engine for large-scale
data processing. Spark is the open source cluster
computing system that makes data analytics fast
to write and fast to run. This book provides a large
set of recipes for implementing big data processing
and analytics using Spark and Python. The goal of this
book is to show working examples in PySpark so that
you can do your ETL and analytics easier. You may
cut and paste examples to deliver your applications
in PySpark.