Commit Graph

32 Commits (master)

Author SHA1 Message Date
Donne Martin 138cd1054e Add note on DataFrame recommendation over RDD 2016-02-21 06:21:59 -05:00
Donne Martin 34889ce7c8 Add more Spark DataFrame examples 2016-02-21 06:16:34 -05:00
Donne Martin e4e1284a15 Move DataFrames before RDDs 2016-02-21 06:00:40 -05:00
Alessandro b15edb7585 Added DataFrames section and cleared outputs 2016-02-20 19:49:32 +01:00
Alessandro d4450573c3 Added DataFrames section 2016-02-20 19:35:22 +01:00
Donne Martin 2690450e0f Fixed notebook author and license info no longer rendering properly on nbviewer. 2015-11-01 06:44:00 -05:00
Donne Martin 9195809015 Added simplified Spark installation instructions from the repo: https://github.com/donnemartin/dev-setup. 2015-08-11 21:42:53 -04:00
Donne Martin 1405875382 Updated IPython Notebook Setup section to link to Python 3 and Spark 1.4 setup. Labeled Python 2 setup links. 2015-06-23 22:08:53 -04:00
Donne Martin e8b32cffa1 Tweaked RDD collect() description to be more descriptive. 2015-06-23 22:08:00 -04:00
Donne Martin 26349dad5d Added author and GitHub repo link to notebooks. 2015-06-18 21:07:36 -04:00
Donne Martin 2f7866df2f Added instructions link to configure IPython Notebook and PySpark. 2015-06-09 18:16:55 -04:00
Donne Martin 2d1f1e8c36 Added more info about the spark shell, context, and RDDs. 2015-06-09 17:57:18 -04:00
Donne Martin c240bde8c0 Updated notebook to v3. 2015-05-24 10:49:36 -04:00
Donne Martin 2106accb6d Updated notebook to v3. 2015-05-24 10:49:21 -04:00
Donne Martin 011747c17a Added Spark accumulators snippets. 2015-03-31 21:41:21 -04:00
Donne Martin 7195c5bc82 Added Spark broadcast variables snippets. 2015-03-30 19:01:07 -04:00
Donne Martin b3fb4ae219 Added Spark streaming with states snippets. 2015-03-29 17:53:52 -04:00
Donne Martin 10d63efb4a Added Spark streaming snippets. 2015-03-16 16:01:51 -04:00
Donne Martin ce605a6fdf Added snippets for configuring Spark applications. 2015-03-13 08:25:50 -04:00
Donne Martin 53789e0e3e Prefixed Spark commands with ! so they can be executed within IPython Notebook. 2015-03-13 08:09:01 -04:00
Donne Martin 87b017fd37 Prefixed HDFS commands with ! so they can be executed within IPython Notebook. 2015-03-13 08:07:17 -04:00
Donne Martin 1c4e2157a6 Added snippets to demonstrate writing and running a Spark app. 2015-03-12 06:25:40 -04:00
Donne Martin 1815c9a122 Added snippets to checkpoint RDDs in Spark. 2015-03-08 05:55:45 -04:00
Donne Martin 0481497848 Added snippets to cache RDDs in Spark. 2015-03-08 05:55:05 -04:00
Donne Martin 404676a1f7 Added discussion and snippet for working with partitions in Spark. 2015-03-07 09:07:18 -05:00
Donne Martin bef3dfc9fc Added discussion on viewing the Spark application UI. 2015-03-06 07:53:17 -05:00
Donne Martin 72cf3af7f1 Added snippets to run Spark on a cluster. 2015-03-05 07:26:54 -05:00
Donne Martin a5a3da5b28 Added Spark pair RDDs snippets. 2015-03-04 08:28:07 -05:00
Donne Martin e8b481f480 Added snippets for basic RDD operations. 2015-03-03 10:36:30 -05:00
Donne Martin 6c7e7b5239 Added Spark IPython Notebook, currently contains snippets for starting the pyspark shell and viewing the spark context. 2015-03-03 10:32:59 -05:00
Donne Martin ee4d38ae71 Added __init__.py files to spark and aws folders. 2015-02-28 13:07:31 -05:00
Donne Martin 87402ca5b8 Added IPython Notebook containing HDFS snippets. 2015-02-28 12:44:56 -05:00