Commit Graph

196 Commits

Author SHA1 Message Date
Donne Martin
4ad409aa63 Added snippets to start exploring the Titanic data. 2015-03-14 19:56:28 -04:00
Donne Martin
bcfae90101 Added preliminary Kaggle Titanic survivor analysis containing the competition description, evaluation, data set, and snippet to read in the data to pandas. 2015-03-14 19:53:56 -04:00
Donne Martin
1fbbd20c68 Added Kaggle Titanic data files. 2015-03-14 19:49:07 -04:00
Donne Martin
8196b4bdfb Updated repo description. 2015-03-14 09:22:01 -04:00
Donne Martin
ce605a6fdf Added snippets for configuring Spark applications. 2015-03-13 08:25:50 -04:00
Donne Martin
53789e0e3e Prefixed Spark commands with ! so they can be executed within IPython Notebook. 2015-03-13 08:09:01 -04:00
Donne Martin
87b017fd37 Prefixed HDFS commands with ! so they can be executed within IPython Notebook. 2015-03-13 08:07:17 -04:00
Donne Martin
8c251e43cd Prefixed various misc commands with ! so they can be executed within IPython Notebook. 2015-03-13 08:05:56 -04:00
Donne Martin
8c4541ae33 Added git reset and pull commands. 2015-03-13 08:03:01 -04:00
Donne Martin
a9ea93b872 Prefixed Linux commands with ! so they can be executed within IPython Notebook. 2015-03-13 07:59:28 -04:00
Donne Martin
23d3866b8e Prefixed AWS commands with ! so they can be executed within IPython Notebook. 2015-03-13 07:57:12 -04:00
Donne Martin
1c4e2157a6 Added snippets to demonstrate writing and running a Spark app. 2015-03-12 06:25:40 -04:00
Donne Martin
9fd62a73ae Added sed command to delete matching lines in place. Added command to display all matching running processes with full formatting. Tweaked formatting of vim section regarding vimtutor and vim syntax coloring. 2015-03-11 20:30:23 -04:00
Donne Martin
31c4f3299a Updated AWS index. 2015-03-10 17:00:44 -04:00
Donne Martin
5600ab0377 Added Lambda commands. 2015-03-10 17:00:08 -04:00
Donne Martin
cd84ffb2f0 Added Kinesis commands. 2015-03-09 16:10:54 -04:00
Donne Martin
1815c9a122 Added snippets to checkpoint RDDs in Spark. 2015-03-08 05:55:45 -04:00
Donne Martin
0481497848 Added snippets to cache RDDs in Spark. 2015-03-08 05:55:05 -04:00
Donne Martin
404676a1f7 Added discussion and snippet for working with partitions in Spark. 2015-03-07 09:07:18 -05:00
Donne Martin
bef3dfc9fc Added discussion on viewing the Spark application UI. 2015-03-06 07:53:17 -05:00
Donne Martin
72cf3af7f1 Added snippets to run Spark on a cluster. 2015-03-05 07:26:54 -05:00
Donne Martin
a5a3da5b28 Added Spark pair RDDs snippets. 2015-03-04 08:28:07 -05:00
Donne Martin
e8b481f480 Added snippets for basic RDD operations. 2015-03-03 10:36:30 -05:00
Donne Martin
6c7e7b5239 Added Spark IPython Notebook, currently contains snippets for starting the pyspark shell and viewing the spark context. 2015-03-03 10:32:59 -05:00
Donne Martin
a0ac867b7b Added various Linux compression commands. 2015-03-03 10:17:10 -05:00
Donne Martin
47211cb729 Added anchors for each AWS command line topic. Updated README with AWS topics. 2015-03-02 10:32:03 -05:00
Donne Martin
39db0b5057 Added curl commands. 2015-03-01 16:27:48 -05:00
Donne Martin
f1f69fbd19 Added commands to view running processes. 2015-03-01 15:51:25 -05:00
Donne Martin
14ea9025c1 Added Redshift reference tables for create, sort key, dist key, and discussions on how to choose the appropriate keys. 2015-03-01 08:39:59 -05:00
Donne Martin
17e7736974 Changed Vim commands cell type to code for better formatting on nbviewer. 2015-03-01 08:18:24 -05:00
Donne Martin
11d7e041fb Added Vim commands. 2015-03-01 08:16:23 -05:00
Donne Martin
80741219ce Updated repo image. 2015-03-01 07:34:49 -05:00
Donne Martin
d0c4f48469 Tweaked header anchors to work with nbviewer. 2015-03-01 07:00:42 -05:00
Donne Martin
49c6ae6488 Added anchors for each misc command topic. Updated README with misc command anchors. 2015-03-01 06:56:15 -05:00
Donne Martin
ab3bfad838 Added hyperlinks for each topic listed in misc commands IPython Notebook. Tweaked Jekyll description. 2015-03-01 06:25:57 -05:00
Donne Martin
f9f5e6bfe1 Added index of contents to linux IPython Notebook. 2015-03-01 06:22:43 -05:00
Donne Martin
09b45a01fa Added IPython Notebook commands. 2015-03-01 06:19:18 -05:00
Donne Martin
8256531f24 Added Anaconda commands. Added Git description. 2015-03-01 06:01:30 -05:00
Donne Martin
d7b54123e2 Added Ruby commands. Ruby is used to interact with the AWS command line and for Jekyll, a blog framework hosted on GitHub Pages. 2015-03-01 05:57:09 -05:00
Donne Martin
2363787d59 Combined git and jekyll commands to misc commands IPython Notebook. 2015-03-01 05:53:38 -05:00
Donne Martin
7586ae1b73 Removed incomplete snippet for pandas idxmax causing an exception. 2015-02-28 18:37:44 -05:00
Donne Martin
1689cc3e76 Added image source used for slice snippets. 2015-02-28 18:34:17 -05:00
Donne Martin
21f593da5d Added repo image. 2015-02-28 18:29:16 -05:00
Donne Martin
36a8dc504f Updated README to include aws and spark. Removed commands suffix from linux, git, jekyll commands as it seemed redundanct. 2015-02-28 18:28:50 -05:00
Donne Martin
b0fe318517 Added IPython Notebook for git commands. 2015-02-28 15:14:59 -05:00
Donne Martin
ee4d38ae71 Added __init__.py files to spark and aws folders. 2015-02-28 13:07:31 -05:00
Donne Martin
fb40d146e5 Added ozone data files used in pandas IPython Notebooks. 2015-02-28 13:06:28 -05:00
Donne Martin
43c191c8da Moved AWS IPython Notebook to its own directory. 2015-02-28 13:03:06 -05:00
Donne Martin
7fac30fa79 Renamed folder core to python-core to be more explicit about its contents. 2015-02-28 12:48:47 -05:00
Donne Martin
87402ca5b8 Added IPython Notebook containing HDFS snippets. 2015-02-28 12:44:56 -05:00