Commit Graph

381 Commits

Author SHA1 Message Date
Donne Martin
cd84ffb2f0 Added Kinesis commands. 2015-03-09 16:10:54 -04:00
Donne Martin
1815c9a122 Added snippets to checkpoint RDDs in Spark. 2015-03-08 05:55:45 -04:00
Donne Martin
0481497848 Added snippets to cache RDDs in Spark. 2015-03-08 05:55:05 -04:00
Donne Martin
404676a1f7 Added discussion and snippet for working with partitions in Spark. 2015-03-07 09:07:18 -05:00
Donne Martin
bef3dfc9fc Added discussion on viewing the Spark application UI. 2015-03-06 07:53:17 -05:00
Donne Martin
72cf3af7f1 Added snippets to run Spark on a cluster. 2015-03-05 07:26:54 -05:00
Donne Martin
a5a3da5b28 Added Spark pair RDDs snippets. 2015-03-04 08:28:07 -05:00
Donne Martin
e8b481f480 Added snippets for basic RDD operations. 2015-03-03 10:36:30 -05:00
Donne Martin
6c7e7b5239 Added Spark IPython Notebook, currently contains snippets for starting the pyspark shell and viewing the spark context. 2015-03-03 10:32:59 -05:00
Donne Martin
a0ac867b7b Added various Linux compression commands. 2015-03-03 10:17:10 -05:00
Donne Martin
47211cb729 Added anchors for each AWS command line topic. Updated README with AWS topics. 2015-03-02 10:32:03 -05:00
Donne Martin
39db0b5057 Added curl commands. 2015-03-01 16:27:48 -05:00
Donne Martin
f1f69fbd19 Added commands to view running processes. 2015-03-01 15:51:25 -05:00
Donne Martin
14ea9025c1 Added Redshift reference tables for create, sort key, dist key, and discussions on how to choose the appropriate keys. 2015-03-01 08:39:59 -05:00
Donne Martin
17e7736974 Changed Vim commands cell type to code for better formatting on nbviewer. 2015-03-01 08:18:24 -05:00
Donne Martin
11d7e041fb Added Vim commands. 2015-03-01 08:16:23 -05:00
Donne Martin
80741219ce Updated repo image. 2015-03-01 07:34:49 -05:00
Donne Martin
d0c4f48469 Tweaked header anchors to work with nbviewer. 2015-03-01 07:00:42 -05:00
Donne Martin
49c6ae6488 Added anchors for each misc command topic. Updated README with misc command anchors. 2015-03-01 06:56:15 -05:00
Donne Martin
ab3bfad838 Added hyperlinks for each topic listed in misc commands IPython Notebook. Tweaked Jekyll description. 2015-03-01 06:25:57 -05:00
Donne Martin
f9f5e6bfe1 Added index of contents to linux IPython Notebook. 2015-03-01 06:22:43 -05:00
Donne Martin
09b45a01fa Added IPython Notebook commands. 2015-03-01 06:19:18 -05:00
Donne Martin
8256531f24 Added Anaconda commands. Added Git description. 2015-03-01 06:01:30 -05:00
Donne Martin
d7b54123e2 Added Ruby commands. Ruby is used to interact with the AWS command line and for Jekyll, a blog framework hosted on GitHub Pages. 2015-03-01 05:57:09 -05:00
Donne Martin
2363787d59 Combined git and jekyll commands to misc commands IPython Notebook. 2015-03-01 05:53:38 -05:00
Donne Martin
7586ae1b73 Removed incomplete snippet for pandas idxmax causing an exception. 2015-02-28 18:37:44 -05:00
Donne Martin
1689cc3e76 Added image source used for slice snippets. 2015-02-28 18:34:17 -05:00
Donne Martin
21f593da5d Added repo image. 2015-02-28 18:29:16 -05:00
Donne Martin
36a8dc504f Updated README to include aws and spark. Removed commands suffix from linux, git, jekyll commands as it seemed redundanct. 2015-02-28 18:28:50 -05:00
Donne Martin
b0fe318517 Added IPython Notebook for git commands. 2015-02-28 15:14:59 -05:00
Donne Martin
ee4d38ae71 Added __init__.py files to spark and aws folders. 2015-02-28 13:07:31 -05:00
Donne Martin
fb40d146e5 Added ozone data files used in pandas IPython Notebooks. 2015-02-28 13:06:28 -05:00
Donne Martin
43c191c8da Moved AWS IPython Notebook to its own directory. 2015-02-28 13:03:06 -05:00
Donne Martin
7fac30fa79 Renamed folder core to python-core to be more explicit about its contents. 2015-02-28 12:48:47 -05:00
Donne Martin
87402ca5b8 Added IPython Notebook containing HDFS snippets. 2015-02-28 12:44:56 -05:00
Donne Martin
85a316fc29 Added instructions on how to add terminal colors by editing your .bash_profile. 2015-02-28 12:40:18 -05:00
Donne Martin
a2191d03bd Added linux snippet to uncompress all tar.gz files in the current directory to another directory. 2015-02-28 12:38:52 -05:00
Donne Martin
d28c32b8da Added grep snippets to check number of files matching a search term and snippet to check the number of MapReduce records processed. 2015-02-28 12:37:18 -05:00
Donne Martin
133cddb267 Added linux commands to count lines and split files into multiple parts based on line counts. 2015-02-28 12:32:08 -05:00
Donne Martin
a709c709ce Added linux commands IPython Notebook, initially contains command disk usage commands. 2015-02-28 12:30:36 -05:00
Donne Martin
68b826c212 Added note about donnemartin.com, my mirror site. 2015-02-28 09:17:43 -05:00
Donne Martin
fb1768f39e Added comments on the jekyll build and serve commands. Added Jekyll to list of IPython Notebooks. 2015-02-28 09:15:38 -05:00
Donne Martin
06b1d3bf2a Added Redshift snippets. 2015-02-24 07:17:26 -05:00
Donne Martin
89a7bf4b93 Added mrjob snippet to run a job on EMR or locally. 2015-02-23 11:48:03 -05:00
Donne Martin
7b15eb949b Moved S3DistCp below the S3 specific commands to group with upcoming EMR. 2015-02-22 07:12:15 -05:00
Donne Martin
39281aaaa8 Added Jekyll commands IPython Notebook. 2015-02-21 17:57:55 -05:00
Donne Martin
8b82750526 Added console, CLI, s3cmd note. Added URL to s3-parallel-put GitHub repo. 2015-02-20 10:16:35 -05:00
Donne Martin
525da02e17 Added use cases for S3DistCp. 2015-02-19 08:48:01 -05:00
Donne Martin
c8569ed3d5 Added s3cmd intro and note about encryption and performance. 2015-02-19 08:45:11 -05:00
Donne Martin
2ac42e46da Added snippets for s3-parallel-put, a command line tool to upload files to S3 in parallel. 2015-02-19 08:41:07 -05:00