153 Commits

Author SHA1 Message Date
Donne Martin
36a8dc504f Updated README to include aws and spark. Removed commands suffix from linux, git, jekyll commands as it seemed redundant. 2015-02-28 18:28:50 -05:00
Donne Martin
b0fe318517 Added IPython Notebook for git commands. 2015-02-28 15:14:59 -05:00
Donne Martin
ee4d38ae71 Added __init__.py files to spark and aws folders. 2015-02-28 13:07:31 -05:00
Donne Martin
fb40d146e5 Added ozone data files used in pandas IPython Notebooks. 2015-02-28 13:06:28 -05:00
Donne Martin
43c191c8da Moved AWS IPython Notebook to its own directory. 2015-02-28 13:03:06 -05:00
Donne Martin
7fac30fa79 Renamed folder core to python-core to be more explicit about its contents. 2015-02-28 12:48:47 -05:00
Donne Martin
87402ca5b8 Added IPython Notebook containing HDFS snippets. 2015-02-28 12:44:56 -05:00
Donne Martin
85a316fc29 Added instructions on how to add terminal colors by editing your .bash_profile. 2015-02-28 12:40:18 -05:00
Donne Martin
a2191d03bd Added linux snippet to uncompress all tar.gz files in the current directory to another directory. 2015-02-28 12:38:52 -05:00
Donne Martin
d28c32b8da Added grep snippets to check number of files matching a search term and snippet to check the number of MapReduce records processed. 2015-02-28 12:37:18 -05:00
Donne Martin
133cddb267 Added linux commands to count lines and split files into multiple parts based on line counts. 2015-02-28 12:32:08 -05:00
Donne Martin
a709c709ce Added linux commands IPython Notebook, initially contains disk usage commands. 2015-02-28 12:30:36 -05:00
Donne Martin
68b826c212 Added note about donnemartin.com, my mirror site. 2015-02-28 09:17:43 -05:00
Donne Martin
fb1768f39e Added comments on the jekyll build and serve commands. Added Jekyll to list of IPython Notebooks. 2015-02-28 09:15:38 -05:00
Donne Martin
06b1d3bf2a Added Redshift snippets. 2015-02-24 07:17:26 -05:00
Donne Martin
89a7bf4b93 Added mrjob snippet to run a job on EMR or locally. 2015-02-23 11:48:03 -05:00
Donne Martin
7b15eb949b Moved S3DistCp below the S3 specific commands to group with upcoming EMR. 2015-02-22 07:12:15 -05:00
Donne Martin
39281aaaa8 Added Jekyll commands IPython Notebook. 2015-02-21 17:57:55 -05:00
Donne Martin
8b82750526 Added console, CLI, s3cmd note. Added URL to s3-parallel-put GitHub repo. 2015-02-20 10:16:35 -05:00
Donne Martin
525da02e17 Added use cases for S3DistCp. 2015-02-19 08:48:01 -05:00
Donne Martin
c8569ed3d5 Added s3cmd intro and note about encryption and performance. 2015-02-19 08:45:11 -05:00
Donne Martin
2ac42e46da Added snippets for s3-parallel-put, a command line tool to upload files to S3 in parallel. 2015-02-19 08:41:07 -05:00
Donne Martin
c454d12a45 Added Think Stats reference. Removed ref attribute from other References. 2015-02-18 06:25:08 -05:00
Donne Martin
987ca75c6b Added AWS command lines IPython Notebook to README. 2015-02-18 06:13:40 -05:00
Donne Martin
50a79a011c Added snippets for setting up S3cmd and several frequently used commands. 2015-02-18 06:11:40 -05:00
Donne Martin
eff1a1fab4 Added AWS command line snippet to run S3DistCp. Added snippet to control compression. 2015-02-17 20:10:14 -05:00
Donne Martin
5f554c928e Added AWS commands IPython Notebook. Currently contains command lines to connect to AWS Linux and Ubuntu instances. 2015-02-17 20:03:27 -05:00
Donne Martin
f591857c77 Reworked snippets for opening and closing a file to use 'with open' instead of open/close 2015-02-16 17:11:42 -05:00
Donne Martin
8b974b811a Added snippet to read and write utf-8. 2015-02-16 17:01:10 -05:00
Donne Martin
4af62400e3 Added commands stub folder for upcoming command line focused IPython Notebooks 2015-02-15 17:59:16 -05:00
Donne Martin
8bb029ca75 Removed double spacing for Pandas IPython Notebooks. 2015-02-15 17:57:20 -05:00
Donne Martin
99477591d1 Updated README with Pandas IO, Pandas Cleaning, and note about various command lines (coming soon). 2015-02-15 17:56:06 -05:00
Donne Martin
7563eeae21 Added code to read CSV data to Pandas, describe, list head, then write the CSV to another file 2015-02-15 17:48:13 -05:00
Donne Martin
d6348012a4 Added snippet to drop a column in a DataFrame. Renamed pop to population to avoid clashing with the DataFrame pop function. 2015-02-14 06:42:44 -05:00
Donne Martin
f31a289eab Fixed bug in replace snippet where a copy of a DataFrame was being created instead of doing the replace in place. 2015-02-13 15:54:23 -05:00
Donne Martin
c7bc8e386f Added snippet to concatenate two DataFrames. 2015-02-13 15:53:15 -05:00
Donne Martin
90baf301b6 Added snippet to check for matching values in a specific column for replacement. 2015-02-13 15:51:25 -05:00
Donne Martin
f087dcd6c6 Added comments to the two different flavors of string replacement snippets to better explain the operation. 2015-02-11 16:51:18 -05:00
Donne Martin
89ce172c77 Added IPython Notebook for cleaning data with Pandas. Added snippets for replacing strings. 2015-02-10 17:44:21 -05:00
Donne Martin
23e62231d2 Added snippets for Summarizing and Computing Descriptive Statistics. 2015-02-08 16:34:16 -05:00
Donne Martin
f27f699078 Added References section. 2015-02-06 08:28:25 -05:00
Donne Martin
c20545aff7 Added snippets for Axis Indexes with Duplicate Values. 2015-02-06 08:21:46 -05:00
Donne Martin
12bbfb9678 Added DataFrame ranking snippets. 2015-02-01 07:33:52 -05:00
Donne Martin
3f5e508eb6 Added Series ranking snippets. Tweaked some of the comments positions relative to the code. Minor tweaks to some snippets. 2015-02-01 07:33:19 -05:00
Donne Martin
91cdd02752 Added snippets for sorting Series and DataFrames. Added index to notebook. 2015-01-31 16:39:35 -05:00
Donne Martin
936b8adee4 Added function application and mapping snippets for Series and DataFrames. 2015-01-31 08:46:27 -05:00
Donne Martin
f217da8adf Added snippets for operations between DataFrames and Series. 2015-01-31 08:26:48 -05:00
Donne Martin
ebe737d11e Standardized on df as the DataFrame variable for code snippets. 2015-01-31 08:00:08 -05:00
Donne Martin
45634da9ff Seeded random for more predictability between iterations. Added snippets for setting a fill value for indices that do not overlap for arithmetic operations. 2015-01-31 07:46:16 -05:00
Donne Martin
418676f075 Added snippets for basic Arithmetic and Data Alignment for DataFrames. 2015-01-31 07:37:05 -05:00