148 Commits

Author SHA1 Message Date
Donne Martin
7fac30fa79 Renamed folder core to python-core to be more explicit about its contents. 2015-02-28 12:48:47 -05:00
Donne Martin
87402ca5b8 Added IPython Notebook containing HDFS snippets. 2015-02-28 12:44:56 -05:00
Donne Martin
85a316fc29 Added instructions on how to add terminal colors by editing your .bash_profile. 2015-02-28 12:40:18 -05:00
Donne Martin
a2191d03bd Added linux snippet to uncompress all tar.gz files in the current directory to another directory. 2015-02-28 12:38:52 -05:00
Donne Martin
d28c32b8da Added grep snippets to check number of files matching a search term and snippet to check the number of MapReduce records processed. 2015-02-28 12:37:18 -05:00
Donne Martin
133cddb267 Added linux commands to count lines and split files into multiple parts based on line counts. 2015-02-28 12:32:08 -05:00
Donne Martin
a709c709ce Added linux commands IPython Notebook, initially contains disk usage commands. 2015-02-28 12:30:36 -05:00
Donne Martin
68b826c212 Added note about donnemartin.com, my mirror site. 2015-02-28 09:17:43 -05:00
Donne Martin
fb1768f39e Added comments on the jekyll build and serve commands. Added Jekyll to list of IPython Notebooks. 2015-02-28 09:15:38 -05:00
Donne Martin
06b1d3bf2a Added Redshift snippets. 2015-02-24 07:17:26 -05:00
Donne Martin
89a7bf4b93 Added mrjob snippet to run a job on EMR or locally. 2015-02-23 11:48:03 -05:00
Donne Martin
7b15eb949b Moved S3DistCp below the S3 specific commands to group with upcoming EMR. 2015-02-22 07:12:15 -05:00
Donne Martin
39281aaaa8 Added Jekyll commands IPython Notebook. 2015-02-21 17:57:55 -05:00
Donne Martin
8b82750526 Added console, CLI, s3cmd note. Added URL to s3-parallel-put GitHub repo. 2015-02-20 10:16:35 -05:00
Donne Martin
525da02e17 Added use cases for S3DistCp. 2015-02-19 08:48:01 -05:00
Donne Martin
c8569ed3d5 Added s3cmd intro and note about encryption and performance. 2015-02-19 08:45:11 -05:00
Donne Martin
2ac42e46da Added snippets for s3-parallel-put, a command line tool to upload files to S3 in parallel. 2015-02-19 08:41:07 -05:00
Donne Martin
c454d12a45 Added Think Stats reference. Removed ref attribute from other References. 2015-02-18 06:25:08 -05:00
Donne Martin
987ca75c6b Added AWS command lines IPython Notebook to README. 2015-02-18 06:13:40 -05:00
Donne Martin
50a79a011c Added snippets for setting up S3cmd and several frequently used commands. 2015-02-18 06:11:40 -05:00
Donne Martin
eff1a1fab4 Added AWS command line snippet to run S3DistCp. Added snippet to control compression. 2015-02-17 20:10:14 -05:00
Donne Martin
5f554c928e Added AWS commands IPython Notebook. Currently contains command lines to connect to AWS Linux and Ubuntu instances. 2015-02-17 20:03:27 -05:00
Donne Martin
f591857c77 Reworked snippets for opening and closing a file to use 'with open' instead of open/close 2015-02-16 17:11:42 -05:00
Donne Martin
8b974b811a Added snippet to read and write utf-8. 2015-02-16 17:01:10 -05:00
Donne Martin
4af62400e3 Added commands stub folder for upcoming command line focused IPython Notebooks 2015-02-15 17:59:16 -05:00
Donne Martin
8bb029ca75 Removed double spacing for Pandas IPython Notebooks. 2015-02-15 17:57:20 -05:00
Donne Martin
99477591d1 Updated README with Pandas IO, Pandas Cleaning, and note about various command lines (coming soon). 2015-02-15 17:56:06 -05:00
Donne Martin
7563eeae21 Added code to read CSV data to Pandas, describe, list head, then write the CSV to another file 2015-02-15 17:48:13 -05:00
Donne Martin
d6348012a4 Added snippet to drop a column in a DataFrame. Renamed pop to population to avoid clashing with the DataFrame pop function. 2015-02-14 06:42:44 -05:00
Donne Martin
f31a289eab Fixed bug in replace snippet where a copy of a DataFrame was being created instead of doing the replace in place. 2015-02-13 15:54:23 -05:00
Donne Martin
c7bc8e386f Added snippet to concatenate two DataFrames. 2015-02-13 15:53:15 -05:00
Donne Martin
90baf301b6 Added snippet to check for matching values in a specific column for replacement. 2015-02-13 15:51:25 -05:00
Donne Martin
f087dcd6c6 Added comments to the two different flavors of string replacement snippets to better explain the operation. 2015-02-11 16:51:18 -05:00
Donne Martin
89ce172c77 Added IPython Notebook for cleaning data with Pandas. Added snippets for replacing strings. 2015-02-10 17:44:21 -05:00
Donne Martin
23e62231d2 Added snippets for Summarizing and Computing Descriptive Statistics. 2015-02-08 16:34:16 -05:00
Donne Martin
f27f699078 Added References section. 2015-02-06 08:28:25 -05:00
Donne Martin
c20545aff7 Added snippets for Axis Indexes with Duplicate Values. 2015-02-06 08:21:46 -05:00
Donne Martin
12bbfb9678 Added DataFrame ranking snippets. 2015-02-01 07:33:52 -05:00
Donne Martin
3f5e508eb6 Added Series ranking snippets. Tweaked some of the comments positions relative to the code. Minor tweaks to some snippets. 2015-02-01 07:33:19 -05:00
Donne Martin
91cdd02752 Added snippets for sorting Series and DataFrames. Added index to notebook. 2015-01-31 16:39:35 -05:00
Donne Martin
936b8adee4 Added function application and mapping snippets for Series and DataFrames. 2015-01-31 08:46:27 -05:00
Donne Martin
f217da8adf Added snippets for operations between DataFrames and Series. 2015-01-31 08:26:48 -05:00
Donne Martin
ebe737d11e Standardized on df as the DataFrame variable for code snippets. 2015-01-31 08:00:08 -05:00
Donne Martin
45634da9ff Seeded random for more predictability between iterations. Added snippets for setting a fill value for indices that do not overlap for arithmetic operations. 2015-01-31 07:46:16 -05:00
Donne Martin
418676f075 Added snippets for basic Arithmetic and Data Alignment for DataFrames. 2015-01-31 07:37:05 -05:00
Donne Martin
c2bf4dfda5 Added snippets for basic Arithmetic and Data Alignment for Series. 2015-01-31 07:36:02 -05:00
Donne Martin
c821f2d6f3 Added snippets for indexing, selecting, and filtering on a DataFrame. 2015-01-31 06:30:06 -05:00
Donne Martin
7cd3f6af15 Added snippets for indexing, selecting, and filtering on a series. 2015-01-31 06:29:43 -05:00
Donne Martin
2398a0bf55 Added dropping entries snippets. 2015-01-29 15:16:25 -05:00
Donne Martin
46c1be9a02 Added note about pandas index objects being immutable and holding certain metadata. 2015-01-29 13:00:02 -05:00