Added section links to index. Some cleanup.

This commit is contained in:
Donne Martin 2015-05-31 05:38:04 -04:00
parent 0463617683
commit a8197d8cc4

View File

@ -4,15 +4,15 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"# Hadoop MapReduce: Python Streaming with mrjob\n",
"# Python Hadoop MapReduce: Analyzing AWS S3 Bucket Logs with mrjob\n",
"\n",
"* Introduction\n",
"* Setup\n",
"* Processing S3 Logs\n",
"* Running Amazon Elastic MapReduce (EMR) Jobs\n",
"* Unit Testing S3 Logs\n",
"* Running S3 Logs Unit Test\n",
"* Sample .mrjob.conf"
"* [Introduction](#Introduction)\n",
"* [Setup](#Setup)\n",
"* [Processing S3 Logs](#Processing-S3-Logs)\n",
"* [Running Amazon Elastic MapReduce Jobs](#Running-Amazon-Elastic-MapReduce-Jobs)\n",
"* [Unit Testing S3 Logs](#Unit-Testing-S3-Logs)\n",
"* [Running S3 Logs Unit Test](#Running-S3-Logs-Unit-Test)\n",
"* [Sample Config File](#Sample-Config-File)"
]
},
{
@ -43,7 +43,7 @@
"\n",
"``python setup.py install``\n",
"\n",
"See \"Sample .mrjob.conf\" section for additional config details."
"See [Sample Config File](#Sample-Config-File) section for additional config details."
]
},
{
@ -207,14 +207,14 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"## Running Amazon Elastic MapReduce (EMR) Jobs"
"## Running Amazon Elastic MapReduce Jobs"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Run an Amazon EMR job on the given input (must be a flat file hierarchy), placing the results in the output (output directory must not exist):"
"Run an Amazon Elastic MapReduce (EMR) job on the given input (must be a flat file hierarchy), placing the results in the output (output directory must not exist):"
]
},
{
@ -388,7 +388,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"## Sample .mrjob.conf"
"## Sample Config File"
]
},
{
@ -417,15 +417,6 @@
" - sudo apt-get install -y python-pip\n",
" - sudo pip install --upgrade simplejson"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": true
},
"outputs": [],
"source": []
}
],
"metadata": {