mirror of https://github.com/donnemartin/data-science-ipython-notebooks.git (synced 2024-03-22 13:30:56 +08:00)
Added section links to index. Some cleanup.
parent 0463617683
commit a8197d8cc4
@@ -4,15 +4,15 @@
 "cell_type": "markdown",
 "metadata": {},
 "source": [
-"# Hadoop MapReduce: Python Streaming with mrjob\n",
+"# Python Hadoop MapReduce: Analyzing AWS S3 Bucket Logs with mrjob\n",
 "\n",
-"* Introduction\n",
-"* Setup\n",
-"* Processing S3 Logs\n",
-"* Running Amazon Elastic MapReduce (EMR) Jobs\n",
-"* Unit Testing S3 Logs\n",
-"* Running S3 Logs Unit Test\n",
-"* Sample .mrjob.conf"
+"* [Introduction](#Introduction)\n",
+"* [Setup](#Setup)\n",
+"* [Processing S3 Logs](#Processing-S3-Logs)\n",
+"* [Running Amazon Elastic MapReduce Jobs](#Running-Amazon-Elastic-MapReduce-Jobs)\n",
+"* [Unit Testing S3 Logs](#Unit-Testing-S3-Logs)\n",
+"* [Running S3 Logs Unit Test](#Running-S3-Logs-Unit-Test)\n",
+"* [Sample Config File](#Sample-Config-File)"
 ]
 },
 {
@@ -43,7 +43,7 @@
 "\n",
 "``python setup.py install``\n",
 "\n",
-"See \"Sample .mrjob.conf\" section for additional config details."
+"See [Sample Config File](#Sample-Config-File) section for additional config details."
 ]
 },
 {
@@ -207,14 +207,14 @@
 "cell_type": "markdown",
 "metadata": {},
 "source": [
-"## Running Amazon Elastic MapReduce (EMR) Jobs"
+"## Running Amazon Elastic MapReduce Jobs"
 ]
 },
 {
 "cell_type": "markdown",
 "metadata": {},
 "source": [
-"Run an Amazon EMR job on the given input (must be a flat file hierarchy), placing the results in the output (output directory must not exist):"
+"Run an Amazon Elastic MapReduce (EMR) job on the given input (must be a flat file hierarchy), placing the results in the output (output directory must not exist):"
 ]
 },
 {
@@ -388,7 +388,7 @@
 "cell_type": "markdown",
 "metadata": {},
 "source": [
-"## Sample .mrjob.conf"
+"## Sample Config File"
 ]
 },
 {
@@ -417,15 +417,6 @@
 " - sudo apt-get install -y python-pip\n",
 " - sudo pip install --upgrade simplejson"
 ]
-},
-{
-"cell_type": "code",
-"execution_count": null,
-"metadata": {
-"collapsed": true
-},
-"outputs": [],
-"source": []
 }
 ],
 "metadata": {