diff --git a/commands/aws.ipynb b/commands/aws.ipynb
index f1a03d9..a0be60d 100644
--- a/commands/aws.ipynb
+++ b/commands/aws.ipynb
@@ -1,7 +1,7 @@
 {
  "metadata": {
   "name": "",
-  "signature": "sha256:f75051e8768199fb18b1e41e2b9e76d496309c12f65bffda915754795767ecb3"
+  "signature": "sha256:a0e8d0242f04dd2d399759e132a0d586ef06b84f9af2e89105e2835e56d40b2a"
  },
  "nbformat": 3,
  "nbformat_minor": 0,
@@ -60,7 +60,9 @@
      "cell_type": "markdown",
      "metadata": {},
      "source": [
-      "## S3DistCp"
+      "## S3DistCp\n",
+      "\n",
+      "[S3DistCp](http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/UsingEMR_s3distcp.html) is an extension of DistCp that is optimized to work with Amazon S3. S3DistCp is useful for aggregating small files: it takes a pattern and a target file and combines the smaller input files into larger ones. S3DistCp can also be used to transfer large volumes of data from S3 to your Hadoop cluster."
      ]
     },
     {