{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "This notebook was prepared by [Donne Martin](http://donnemartin.com). Source and license info is on [GitHub](https://bit.ly/code-notes)." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Problem: Implement merge sort.\n", "\n", "* [Constraints and Assumptions](#Constraints-and-Assumptions)\n", "* [Test Cases](#Test-Cases)\n", "* [Algorithm](#Algorithm)\n", "* [Code](#Code)\n", "* [Unit Test](#Unit-Test)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Constraints and Assumptions\n", "\n", "*Problem statements are often intentionally ambiguous. Identifying constraints and stating assumptions can help to ensure you code the intended solution.*\n", "\n", "* Are you looking for a naiive solution?\n", " * Yes" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Test Cases\n", "\n", "* Empty input\n", "* One element\n", "* Two or more elements\n", "* Left and right subarrays of different lengths" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Algorithm\n", "\n", "Wikipedia's animation:\n", "![alt text](http://upload.wikimedia.org/wikipedia/commons/c/cc/Merge-sort-example-300px.gif)\n", "\n", "* Recursively split array into left and right halves\n", "* Merge split arrays\n", " * Using two pointers, one for each half starting at index 0\n", " * Add the smaller element to the result array\n", " * Inrement pointer where smaller element exists\n", " * Copy remaining elements to the result array\n", " * Return result array\n", "\n", "Complexity:\n", "* Time: O(n log(n))\n", "* Space: O(n), stable" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Code" ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "collapsed": false }, "outputs": [], "source": [ "def merge(left, right):\n", " l = 0\n", " r = 0\n", " result = []\n", " while l < len(left) and r < len(right):\n", " if left[l] < right[r]:\n", " result.append(left[l])\n", " l += 1\n", " else:\n", " result.append(right[r])\n", " r += 1\n", " while l < len(left):\n", " result.append(left[l])\n", " l += 1\n", " while r < len(right):\n", " result.append(right[r])\n", " r += 1\n", " return result\n", "\n", "def merge_sort(data):\n", " if len(data) < 2:\n", " return data\n", " mid = len(data) / 2\n", " left = data[0:mid]\n", " right = data[mid:len(data)]\n", " left = merge_sort(left)\n", " right = merge_sort(right)\n", " return merge(left, right)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Unit Test\n", "\n", "*It is important to identify and run through general and edge cases from the [Test Cases](#Test-Cases) section by hand. You generally will not be asked to write a unit test like what is shown below.*" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Empty input\n", "One element\n", "Two or more elements\n", "Success: test_merge_sort\n" ] } ], "source": [ "from nose.tools import assert_equal\n", "\n", "class Test(object):\n", " def test_merge_sort(self):\n", " print('Empty input')\n", " data = []\n", " merge_sort(data)\n", " assert_equal(data, [])\n", "\n", " print('One element')\n", " data = [5]\n", " merge_sort(data)\n", " assert_equal(data, [5])\n", "\n", " print('Two or more elements')\n", " data = [5, 1, 7, 2, 6, -3, 5, 7, -1]\n", " data = merge_sort(data)\n", " assert_equal(data, sorted(data))\n", " \n", " print('Success: test_merge_sort')\n", "\n", "if __name__ == '__main__':\n", " test = Test()\n", " test.test_merge_sort()" ] } ], "metadata": { "kernelspec": { "display_name": "Python 2", "language": "python", "name": "python2" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 2 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython2", "version": "2.7.10" } }, "nbformat": 4, "nbformat_minor": 0 }