2015-06-03 05:45:02 +08:00
{
" cells " : [
2015-06-18 04:36:55 +08:00
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
2016-07-31 20:11:18 +08:00
" This notebook was prepared by [Donne Martin](http://donnemartin.com). Source and license info is on [GitHub](https://github.com/donnemartin/interactive-coding-challenges). "
2015-07-04 07:56:11 +08:00
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
" # Solution Notebook "
2015-06-18 04:36:55 +08:00
]
} ,
2015-06-03 05:45:02 +08:00
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
" ## Problem: Implement a hash table with set, get, and remove methods. \n " ,
" \n " ,
2015-06-30 17:55:58 +08:00
" * [Constraints](#Constraints) \n " ,
2015-06-03 05:45:02 +08:00
" * [Test Cases](#Test-Cases) \n " ,
" * [Algorithm](#Algorithm) \n " ,
" * [Code](#Code) \n " ,
2015-06-25 06:21:25 +08:00
" * [Unit Test](#Unit-Test) "
2015-06-03 05:45:02 +08:00
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
2015-06-28 19:13:27 +08:00
" ## Constraints \n " ,
2015-06-03 05:45:02 +08:00
" \n " ,
2015-06-25 06:21:25 +08:00
" * For simplicity, are the keys integers only? \n " ,
" * Yes \n " ,
2016-10-17 09:28:31 +08:00
" * For collision resolution, can we use chaining? \n " ,
2015-06-25 06:21:25 +08:00
" * Yes \n " ,
" * Do we have to worry about load factors? \n " ,
2016-10-17 09:28:31 +08:00
" * No \n " ,
" * Do we have to validate inputs? \n " ,
" * No \n " ,
" * Can we assume this fits memory? \n " ,
" * Yes "
2015-06-03 05:45:02 +08:00
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
" ## Test Cases \n " ,
" \n " ,
2016-10-17 09:28:31 +08:00
" * `get` no matching key -> KeyError exception \n " ,
" * `get` matching key -> value \n " ,
2017-05-16 12:30:12 +08:00
" * `set` no matching key -> new key, value \n " ,
2016-10-17 09:28:31 +08:00
" * `set` matching key -> update value \n " ,
" * `remove` no matching key -> KeyError exception \n " ,
" * `remove` matching key -> remove key, value "
2015-06-03 05:45:02 +08:00
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
" ## Algorithm \n " ,
" \n " ,
" ### Hash Function \n " ,
" \n " ,
" * Return key % table size \n " ,
" \n " ,
" Complexity: \n " ,
" * Time: O(1) \n " ,
" * Space: O(1) \n " ,
" \n " ,
" ### Set \n " ,
" \n " ,
" * Get hash index for lookup \n " ,
" * If key exists, replace \n " ,
" * Else, add \n " ,
" \n " ,
" Complexity: \n " ,
" * Time: O(1) average and best, O(n) worst \n " ,
" * Space: O(1) space for newly added element \n " ,
" \n " ,
" ### Get \n " ,
" \n " ,
" * Get hash index for lookup \n " ,
" * If key exists, return value \n " ,
2016-10-17 09:28:31 +08:00
" * Else, raise KeyError \n " ,
2015-06-03 05:45:02 +08:00
" \n " ,
" Complexity: \n " ,
" * Time: O(1) average and best, O(n) worst \n " ,
" * Space: O(1) \n " ,
" \n " ,
" ### Remove \n " ,
" \n " ,
" * Get hash index for lookup \n " ,
" * If key exists, delete the item \n " ,
2016-10-17 09:28:31 +08:00
" * Else, raise KeyError \n " ,
2015-06-03 05:45:02 +08:00
" \n " ,
" Complexity: \n " ,
" * Time: O(1) average and best, O(n) worst \n " ,
" * Space: O(1) "
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
" ## Code "
]
} ,
{
" cell_type " : " code " ,
2015-06-25 06:21:25 +08:00
" execution_count " : 1 ,
2020-07-04 22:56:49 +08:00
" metadata " : { } ,
2015-06-03 05:45:02 +08:00
" outputs " : [ ] ,
" source " : [
" class Item(object): \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-03 05:45:02 +08:00
" def __init__(self, key, value): \n " ,
" self.key = key \n " ,
" self.value = value \n " ,
" \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-03 05:45:02 +08:00
" class HashTable(object): \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-03 05:45:02 +08:00
" def __init__(self, size): \n " ,
" self.size = size \n " ,
2015-07-06 18:15:53 +08:00
" self.table = [[] for _ in range(self.size)] \n " ,
2015-06-03 05:45:02 +08:00
" \n " ,
2016-11-30 20:16:44 +08:00
" def _hash_function(self, key): \n " ,
2015-06-03 05:45:02 +08:00
" return key % s elf.size \n " ,
" \n " ,
" def set(self, key, value): \n " ,
2016-11-30 20:16:44 +08:00
" hash_index = self._hash_function(key) \n " ,
2015-06-03 05:45:02 +08:00
" for item in self.table[hash_index]: \n " ,
" if item.key == key: \n " ,
" item.value = value \n " ,
" return \n " ,
" self.table[hash_index].append(Item(key, value)) \n " ,
" \n " ,
" def get(self, key): \n " ,
2016-11-30 20:16:44 +08:00
" hash_index = self._hash_function(key) \n " ,
2015-06-03 05:45:02 +08:00
" for item in self.table[hash_index]: \n " ,
" if item.key == key: \n " ,
" return item.value \n " ,
2016-10-17 09:28:31 +08:00
" raise KeyError( ' Key not found ' ) \n " ,
2015-06-03 05:45:02 +08:00
" \n " ,
" def remove(self, key): \n " ,
2016-11-30 20:16:44 +08:00
" hash_index = self._hash_function(key) \n " ,
2016-10-17 09:28:31 +08:00
" for index, item in enumerate(self.table[hash_index]): \n " ,
2015-06-03 05:45:02 +08:00
" if item.key == key: \n " ,
2016-10-17 09:28:31 +08:00
" del self.table[hash_index][index] \n " ,
" return \n " ,
" raise KeyError( ' Key not found ' ) "
2015-06-03 05:45:02 +08:00
]
} ,
2015-06-25 06:21:25 +08:00
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
" ## Unit Test "
]
} ,
2015-06-03 05:45:02 +08:00
{
" cell_type " : " code " ,
2015-06-25 06:21:25 +08:00
" execution_count " : 2 ,
2020-07-04 22:56:49 +08:00
" metadata " : { } ,
2015-06-25 06:21:25 +08:00
" outputs " : [
{
" name " : " stdout " ,
" output_type " : " stream " ,
" text " : [
2015-07-04 07:56:11 +08:00
" Overwriting test_hash_map.py \n "
2015-06-25 06:21:25 +08:00
]
}
] ,
2015-06-03 05:45:02 +08:00
" source " : [
2015-07-04 07:56:11 +08:00
" %% writefile test_hash_map.py \n " ,
2020-07-04 22:56:49 +08:00
" import unittest \n " ,
2015-06-25 06:21:25 +08:00
" \n " ,
2015-07-04 07:56:11 +08:00
" \n " ,
2020-07-04 22:56:49 +08:00
" class TestHashMap(unittest.TestCase): \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-25 06:21:25 +08:00
" # TODO: It would be better if we had unit tests for each \n " ,
2015-06-27 18:30:21 +08:00
" # method in addition to the following end-to-end test \n " ,
2015-06-25 06:21:25 +08:00
" def test_end_to_end(self): \n " ,
" hash_table = HashTable(10) \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-25 06:21:25 +08:00
" print( \" Test: get on an empty hash table index \" ) \n " ,
2020-07-04 22:56:49 +08:00
" self.assertRaises(KeyError, hash_table.get, 0) \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-25 06:21:25 +08:00
" print( \" Test: set on an empty hash table index \" ) \n " ,
" hash_table.set(0, ' foo ' ) \n " ,
2020-07-04 22:56:49 +08:00
" self.assertEqual(hash_table.get(0), ' foo ' ) \n " ,
2015-06-25 06:21:25 +08:00
" hash_table.set(1, ' bar ' ) \n " ,
2020-07-04 22:56:49 +08:00
" self.assertEqual(hash_table.get(1), ' bar ' ) \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-25 06:21:25 +08:00
" print( \" Test: set on a non empty hash table index \" ) \n " ,
" hash_table.set(10, ' foo2 ' ) \n " ,
2020-07-04 22:56:49 +08:00
" self.assertEqual(hash_table.get(0), ' foo ' ) \n " ,
" self.assertEqual(hash_table.get(10), ' foo2 ' ) \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-25 06:21:25 +08:00
" print( \" Test: set on a key that already exists \" ) \n " ,
" hash_table.set(10, ' foo3 ' ) \n " ,
2020-07-04 22:56:49 +08:00
" self.assertEqual(hash_table.get(0), ' foo ' ) \n " ,
" self.assertEqual(hash_table.get(10), ' foo3 ' ) \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-25 06:21:25 +08:00
" print( \" Test: remove on a key that already exists \" ) \n " ,
" hash_table.remove(10) \n " ,
2020-07-04 22:56:49 +08:00
" self.assertEqual(hash_table.get(0), ' foo ' ) \n " ,
" self.assertRaises(KeyError, hash_table.get, 10) \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-25 06:21:25 +08:00
" print( \" Test: remove on a key that doesn ' t exist \" ) \n " ,
2020-07-04 22:56:49 +08:00
" self.assertRaises(KeyError, hash_table.remove, -1) \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-25 06:21:25 +08:00
" print( ' Success: test_end_to_end ' ) \n " ,
" \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-07-04 07:56:11 +08:00
" def main(): \n " ,
" test = TestHashMap() \n " ,
" test.test_end_to_end() \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
" \n " ,
2015-06-25 06:21:25 +08:00
" if __name__ == ' __main__ ' : \n " ,
2015-07-04 07:56:11 +08:00
" main() "
]
} ,
{
" cell_type " : " code " ,
" execution_count " : 3 ,
2020-07-04 22:56:49 +08:00
" metadata " : { } ,
2015-07-04 07:56:11 +08:00
" outputs " : [
{
" name " : " stdout " ,
" output_type " : " stream " ,
" text " : [
" Test: get on an empty hash table index \n " ,
" Test: set on an empty hash table index \n " ,
" Test: set on a non empty hash table index \n " ,
" Test: set on a key that already exists \n " ,
" Test: remove on a key that already exists \n " ,
" Test: remove on a key that doesn ' t exist \n " ,
" Success: test_end_to_end \n "
]
}
] ,
" source " : [
" run -i test_hash_map.py "
2015-06-03 05:45:02 +08:00
]
2020-07-04 22:56:49 +08:00
} ,
{
" cell_type " : " code " ,
" execution_count " : null ,
" metadata " : { } ,
" outputs " : [ ] ,
" source " : [ ]
2015-06-03 05:45:02 +08:00
}
] ,
" metadata " : {
" kernelspec " : {
2016-06-13 11:16:06 +08:00
" display_name " : " Python 3 " ,
2015-06-03 05:45:02 +08:00
" language " : " python " ,
2016-06-13 11:16:06 +08:00
" name " : " python3 "
2015-06-03 05:45:02 +08:00
} ,
" language_info " : {
" codemirror_mode " : {
" name " : " ipython " ,
2016-06-13 11:16:06 +08:00
" version " : 3
2015-06-03 05:45:02 +08:00
} ,
" file_extension " : " .py " ,
" mimetype " : " text/x-python " ,
" name " : " python " ,
" nbconvert_exporter " : " python " ,
2016-06-13 11:16:06 +08:00
" pygments_lexer " : " ipython3 " ,
2020-07-04 22:56:49 +08:00
" version " : " 3.7.2 "
2015-06-03 05:45:02 +08:00
}
} ,
" nbformat " : 4 ,
2020-07-04 22:56:49 +08:00
" nbformat_minor " : 1
2015-06-03 05:45:02 +08:00
}