2015-05-04 17:54:50 +08:00
{
" cells " : [
2015-06-18 04:36:55 +08:00
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
2016-07-31 20:11:18 +08:00
" This notebook was prepared by [Donne Martin](http://donnemartin.com). Source and license info is on [GitHub](https://github.com/donnemartin/interactive-coding-challenges). "
2015-07-04 07:55:56 +08:00
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
" # Solution Notebook "
2015-06-18 04:36:55 +08:00
]
} ,
2015-05-04 17:54:50 +08:00
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
2016-02-09 19:52:32 +08:00
" ## Problem: Compress a string such that ' AAABCCDDDD ' becomes ' A3BC2D4 ' . Only compress the string if it saves space. \n " ,
2015-05-04 17:54:50 +08:00
" \n " ,
2015-06-30 17:55:58 +08:00
" * [Constraints](#Constraints) \n " ,
2015-05-04 17:54:50 +08:00
" * [Test Cases](#Test-Cases) \n " ,
2015-07-06 18:30:00 +08:00
" * [Algorithm](#Algorithm) \n " ,
" * [Code](#Code) \n " ,
2015-06-25 06:20:20 +08:00
" * [Unit Test](#Unit-Test) "
2015-05-04 17:54:50 +08:00
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
2015-06-28 19:13:27 +08:00
" ## Constraints \n " ,
2015-05-04 17:54:50 +08:00
" \n " ,
2015-07-04 07:55:56 +08:00
" * Can we assume the string is ASCII? \n " ,
2015-06-25 06:20:20 +08:00
" * Yes \n " ,
" * Note: Unicode strings could require special handling depending on your language \n " ,
2016-06-05 01:40:15 +08:00
" * Is this case sensitive? \n " ,
" * Yes \n " ,
2016-06-13 11:06:12 +08:00
" * Can we use additional data structures? \n " ,
2015-05-04 17:54:50 +08:00
" * Yes \n " ,
2016-06-05 01:40:15 +08:00
" * Can we assume this fits in memory? \n " ,
2015-07-08 05:15:18 +08:00
" * Yes "
2015-05-04 17:54:50 +08:00
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
" ## Test Cases \n " ,
" \n " ,
2015-07-04 07:55:56 +08:00
" * None -> None \n " ,
2015-05-04 17:54:50 +08:00
" * ' ' -> ' ' \n " ,
2015-07-04 07:55:56 +08:00
" * ' AABBCC ' -> ' AABBCC ' \n " ,
2016-02-09 19:52:32 +08:00
" * ' AAABCCDDDD ' -> ' A3BC2D4 ' "
2015-05-04 17:54:50 +08:00
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
2015-07-06 18:30:00 +08:00
" ## Algorithm \n " ,
2015-05-04 17:54:50 +08:00
" \n " ,
2016-02-09 19:49:29 +08:00
" * For each char in string \n " ,
" * If char is the same as last_char, increment count \n " ,
" * Else \n " ,
" * Append last_char and count to compressed_string \n " ,
" * last_char = char \n " ,
" * count = 1 \n " ,
" * Append last_char and count to compressed_string \n " ,
" * If the compressed string size is < string size \n " ,
" * Return compressed string \n " ,
" * Else \n " ,
" * Return string \n " ,
2015-05-04 17:54:50 +08:00
" \n " ,
" Complexity: \n " ,
" * Time: O(n) \n " ,
2016-02-09 19:49:29 +08:00
" * Space: O(n) \n " ,
" \n " ,
" Complexity Note: \n " ,
" * Although strings are immutable in Python, appending to strings is optimized in CPython so that it now runs in O(n) and extends the string in-place. Refer to this [Stack Overflow post](http://stackoverflow.com/a/4435752). "
2015-05-04 17:54:50 +08:00
]
} ,
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
2015-07-06 18:30:00 +08:00
" ## Code "
2015-05-04 17:54:50 +08:00
]
} ,
{
" cell_type " : " code " ,
2015-06-25 06:20:20 +08:00
" execution_count " : 1 ,
2020-07-04 22:56:49 +08:00
" metadata " : { } ,
2015-05-04 17:54:50 +08:00
" outputs " : [ ] ,
" source " : [
2016-08-13 18:37:48 +08:00
" class CompressString(object): \n " ,
" \n " ,
" def compress(self, string): \n " ,
2016-10-12 19:28:12 +08:00
" if string is None or not string: \n " ,
2016-08-13 18:37:48 +08:00
" return string \n " ,
" result = ' ' \n " ,
" prev_char = string[0] \n " ,
" count = 0 \n " ,
" for char in string: \n " ,
" if char == prev_char: \n " ,
" count += 1 \n " ,
" else: \n " ,
2016-10-12 19:28:12 +08:00
" result += self._calc_partial_result(prev_char, count) \n " ,
2016-08-13 18:37:48 +08:00
" prev_char = char \n " ,
" count = 1 \n " ,
2016-10-12 19:28:12 +08:00
" result += self._calc_partial_result(prev_char, count) \n " ,
" return result if len(result) < len(string) else string \n " ,
" \n " ,
" def _calc_partial_result(self, prev_char, count): \n " ,
" return prev_char + (str(count) if count > 1 else ' ' ) "
2015-05-04 17:54:50 +08:00
]
} ,
2015-06-26 05:36:09 +08:00
{
" cell_type " : " markdown " ,
" metadata " : { } ,
" source " : [
2015-07-04 07:55:56 +08:00
" ## Unit Test "
2015-06-26 05:36:09 +08:00
]
} ,
2015-06-25 06:20:20 +08:00
{
" cell_type " : " code " ,
2015-07-06 18:30:00 +08:00
" execution_count " : 2 ,
2020-07-04 22:56:49 +08:00
" metadata " : { } ,
2015-06-25 06:20:20 +08:00
" outputs " : [
{
" name " : " stdout " ,
" output_type " : " stream " ,
" text " : [
2015-07-04 07:55:56 +08:00
" Overwriting test_compress.py \n "
2015-06-25 06:20:20 +08:00
]
}
] ,
" source " : [
2015-07-04 07:55:56 +08:00
" %% writefile test_compress.py \n " ,
2020-07-04 22:56:49 +08:00
" import unittest \n " ,
2015-05-04 17:54:50 +08:00
" \n " ,
2015-07-04 07:55:56 +08:00
" \n " ,
2020-07-04 22:56:49 +08:00
" class TestCompress(unittest.TestCase): \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-06-25 06:20:20 +08:00
" def test_compress(self, func): \n " ,
2020-07-04 22:56:49 +08:00
" self.assertEqual(func(None), None) \n " ,
" self.assertEqual(func( ' ' ), ' ' ) \n " ,
" self.assertEqual(func( ' AABBCC ' ), ' AABBCC ' ) \n " ,
" self.assertEqual(func( ' AAABCCDDDDE ' ), ' A3BC2D4E ' ) \n " ,
" self.assertEqual(func( ' BAAACCDDDD ' ), ' BA3C2D4 ' ) \n " ,
" self.assertEqual(func( ' AAABAACCDDDD ' ), ' A3BA2C2D4 ' ) \n " ,
2015-06-25 06:20:20 +08:00
" print( ' Success: test_compress ' ) \n " ,
" \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-07-04 07:55:56 +08:00
" def main(): \n " ,
" test = TestCompress() \n " ,
2016-08-13 18:37:48 +08:00
" compress_string = CompressString() \n " ,
" test.test_compress(compress_string.compress) \n " ,
2015-07-04 07:55:56 +08:00
" \n " ,
2015-07-12 03:34:14 +08:00
" \n " ,
2015-07-04 07:55:56 +08:00
" if __name__ == ' __main__ ' : \n " ,
" main() "
]
} ,
{
" cell_type " : " code " ,
2015-07-06 18:30:00 +08:00
" execution_count " : 3 ,
2020-07-04 22:56:49 +08:00
" metadata " : { } ,
2015-07-04 07:55:56 +08:00
" outputs " : [
{
" name " : " stdout " ,
" output_type " : " stream " ,
" text " : [
" Success: test_compress \n "
]
}
] ,
" source " : [
" %r un -i test_compress.py "
2015-05-04 17:54:50 +08:00
]
}
] ,
" metadata " : {
" kernelspec " : {
2016-02-09 19:49:29 +08:00
" display_name " : " Python 3 " ,
2015-05-04 17:54:50 +08:00
" language " : " python " ,
2016-02-09 19:49:29 +08:00
" name " : " python3 "
2015-05-04 17:54:50 +08:00
} ,
" language_info " : {
" codemirror_mode " : {
" name " : " ipython " ,
2016-02-09 19:49:29 +08:00
" version " : 3
2015-05-04 17:54:50 +08:00
} ,
" file_extension " : " .py " ,
" mimetype " : " text/x-python " ,
" name " : " python " ,
" nbconvert_exporter " : " python " ,
2016-02-09 19:49:29 +08:00
" pygments_lexer " : " ipython3 " ,
2020-07-04 22:56:49 +08:00
" version " : " 3.7.2 "
2015-05-04 17:54:50 +08:00
}
} ,
" nbformat " : 4 ,
2020-07-04 22:56:49 +08:00
" nbformat_minor " : 1
2015-05-04 17:54:50 +08:00
}