Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Go to file
2015-04-03 06:06:46 -04:00
aws Added sample mrjob mapper and reducer to parse logs on s3 following the standard bucket logging format. 2015-04-03 06:06:46 -04:00
commands Prefixed various misc commands with ! so they can be executed within IPython Notebook. 2015-03-13 08:05:56 -04:00
data Reduced confusion matrix image, it was too wide and forced a horizontal scroll bar on nbviewer. 2015-03-25 07:56:39 -04:00
kaggle Transformed Embarked to dummy variables instead of integer representations. The latter implies ordering, which isn't the case with Embarked. 2015-04-02 23:29:33 -04:00
matplotlib Added matplotlib snippet skeleton package 2015-01-23 19:14:05 -05:00
numpy Added numpy snippet skeleton package 2015-01-23 19:14:27 -05:00
pandas Removed incomplete snippet for pandas idxmax causing an exception. 2015-02-28 18:37:44 -05:00
python-core Added image source used for slice snippets. 2015-02-28 18:34:17 -05:00
scikit-learn Added scikit-learn skeleton. Marked which projects are under active development and which are coming soon. Removed redundant info. 2015-01-26 09:15:12 -05:00
scipy Added scipy snippet skeleton package 2015-01-23 19:14:49 -05:00
spark Added Spark accumulators snippets. 2015-03-31 21:41:21 -04:00
__init__.py Added __init__.py files 2015-01-23 16:08:32 -05:00
.gitignore Added repo scratch dir to ignore 2015-01-26 09:07:25 -05:00
LICENSE Added LICENSE 2015-01-23 16:07:56 -05:00
README.md Reordered README sections. 2015-03-17 16:21:33 -04:00

alt text

ipython-data-notebooks

Continually updated IPython Data Science Notebooks geared towards processing big data (AWS, Spark, Hadoop, Linux command line, Python, NumPy, pandas, matplotlib, SciPy, scikit-learn, Kaggle).

kaggle

IPython Notebooks used in kaggle competitions.

aws

IPython Notebooks demonstrating Amazon Web Services functionality.

spark

IPython Notebooks demonstrating spark and HDFS functionality.

python-core

IPython Notebooks demonstrating core Python functionality geared towards data analysis.

pandas

IPython Notebooks demonstrating pandas functionality.

commands

IPython Notebooks demonstrating various command lines for Linux, Git, etc.

matplotlib

[Coming Soon] IPython Notebooks demonstrating matplotlib functionality.

scikit-learn

[Coming Soon] IPython Notebooks demonstrating scikit-learn functionality.

scipy

[Coming Soon] IPython Notebooks demonstrating SciPy functionality.

numpy

[Coming Soon] IPython Notebooks demonstrating NumPy functionality.

References

License

Copyright 2014 Donne Martin

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

   http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.