jupyter-guide
diff --git a/‎.gitattributes‎
Lines changed: 1 addition & 0 deletions b/‎.gitattributes‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎.travis.yml‎
Lines changed: 8 additions & 0 deletions b/‎.travis.yml‎
Lines changed: 8 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 10 additions & 8 deletions b/‎README.md‎
Lines changed: 10 additions & 8 deletions
diff --git a/‎example1/0-Workflow.html‎
Lines changed: 8 additions & 8 deletions b/‎example1/0-Workflow.html‎
Lines changed: 8 additions & 8 deletions
diff --git a/‎example1/0-Workflow.ipynb‎
Lines changed: 8 additions & 8 deletions b/‎example1/0-Workflow.ipynb‎
Lines changed: 8 additions & 8 deletions
@@ -0,0 +1 @@
+example*/*.html linguist-vendored
@@ -0,0 +1,8 @@
+language: python
+python:
+  - "3.6"
+cache: pip
+install:
+  - pip install -r requirements.txt
+script:
+  - python tests.py
@@ -1,4 +1,6 @@
 # Ten Simple Rules for Reproducible Research in Jupyter Notebooks
+[![Build Status](https://api.travis-ci.com/jupyter-guide/ten-rules-jupyter.svg?branch=master)](https://www.travis-ci.org/jupyter-guide/ten-rules-jupyter)
+[![GitHub License](https://img.shields.io/github/license/jupyter-guide/ten-rules-jupyter.svg)](https://github.com/sbl-sdsc/mmtf-spark/blob/master/LICENSE)
 
 This repository is a supplement to the "Ten Simple Rules for Reproducible Research in Jupyter Notebook" paper [ref].
 
@@ -16,11 +18,11 @@ notebooks in your web browser using the Binder ([mybinder.org](https://mybinder.
 
 | Nbviewer | Jupyter Notebook | Jupyter Lab | HTML |
 | ---      | --               | ---         | ---  |
-| [0-Workflow.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/0-Workflow.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F0-Workflow.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F0-Workflow.ipynb) | [HTML](https://htmlpreview.github.io?https://github.com/jupyter-guide/ten-rules-jupyter/blob/master/example1/0-Workflow.html) |
-| [1-CreateDataset.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/1-CreateDataset.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F1-CreateDataset.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F1-CreateDataset.ipynb) | [HTML](https://htmlpreview.github.io?https://github.com/jupyter-guide/ten-rules-jupyter/blob/master/example1/1-CreateDataset.html) |
-| [2-CalculateFeatures.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/2-CalculateFeatures.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F2-CalculateFeatures.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F2-CalculateFeatures.ipynb) | [HTML](https://htmlpreview.github.io?https://github.com/jupyter-guide/ten-rules-jupyter/blob/master/example1/2-CalculateFeatures.html) |
-| [3-FitModel.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/3-FitModel.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F3-FitModel.ipynb) |[![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F3-FitModel.ipynb)  | [HTML](https://htmlpreview.github.io?https://github.com/jupyter-guide/ten-rules-jupyter/blob/master/example1/3-FitModel.html) |
-| [4-Predict.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/4-Predict.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F4-Predict.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F4-Predict.ipynb)| [HTML](https://htmlpreview.github.io?https://github.com/jupyter-guide/ten-rules-jupyter/blob/master/example1/4-Predict.html) |
+| [0-Workflow.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/0-Workflow.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F0-Workflow.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F0-Workflow.ipynb) | [HTML](https://cdn.rawgit.com/jupyter-guide/ten-rules-jupyter/dd3a89ad/example1/0-Workflow.html) |
+| [1-CreateDataset.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/1-CreateDataset.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F1-CreateDataset.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F1-CreateDataset.ipynb) | [HTML](https://cdn.rawgit.com/jupyter-guide/ten-rules-jupyter/dd3a89ad/example1/1-CreateDataset.html) |
+| [2-CalculateFeatures.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/2-CalculateFeatures.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F2-CalculateFeatures.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F2-CalculateFeatures.ipynb) | [HTML](https://cdn.rawgit.com/jupyter-guide/ten-rules-jupyter/dd3a89ad/example1/2-CalculateFeatures.html) |
+| [3-FitModel.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/3-FitModel.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F3-FitModel.ipynb) |[![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F3-FitModel.ipynb)  | [HTML](https://cdn.rawgit.com/jupyter-guide/ten-rules-jupyter/dd3a89ad/example1/3-FitModel.html) |
+| [4-Predict.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example1/4-Predict.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example1%2F4-Predict.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example1%2F4-Predict.ipynb)| [HTML](https://cdn.rawgit.com/jupyter-guide/ten-rules-jupyter/dd3a89ad/example1/4-Predict.html) |
 
 ---
 
@@ -34,6 +36,6 @@ This example demonstrates a reproducible 2-step workflow for simulating a phylog
 
 | Nbviewer | Jupyter Notebook | Jupyter Lab | HTML |
 | ---      | --               | ---         | ---  |
-| [0-Workflow.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example2/0-Workflow.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example2%2F0-Workflow.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example2%2F0-Workflow.ipynb) | [HTML](https://htmlpreview.github.io?https://github.com/jupyter-guide/ten-rules-jupyter/blob/master/example2/0-Workflow.html) |
-| [1-SimulateTree.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example2/1-SimulateTree.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example2%2F1-SimulateTree.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example2%2F1-SimulateTree.ipynb) | [HTML](https://htmlpreview.github.io?https://github.com/jupyter-guide/ten-rules-jupyter/blob/master/example2/1-SimulateTree.html) |
-| [2-SimulateSequences.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example2/2-SimulateSequences.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example2%2F2-SimulateSequences.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example2%2F2-SimulateSequences.ipynb) | [HTML](https://htmlpreview.github.io?https://github.com/jupyter-guide/ten-rules-jupyter/blob/master/example2/2-SimulateSequences.html) |
+| [0-Workflow.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example2/0-Workflow.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example2%2F0-Workflow.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example2%2F0-Workflow.ipynb) | [HTML](https://cdn.rawgit.com/jupyter-guide/ten-rules-jupyter/dd3a89ad/example2/0-Workflow.html) |
+| [1-SimulateTree.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example2/1-SimulateTree.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example2%2F1-SimulateTree.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example2%2F1-SimulateTree.ipynb) | [HTML](https://cdn.rawgit.com/jupyter-guide/ten-rules-jupyter/dd3a89ad/example2/1-SimulateTree.html) |
+| [2-SimulateSequences.ipynb](https://nbviewer.jupyter.org/github/jupyter-guide/ten-rules-jupyter/blob/master/example2/2-SimulateSequences.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?filepath=example2%2F2-SimulateSequences.ipynb) | [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jupyter-guide/ten-rules-jupyter/master?urlpath=lab/tree/example2%2F2-SimulateSequences.ipynb) | [HTML](https://cdn.rawgit.com/jupyter-guide/ten-rules-jupyter/dd3a89ad/example2/2-SimulateSequences.html) |
@@ -11783,12 +11783,12 @@ <h1 id="Predict-Fold-Type-of-a-Protein-from-Protein-Sequence">Predict Fold Type
 </div>
 <div class="inner_cell">
 <div class="text_cell_render border-box-sizing rendered_html">
-<p><strong>The notebooks in this directory demonstrate the "Ten Rules for Reproducible Research in Jupyter Notebooks". Throughout the notebooks we refer to some the rules we applied.</strong></p>
+<p><strong>The notebooks in this directory demonstrate and apply the "Ten Rules for Reproducible Research in Jupyter Notebooks". Throughout the notebooks we refer to some the rules we applied.</strong></p>
 <p><strong>For example, this notebook demonstrates:</strong></p>
 <hr>
-<p><strong>Rule 1: Tell a Story for an Audience.</strong> This notebook was developed for biologists to learn how to apply a simple machine learning model to protein sequences.</p>
+<p><strong>Rule 1: Tell a Story for an Audience.</strong> This notebook was developed to learn how to apply a simple machine learning model to predict protein features based on protein sequences.</p>
 <p><strong>Rule 3: Build a Pipeline.</strong> This notebook describes the entire workflow from data preparation, feature calculation, model fitting, to prediction. The modularity makes it easy to replace one of the steps, for example, use a different method to calculate features or apply a different machine learning model.</p>
-<p><strong>Rule 5: Use Cell, Section adn Notebook Divisions to Make Steps Clear.</strong> We broke the workflow into separate notebooks and use this top-level notebook to explain and orchestrate the workflow.</p>
+<p><strong>Rule 5: Use Cell, Section and Notebook Divisions to Make Steps Clear.</strong> We broke the workflow into separate notebooks and use this top-level notebook to explain and organize the workflow.</p>
 <hr>
 
 </div>
@@ -11806,7 +11806,7 @@ <h2 id="Introduction">Introduction<a class="anchor-link" href="#Introduction">&#
 </div>
 <div class="inner_cell">
 <div class="text_cell_render border-box-sizing rendered_html">
-<p>Protein chains fold in regular patterns. Secondary structure describes the geometry of segments of a protein chain. The most common secondary structure elements are</p>
+<p>Proteins have four different levels of structure – primary, secondary, tertiary and quaternary. Secondary structure describes the geometry of segments of a protein chain. The most common secondary structure elements are:</p>
 <ul>
 <li>Alpha helices</li>
 <li>Beta sheets</li>
@@ -11819,7 +11819,7 @@ <h2 id="Introduction">Introduction<a class="anchor-link" href="#Introduction">&#
 </div>
 <div class="inner_cell">
 <div class="text_cell_render border-box-sizing rendered_html">
-<p>We can classify proteins into three major fold classes based on their predominant secondary structure content</p>
+<p>We can classify proteins into three major fold classes based on their predominant secondary structure content:</p>
 <ul>
 <li>alpha: contains predominantly alpha helices</li>
 <li>beta: contains predominantly beta sheets</li>
@@ -11833,7 +11833,7 @@ <h2 id="Introduction">Introduction<a class="anchor-link" href="#Introduction">&#
 </div>
 <div class="inner_cell">
 <div class="text_cell_render border-box-sizing rendered_html">
-<h2 id="Goal">Goal<a class="anchor-link" href="#Goal">&#182;</a></h2><p>This notebook demonstrates how to create a reproducible record to create a machine learning model. We train a simple model to predict the fold class of a protein given its protein sequence using a representative set of 3D structures from the Protein Data Bank.</p>
+<h2 id="Goal">Goal<a class="anchor-link" href="#Goal">&#182;</a></h2><p>This notebook demonstrates how to create a reproducible record using a machine learning model. We train the model to predict the fold class of a protein given its amino acid sequence using a representative set of 3D structures from the Protein Data Bank.</p>
 <p><strong>Run the following notebooks and explore how we applied the Ten Simple Rules.</strong></p>
 
 </div>
@@ -11887,7 +11887,7 @@ <h2 id="2.-Calculate-Features">2. Calculate Features<a class="anchor-link" href=
 </div>
 <div class="inner_cell">
 <div class="text_cell_render border-box-sizing rendered_html">
-<p>Protein sequences cannot be directly used for machine learning. Here use the Word2vec method to calculate a fixed-sized feature vector for each protein sequence.</p>
+<p>Protein sequences cannot be directly used for machine learning. Here we use the Word2vec method to calculate a fixed-sized feature vector for each protein sequence.</p>
 <p>Run the following notebook to calculate feature vectors.</p>
 
 </div>
@@ -12044,7 +12044,7 @@ <h2 id="Version-and-Hardware-Information">Version and Hardware Information<a cla
 <div class="inner_cell">
 <div class="text_cell_render border-box-sizing rendered_html">
 <hr>
-<p><strong>Authors:</strong> Peter W. Rose, Shih-Cheng Huang, UC San Diego, October 1, 2018</p>
+<p><strong>Authors:</strong> <a href="mailto:pwrose.ucsd@gmail.com">Peter W. Rose</a>, Shih-Cheng Huang, UC San Diego, October 1, 2018</p>
 <hr>
 
 </div>
 
@@ -11,17 +11,17 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "**The notebooks in this directory demonstrate the \"Ten Rules for Reproducible Research in Jupyter Notebooks\". Throughout the notebooks we refer to some the rules we applied.**\n",
+    "**The notebooks in this directory demonstrate and apply the \"Ten Rules for Reproducible Research in Jupyter Notebooks\". Throughout the notebooks we refer to some the rules we applied.**\n",
     "\n",
     "**For example, this notebook demonstrates:**\n",
     "\n",
     "---\n",
     "\n",
-    "**Rule 1: Tell a Story for an Audience.** This notebook was developed for biologists to learn how to apply a simple machine learning model to protein sequences.\n",
+    "**Rule 1: Tell a Story for an Audience.** This notebook was developed to learn how to apply a simple machine learning model to predict protein features based on protein sequences.\n",
     "\n",
     "**Rule 3: Build a Pipeline.** This notebook describes the entire workflow from data preparation, feature calculation, model fitting, to prediction. The modularity makes it easy to replace one of the steps, for example, use a different method to calculate features or apply a different machine learning model.\n",
     "\n",
-    "**Rule 5: Use Cell, Section adn Notebook Divisions to Make Steps Clear.** We broke the workflow into separate notebooks and use this top-level notebook to explain and orchestrate the workflow.\n",
+    "**Rule 5: Use Cell, Section and Notebook Divisions to Make Steps Clear.** We broke the workflow into separate notebooks and use this top-level notebook to explain and organize the workflow.\n",
     "\n",
     "---"
    ]
@@ -37,7 +37,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Protein chains fold in regular patterns. Secondary structure describes the geometry of segments of a protein chain. The most common secondary structure elements are\n",
+    "Proteins have four different levels of structure – primary, secondary, tertiary and quaternary. Secondary structure describes the geometry of segments of a protein chain. The most common secondary structure elements are:\n",
     "* Alpha helices\n",
     "* Beta sheets"
    ]
@@ -46,7 +46,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "We can classify proteins into three major fold classes based on their predominant secondary structure content\n",
+    "We can classify proteins into three major fold classes based on their predominant secondary structure content:\n",
     "* alpha: contains predominantly alpha helices\n",
     "* beta: contains predominantly beta sheets\n",
     "* alpha+beta: contains alpha helices and beta sheets"
@@ -57,7 +57,7 @@
    "metadata": {},
    "source": [
     "## Goal\n",
-    "This notebook demonstrates how to create a reproducible record to create a machine learning model. We train a simple model to predict the fold class of a protein given its protein sequence using a representative set of 3D structures from the Protein Data Bank.\n",
+    "This notebook demonstrates how to create a reproducible record using a machine learning model. We train the model to predict the fold class of a protein given its amino acid sequence using a representative set of 3D structures from the Protein Data Bank.\n",
     "\n",
     "**Run the following notebooks and explore how we applied the Ten Simple Rules.**"
    ]
@@ -103,7 +103,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Protein sequences cannot be directly used for machine learning. Here use the Word2vec method to calculate a fixed-sized feature vector for each protein sequence.\n",
+    "Protein sequences cannot be directly used for machine learning. Here we use the Word2vec method to calculate a fixed-sized feature vector for each protein sequence.\n",
     "\n",
     "Run the following notebook to calculate feature vectors. "
    ]
@@ -230,7 +230,7 @@
    "source": [
     "---\n",
     "\n",
-    "**Authors:** Peter W. Rose, Shih-Cheng Huang, UC San Diego, October 1, 2018\n",
+    "**Authors:** [Peter W. Rose](mailto:pwrose.ucsd@gmail.com), Shih-Cheng Huang, UC San Diego, October 1, 2018\n",
     "\n",
     "---"
    ]