Skip to content

Commit a28ab52

Browse files
committed
updated documentation
1 parent 411e36c commit a28ab52

File tree

4 files changed

+24
-24
lines changed

4 files changed

+24
-24
lines changed

example1/1-CreateDataset.html

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -11775,7 +11775,7 @@
1177511775
</div>
1177611776
<div class="inner_cell">
1177711777
<div class="text_cell_render border-box-sizing rendered_html">
11778-
<h1 id="Create-Dataset">Create Dataset<a class="anchor-link" href="#Create-Dataset">&#182;</a></h1><p>This notebook extracts from the Protein Data Bank information about the secondary structure of proteins. The ultimate goal is to assign a fold classification for a set of representative proteins.</p>
11778+
<h1 id="Create-Dataset">Create Dataset<a class="anchor-link" href="#Create-Dataset">&#182;</a></h1><p>This notebook extracts from the Protein Data Bank information about the secondary structure of proteins. The ultimate goal is to assign a fold classification from a protein sequence.</p>
1177911779

1178011780
</div>
1178111781
</div>
@@ -12416,12 +12416,12 @@ <h2 id="Calculate-Secondary-Structure-Content">Calculate Secondary Structure Con
1241612416
<div class="text_cell_render border-box-sizing rendered_html">
1241712417
<h2 id="Classify-Sequences-by-Secondary-Structure-Content">Classify Sequences by Secondary Structure Content<a class="anchor-link" href="#Classify-Sequences-by-Secondary-Structure-Content">&#182;</a></h2><p>Next we classify each protein chain into one of four classes. We use a threshold of 25% to define a predominant class.</p>
1241812418
<ul>
12419-
<li>alpha: predominantly alpha (&gt;=25%)</li>
12420-
<li>beta: predominantly beta (&gt;=25%)</li>
12421-
<li>alpha+beta: significant alpha (&gt;=25%) and beta (&gt;=25%)</li>
12419+
<li>alpha: predominantly alpha (&gt; 25%)</li>
12420+
<li>beta: predominantly beta (&gt; 25%)</li>
12421+
<li>alpha+beta: significant alpha (&gt; 25%) and beta (&gt; 25%)</li>
1242212422
<li>other: cases that do not fit into the 3 classes above </li>
1242312423
</ul>
12424-
<p>Protein chains in the <em>other</em> class will be ignored in the subsequent analysis.</p>
12424+
<p>Protein chains in the <strong>other</strong> class will be ignored in the subsequent analysis.</p>
1242512425

1242612426
</div>
1242712427
</div>
@@ -12431,17 +12431,17 @@ <h2 id="Classify-Sequences-by-Secondary-Structure-Content">Classify Sequences by
1243112431
<div class="prompt input_prompt">In&nbsp;[7]:</div>
1243212432
<div class="inner_cell">
1243312433
<div class="input_area">
12434-
<div class=" highlight hl-ipython3"><pre><span></span><span class="k">def</span> <span class="nf">protein_fold_class</span><span class="p">(</span><span class="n">data</span><span class="p">,</span> <span class="n">minThreshold</span><span class="p">,</span> <span class="n">maxThreshold</span><span class="p">):</span>
12434+
<div class=" highlight hl-ipython3"><pre><span></span><span class="k">def</span> <span class="nf">protein_fold_class</span><span class="p">(</span><span class="n">data</span><span class="p">,</span> <span class="n">min_threshold</span><span class="p">,</span> <span class="n">max_threshold</span><span class="p">):</span>
1243512435
<span class="sd">&quot;&quot;&quot;</span>
1243612436
<span class="sd"> Returns fold classification:</span>
1243712437
<span class="sd"> &quot;alpha&quot;, &quot;beta&quot;, &quot;alpha+beta&quot;, and &quot;other&quot; based upon the </span>
1243812438
<span class="sd"> fraction of alpha and beta.</span>
1243912439
<span class="sd"> &quot;&quot;&quot;</span>
12440-
<span class="k">if</span> <span class="n">data</span><span class="o">.</span><span class="n">alpha</span> <span class="o">&gt;</span> <span class="n">maxThreshold</span> <span class="ow">and</span> <span class="n">data</span><span class="o">.</span><span class="n">beta</span> <span class="o">&lt;</span> <span class="n">minThreshold</span><span class="p">:</span>
12440+
<span class="k">if</span> <span class="n">data</span><span class="o">.</span><span class="n">alpha</span> <span class="o">&gt;</span> <span class="n">max_threshold</span> <span class="ow">and</span> <span class="n">data</span><span class="o">.</span><span class="n">beta</span> <span class="o">&lt;</span> <span class="n">min_threshold</span><span class="p">:</span>
1244112441
<span class="k">return</span> <span class="s2">&quot;alpha&quot;</span>
12442-
<span class="k">elif</span> <span class="n">data</span><span class="o">.</span><span class="n">beta</span> <span class="o">&gt;</span> <span class="n">maxThreshold</span> <span class="ow">and</span> <span class="n">data</span><span class="o">.</span><span class="n">alpha</span> <span class="o">&lt;</span> <span class="n">minThreshold</span><span class="p">:</span>
12442+
<span class="k">elif</span> <span class="n">data</span><span class="o">.</span><span class="n">beta</span> <span class="o">&gt;</span> <span class="n">max_threshold</span> <span class="ow">and</span> <span class="n">data</span><span class="o">.</span><span class="n">alpha</span> <span class="o">&lt;</span> <span class="n">min_threshold</span><span class="p">:</span>
1244312443
<span class="k">return</span> <span class="s2">&quot;beta&quot;</span>
12444-
<span class="k">elif</span> <span class="n">data</span><span class="o">.</span><span class="n">alpha</span> <span class="o">&gt;</span> <span class="n">maxThreshold</span> <span class="ow">and</span> <span class="n">data</span><span class="o">.</span><span class="n">beta</span> <span class="o">&gt;</span> <span class="n">maxThreshold</span><span class="p">:</span>
12444+
<span class="k">elif</span> <span class="n">data</span><span class="o">.</span><span class="n">alpha</span> <span class="o">&gt;</span> <span class="n">max_threshold</span> <span class="ow">and</span> <span class="n">data</span><span class="o">.</span><span class="n">beta</span> <span class="o">&gt;</span> <span class="n">max_threshold</span><span class="p">:</span>
1244512445
<span class="k">return</span> <span class="s2">&quot;alpha+beta&quot;</span>
1244612446
<span class="k">else</span><span class="p">:</span>
1244712447
<span class="k">return</span> <span class="s2">&quot;other&quot;</span>
@@ -12458,7 +12458,7 @@ <h2 id="Classify-Sequences-by-Secondary-Structure-Content">Classify Sequences by
1245812458
<div class="inner_cell">
1245912459
<div class="input_area">
1246012460
<div class=" highlight hl-ipython3"><pre><span></span><span class="c1"># assign protein fold class</span>
12461-
<span class="n">df</span><span class="p">[</span><span class="n">value_col</span><span class="p">]</span> <span class="o">=</span> <span class="n">df</span><span class="o">.</span><span class="n">apply</span><span class="p">(</span><span class="n">protein_fold_class</span><span class="p">,</span> <span class="n">minThreshold</span><span class="o">=</span><span class="mf">0.05</span><span class="p">,</span> <span class="n">maxThreshold</span><span class="o">=</span><span class="mf">0.25</span><span class="p">,</span> <span class="n">axis</span><span class="o">=</span><span class="mi">1</span><span class="p">)</span>
12461+
<span class="n">df</span><span class="p">[</span><span class="n">value_col</span><span class="p">]</span> <span class="o">=</span> <span class="n">df</span><span class="o">.</span><span class="n">apply</span><span class="p">(</span><span class="n">protein_fold_class</span><span class="p">,</span> <span class="n">min_threshold</span><span class="o">=</span><span class="mf">0.05</span><span class="p">,</span> <span class="n">max_threshold</span><span class="o">=</span><span class="mf">0.25</span><span class="p">,</span> <span class="n">axis</span><span class="o">=</span><span class="mi">1</span><span class="p">)</span>
1246212462

1246312463
<span class="c1"># exclude protein chains without a dominant classification from further analysis.</span>
1246412464
<span class="n">df</span> <span class="o">=</span> <span class="n">df</span><span class="p">[</span><span class="n">df</span><span class="p">[</span><span class="n">value_col</span><span class="p">]</span> <span class="o">!=</span> <span class="s1">&#39;other&#39;</span><span class="p">]</span>

example1/1-CreateDataset.ipynb

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@
55
"metadata": {},
66
"source": [
77
"# Create Dataset\n",
8-
"This notebook extracts from the Protein Data Bank information about the secondary structure of proteins. The ultimate goal is to assign a fold classification for a set of representative proteins."
8+
"This notebook extracts from the Protein Data Bank information about the secondary structure of proteins. The ultimate goal is to assign a fold classification from a protein sequence."
99
]
1010
},
1111
{
@@ -617,12 +617,12 @@
617617
"## Classify Sequences by Secondary Structure Content\n",
618618
"Next we classify each protein chain into one of four classes. We use a threshold of 25% to define a predominant class.\n",
619619
"\n",
620-
"* alpha: predominantly alpha (>=25%)\n",
621-
"* beta: predominantly beta (>=25%)\n",
622-
"* alpha+beta: significant alpha (>=25%) and beta (>=25%)\n",
620+
"* alpha: predominantly alpha (> 25%)\n",
621+
"* beta: predominantly beta (> 25%)\n",
622+
"* alpha+beta: significant alpha (> 25%) and beta (> 25%)\n",
623623
"* other: cases that do not fit into the 3 classes above \n",
624624
"\n",
625-
"Protein chains in the *other* class will be ignored in the subsequent analysis."
625+
"Protein chains in the **other** class will be ignored in the subsequent analysis."
626626
]
627627
},
628628
{
@@ -631,17 +631,17 @@
631631
"metadata": {},
632632
"outputs": [],
633633
"source": [
634-
"def protein_fold_class(data, minThreshold, maxThreshold):\n",
634+
"def protein_fold_class(data, min_threshold, max_threshold):\n",
635635
" \"\"\"\n",
636636
" Returns fold classification:\n",
637637
" \"alpha\", \"beta\", \"alpha+beta\", and \"other\" based upon the \n",
638638
" fraction of alpha and beta.\n",
639639
" \"\"\"\n",
640-
" if data.alpha > maxThreshold and data.beta < minThreshold: \n",
640+
" if data.alpha > max_threshold and data.beta < min_threshold: \n",
641641
" return \"alpha\" \n",
642-
" elif data.beta > maxThreshold and data.alpha < minThreshold: \n",
642+
" elif data.beta > max_threshold and data.alpha < min_threshold: \n",
643643
" return \"beta\" \n",
644-
" elif data.alpha > maxThreshold and data.beta > maxThreshold: \n",
644+
" elif data.alpha > max_threshold and data.beta > max_threshold: \n",
645645
" return \"alpha+beta\" \n",
646646
" else: \n",
647647
" return \"other\"\n"
@@ -804,7 +804,7 @@
804804
],
805805
"source": [
806806
"# assign protein fold class\n",
807-
"df[value_col] = df.apply(protein_fold_class, minThreshold=0.05, maxThreshold=0.25, axis=1)\n",
807+
"df[value_col] = df.apply(protein_fold_class, min_threshold=0.05, max_threshold=0.25, axis=1)\n",
808808
"\n",
809809
"# exclude protein chains without a dominant classification from further analysis.\n",
810810
"df = df[df[value_col] != 'other']\n",

example1/3-FitModel.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -12067,10 +12067,10 @@ <h2 id="Select-a-Classification-Method-(default-SVM)">Select a Classification Me
1206712067

1206812068

1206912069

12070-
<div id="4f72f8e2-c8e8-4175-be53-0a1f3cf27e75"></div>
12070+
<div id="32da9708-9524-4db9-bd1f-de45152cc9e5"></div>
1207112071
<div class="output_subarea output_widget_view ">
1207212072
<script type="text/javascript">
12073-
var element = $('#4f72f8e2-c8e8-4175-be53-0a1f3cf27e75');
12073+
var element = $('#32da9708-9524-4db9-bd1f-de45152cc9e5');
1207412074
</script>
1207512075
<script type="application/vnd.jupyter.widget-view+json">
1207612076
{"model_id": "31791fc7b90c40088bc8e281f4434ed4", "version_major": 2, "version_minor": 0}

example1/4-Predict.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -11855,10 +11855,10 @@ <h2 id="Enter-a-Protein-Sequence-in-Text-Box">Enter a Protein Sequence in Text B
1185511855

1185611856

1185711857

11858-
<div id="229d132e-5bde-46bd-9981-6990339e47ff"></div>
11858+
<div id="065d7337-9dad-445e-b0e4-25a6a0c63588"></div>
1185911859
<div class="output_subarea output_widget_view ">
1186011860
<script type="text/javascript">
11861-
var element = $('#229d132e-5bde-46bd-9981-6990339e47ff');
11861+
var element = $('#065d7337-9dad-445e-b0e4-25a6a0c63588');
1186211862
</script>
1186311863
<script type="application/vnd.jupyter.widget-view+json">
1186411864
{"model_id": "450d2400f1134413be6083bf5021d15e", "version_major": 2, "version_minor": 0}

0 commit comments

Comments
 (0)