<h1 id="Create-Dataset">Create Dataset<a class="anchor-link" href="#Create-Dataset">¶</a></h1><p>This notebook extracts from the Protein Data Bank information about the secondary structure of proteins. The ultimate goal is to assign a fold classification for a set of representative proteins.</p>
11778
+
<h1 id="Create-Dataset">Create Dataset<a class="anchor-link" href="#Create-Dataset">¶</a></h1><p>This notebook extracts from the Protein Data Bank information about the secondary structure of proteins. The ultimate goal is to assign a fold classification from a protein sequence.</p>
11779
11779
11780
11780
</div>
11781
11781
</div>
@@ -12416,12 +12416,12 @@ <h2 id="Calculate-Secondary-Structure-Content">Calculate Secondary Structure Con
<h2 id="Classify-Sequences-by-Secondary-Structure-Content">Classify Sequences by Secondary Structure Content<a class="anchor-link" href="#Classify-Sequences-by-Secondary-Structure-Content">¶</a></h2><p>Next we classify each protein chain into one of four classes. We use a threshold of 25% to define a predominant class.</p>
12418
12418
<ul>
12419
-
<li>alpha: predominantly alpha (>=25%)</li>
12420
-
<li>beta: predominantly beta (>=25%)</li>
12421
-
<li>alpha+beta: significant alpha (>=25%) and beta (>=25%)</li>
12419
+
<li>alpha: predominantly alpha (>25%)</li>
12420
+
<li>beta: predominantly beta (>25%)</li>
12421
+
<li>alpha+beta: significant alpha (>25%) and beta (>25%)</li>
12422
12422
<li>other: cases that do not fit into the 3 classes above </li>
12423
12423
</ul>
12424
-
<p>Protein chains in the <em>other</em> class will be ignored in the subsequent analysis.</p>
12424
+
<p>Protein chains in the <strong>other</strong> class will be ignored in the subsequent analysis.</p>
12425
12425
12426
12426
</div>
12427
12427
</div>
@@ -12431,17 +12431,17 @@ <h2 id="Classify-Sequences-by-Secondary-Structure-Content">Classify Sequences by
example1/1-CreateDataset.ipynb
10 additions & 10 deletions
@@ -5,7 +5,7 @@
5
5
"metadata": {},
6
6
"source": [
7
7
"# Create Dataset\n",
8
-
"This notebook extracts from the Protein Data Bank information about the secondary structure of proteins. The ultimate goal is to assign a fold classification for a set of representative proteins."
8
+
"This notebook extracts from the Protein Data Bank information about the secondary structure of proteins. The ultimate goal is to assign a fold classification from a protein sequence."
9
9
]
10
10
},
11
11
{
@@ -617,12 +617,12 @@
617
617
"## Classify Sequences by Secondary Structure Content\n",
618
618
"Next we classify each protein chain into one of four classes. We use a threshold of 25% to define a predominant class.\n",
619
619
"\n",
620
-
"* alpha: predominantly alpha (>=25%)\n",
621
-
"* beta: predominantly beta (>=25%)\n",
622
-
"* alpha+beta: significant alpha (>=25%) and beta (>=25%)\n",
620
+
"* alpha: predominantly alpha (>25%)\n",
621
+
"* beta: predominantly beta (>25%)\n",
622
+
"* alpha+beta: significant alpha (>25%) and beta (>25%)\n",
623
623
"* other: cases that do not fit into the 3 classes above \n",
624
624
"\n",
625
-
"Protein chains in the *other* class will be ignored in the subsequent analysis."
625
+
"Protein chains in the **other** class will be ignored in the subsequent analysis."
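The classification rule as worded after this change (strictly greater than 25%) can be sketched in a few lines of Python. This is only an illustration, not the notebook's actual code: the function name `classify_chain`, the `threshold` parameter, and the sample chain values are hypothetical, and the alpha and beta contents are assumed to be fractions between 0 and 1. Because the bullets for alpha, beta, and alpha+beta overlap as written, the sketch assumes the combined case is checked first.

```python
def classify_chain(alpha: float, beta: float, threshold: float = 0.25) -> str:
    """Assign a chain to alpha, beta, alpha+beta, or other.

    Hypothetical helper: alpha and beta are assumed to be the fractions
    (0 to 1) of residues in helix and strand conformations.
    """
    has_alpha = alpha > threshold  # strict comparison, matching ">25%"
    has_beta = beta > threshold
    if has_alpha and has_beta:
        # Assumption: the combined class takes priority over the single classes.
        return "alpha+beta"
    if has_alpha:
        return "alpha"
    if has_beta:
        return "beta"
    return "other"


# Made-up example values: chain id -> (alpha fraction, beta fraction).
chains = {
    "A": (0.60, 0.10),
    "B": (0.10, 0.40),
    "C": (0.30, 0.30),
    "D": (0.25, 0.10),  # exactly 25% no longer qualifies after the change
}

labels = {cid: classify_chain(a, b) for cid, (a, b) in chains.items()}

# Chains in the "other" class are dropped from the subsequent analysis.
kept = {cid: label for cid, label in labels.items() if label != "other"}
print(kept)
```

With the earlier `>=25%` wording, chain `D` in this example would have been labelled alpha; under the strict comparison it falls into other and is dropped.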