Commit 3ea377d
- Rename mpileup --edlib to --indels-cns
The main benefit of the new indel caller is that it aligns against a
generated diploid consensus instead of simply patching the
reference with candidate indel types.
This greatly reduces false positives and incorrect allele alignment
(leading to wrong genotype calls). This was the earlier PR #1679, which
has since acquired edlib as an alternative to BAQ for the indel
alignment. However, edlib is primarily a speed benefit (with some
minor removal of false negatives due to quality smashing), and
isn't the main thing users should think about when choosing an indel
caller.
Also tidied up the usage statement and added an explicit "-X list" to
print the profile parameters.
- Add extra debugging defines.
GLF_DEBUG reports GLF calculation in bam2bcf.c.
ALIGN_DEBUG uses edlib EDLIB_TASK_PATH to report sequence alignment.
NB to use this you need to link against edlib itself rather than the
cutdown version in this repository.
Also fix the edlib heuristics used in bcf_call_glfgen. We don't want
to change the call (b=5) as this affects AD. Instead we change the
quality so poor calls get filtered by QUAL rather than simply being
removed.
- Tweak edlib tuning for SeqQ/qual.
Add quality value assessment into soft-clip recovery.
Use /500 instead of /111 in indelQ assignment, and skew indel-bias
accordingly. This gives better separation of FP/GT/FN generally.
- Add --seqq-offset parameter so it can be tuned per profile.
It is used as a limit on the seqQ reduction in the
"VAL-5*MIN(20,depth)" formula, which favours data over seqQ
scores when depth is sufficient. Experimentation showed no single
value that worked for all platforms, but the default is in the
middle.
- Tidy up to cull ifdefed and commented out code.
- Add test for indels-cns.
It's minimal, but indel calling as a whole has minimal testing. I
think we have under 10 indels in total in develop (all short read
and mostly duplications of each other), and no testing of indels-2.0.
This tests 4 indels with indels-cns.
- Added documentation for the new --indels-2.0 options
- Cull more unused parts of edlib.c.
This avoids clang warnings (which become errors with -Werror).
We're only including the bits we need here for mpileup. If you want
the whole thing, link against the upstream source library instead.

1 parent 0e45073 commit 3ea377d
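
The "VAL-5*MIN(20,depth)" term quoted in the --seqq-offset note can be evaluated as in the minimal C sketch below. It is an illustration only, not code from bam2bcf.c: the function name seqq_term is hypothetical, treating VAL as the --seqq-offset value is an assumption, and the commit message does not say how the result is then applied to seqQ (for example whether it is clamped).

```c
/* Hypothetical helper, not taken from bcftools: evaluates the
 * "VAL - 5*MIN(20,depth)" term quoted in the commit message.
 * Assumption: val stands for the --seqq-offset value. */
#ifndef MIN
#define MIN(a, b) ((a) < (b) ? (a) : (b))
#endif

int seqq_term(int val, int depth)
{
    /* The depth contribution saturates at 20 reads, so for any
     * depth >= 20 the term equals val - 100. */
    return val - 5 * MIN(20, depth);
}
```

With an illustrative val of 120 (chosen for the example, not stated in the commit message), depths of 5, 20 and 100 give 95, 20 and 20: the term shrinks as depth grows and stops shrinking once depth reaches 20.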
File tree: doc, test, mpileup
8 files changed, +655 -1201 lines changed