dhtslib.sam.md

Module to deal with SAM records' MD auxillary tag.

This tag is a string encoding mismatched and deleted reference bases, used in conjunction with CIGAR and SEQ fields to reconstruct the bases of the reference sequence interval to which the alignment has been mapped. This can enable variant calling without requiring access to the entire original reference.

"For example, a string 10A5^AC6 means from the leftmost reference base in the alignment, there are 10 matches followed by an A on the reference which is different from the aligned read base; the next 5 reference bases are matches followed by a 2bp deletion from the reference; the deleted sequence is AC; the last 6 bases are matches."

Reference: https://samtools.github.io/hts-specs/SAMtags.pdf

Members

Functions

getMDPairs
auto getMDPairs(SAMRecord rec)

(?) For SAM record rec, return ForwardRange over read's MD tag data

Structs

MDItr
struct MDItr

(?) Iterator yielding mismatched base in query; or '=' if equal to reference

MDPair
struct MDPair
Undocumented in source.

Variables

MDREGEX
auto MDREGEX;

regex to extract MD string groups ex: "11A3^G" -> [(11, "A"), (3, "^G")]

Meta