align module¶
Should take in a sam file from a aligner like bwa aln or bwa mem and convert it into a
-
mavis.align.
SUPPORTED_ALIGNER
= MavisNamespace(BLAT='blat', BWA_MEM='bwa mem', __name__='~mavis.align.SUPPORTED_ALIGNER', _defns={}, _types={'BWA_MEM': <class 'str'>, 'BLAT': <class 'str'>})¶ MavisNamespace
– supported aligners
-
class
mavis.align.
SplitAlignment
(*pos, **kwargs)[source]¶ Bases:
mavis.breakpoint.BreakpointPair
-
query_consumption
()[source]¶ fraction of the query sequence which is aligned (everything not soft-clipped) in either alignment
-
-
mavis.align.
align_sequences
(sequences, input_bam_cache, reference_genome, aligner, aligner_reference, aligner_output_file='aligner_out.temp', aligner_fa_input_file='aligner_in.fa', aligner_output_log='aligner_out.log', blat_limit_top_aln=25, blat_min_identity=0.7, clean_files=True, log=<function devnull>, **kwargs)[source]¶ calls the alignment tool and parses the return output for a set of sequences
-
mavis.align.
call_paired_read_event
(read1, read2)[source]¶ For a given pair of reads call all applicable events. Assume there is a major event from both reads and then call indels from the individual reads
-
mavis.align.
call_read_events
(read, secondary_read=None)[source]¶ Given a read, return breakpoint pairs representing all putative events
-
mavis.align.
convert_to_duplication
(bpp, reference_genome)[source]¶ Given a breakpoint call, tests if the untemplated sequences matches the preceding reference sequence. If it does this is annotated as a duplication and the new breakpoint pair is returned. If not, then the original breakpoint pair is returned