Documentation for BioDemultiplexer.
Compare each sequence in the fastqR1 file with the sequences in bcdf, and classify the sequences of the specified file based on that comparison.
Divides a single FASTQ file for parallel processing.
Divides a pair of FASTQ files into smaller parts for parallel processing. It calculates the number of reads per worker and uses the split command to divide the files.
Orchestrates the entire demultiplexing process for FASTQ files. Handles the preprocessing, dividing, demultiplexing, and merging of files.
Calculate and compare the similarity of a given sequence seq with the sequences in the given DataFrame bc_df.
(max_score_bc, delta), where
max_score_bc is the index of the best matching sequence in
delta is the difference between the highest and second-highest scores.
Preprocesses the barcode file by modifying sequences based on specific criteria.
Fast version of semiglobal_alignment function.
This function aligns
ref strings, using semiglobal alignment algorithm.
A similarity score as a float, where higher values indicate better alignment.(0<=similarity_score<=1)