BioDemultiplexer

Documentation for BioDemultiplexer.

BioDemultiplexer.classify_sequences - Function

Compares each sequence in the fastqR1 file with the sequences in bcdf, and classifies the sequences of the specified file according to the result of that comparison.

BioDemultiplexer.divide_fastq - Method

Divides a pair of FASTQ files into smaller parts for parallel processing. It calculates the number of reads per worker and uses the split command to divide the files.
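The chunk-size arithmetic described above can be sketched as follows. This is a minimal illustration, not the package's implementation: the helper names lines_per_chunk and split_command, the file names, and the output prefixes are all hypothetical. It assumes standard four-line FASTQ records and a POSIX split that accepts -l for lines per output file.

```julia
# Hypothetical helper: number of lines each chunk should hold so that
# `total_reads` reads are spread over `n_workers` workers.
# A FASTQ record spans 4 lines, so the chunk size must be a multiple of 4.
function lines_per_chunk(total_reads::Integer, n_workers::Integer)
    n_workers > 0 || throw(ArgumentError("n_workers must be positive"))
    reads_per_worker = cld(total_reads, n_workers)  # ceiling division
    return 4 * reads_per_worker
end

# Hypothetical helper: build (but do not run) a `split` command for one file.
# `-l` sets the number of lines per output file; `prefix` names the outputs.
function split_command(path::AbstractString, prefix::AbstractString, nlines::Integer)
    return `split -l $nlines $path $prefix`
end

n = lines_per_chunk(1_000_003, 8)
cmd_r1 = split_command("reads_R1.fastq", "R1_part_", n)
cmd_r2 = split_command("reads_R2.fastq", "R2_part_", n)
# Passing cmd_r1 and cmd_r2 to run(...) would perform the actual split.
```

Using the same line count for both files keeps the R1 and R2 chunks aligned, so the k-th part of each file holds the same read pairs.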

BioDemultiplexer.find_best_matching_bc - Method

Calculates the similarity between a given sequence seq and each sequence in the given DataFrame bc_df, and identifies the best match.

Returns

A tuple (max_score_bc, delta), where max_score_bc is the index of the best matching sequence in bc_df, and delta is the difference between the highest and second-highest scores.
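The shape of this return value can be illustrated with a small sketch. Everything here is an assumption made for illustration: best_match and toy_similarity are hypothetical names, a plain vector of strings stands in for bc_df, and the similarity is a simple position-wise match fraction rather than the alignment-based score the package computes.

```julia
# Stand-in similarity: fraction of matching positions over the longer length.
# This is NOT the package's scoring; it only makes the example self-contained.
function toy_similarity(a::AbstractString, b::AbstractString)
    n = max(length(a), length(b))
    n == 0 && return 1.0
    matches = count(((x, y),) -> x == y, zip(a, b))
    return matches / n
end

# Hypothetical sketch: return (index of the best candidate, best score minus runner-up).
function best_match(seq::AbstractString, candidates::Vector{String})
    isempty(candidates) && throw(ArgumentError("no candidate sequences given"))
    scores = [toy_similarity(seq, c) for c in candidates]
    best = argmax(scores)
    # Runner-up: highest score among the other candidates (0.0 if there are none).
    second = maximum((s for (i, s) in enumerate(scores) if i != best); init = 0.0)
    return best, scores[best] - second
end

candidates = ["ACGTAC", "TTGCAA", "ACGTTT"]
idx, delta = best_match("ACGTAC", candidates)
```

A large delta means the best candidate is well separated from the runner-up, while a delta near zero signals an ambiguous assignment that a caller may prefer to reject.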

BioDemultiplexer.semiglobal_alignment - Method

Aligns the query and ref strings using a semiglobal alignment algorithm.

Returns

A similarity score as a float between 0 and 1 inclusive (0 <= similarity_score <= 1), where higher values indicate better alignment.
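For readers unfamiliar with the technique, the following is a minimal sketch of one common form of semiglobal alignment, in which the whole query must be aligned but unaligned ref characters before and after it are free. The function name semiglobal_similarity, the scoring values (match +1, mismatch -1, gap -1), and the normalization by the query length are assumptions chosen so that the result falls in [0, 1]; the package's actual parameters and normalization may differ.

```julia
# Hypothetical sketch of a semiglobal alignment score normalized to [0, 1].
function semiglobal_similarity(query::AbstractString, ref::AbstractString;
                               match_score = 1.0, mismatch_score = -1.0,
                               gap_score = -1.0)
    q, r = collect(query), collect(ref)
    m, n = length(q), length(r)
    m == 0 && return 0.0
    # H[i + 1, j + 1] = best score aligning q[1:i] so that it ends at r[j].
    # Row 1 stays at zero: skipping a prefix of ref costs nothing.
    H = zeros(Float64, m + 1, n + 1)
    for i in 1:m
        H[i + 1, 1] = i * gap_score  # query characters placed before ref starts
    end
    for i in 1:m, j in 1:n
        s = q[i] == r[j] ? match_score : mismatch_score
        H[i + 1, j + 1] = max(H[i, j] + s,              # match or mismatch
                              H[i, j + 1] + gap_score,  # gap in ref
                              H[i + 1, j] + gap_score)  # gap in query
    end
    # Taking the maximum over the last row makes the unaligned ref suffix free.
    best = maximum(H[m + 1, :])
    # A perfect alignment scores match_score * m; negative totals clamp to 0.
    return clamp(best / (match_score * m), 0.0, 1.0)
end

exact = semiglobal_similarity("ACGT", "TTACGTTT")
one_mismatch = semiglobal_similarity("ACGT", "ACCT")
```

Under these assumed scores, a query found verbatim anywhere inside ref reaches the maximum similarity regardless of the flanking ref characters, which is the property that distinguishes semiglobal from global alignment.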