thoraxe.utils package
Submodules
thoraxe.utils.folders module
folders: Util functions to create output folder structure.
- thoraxe.utils.folders.create_subfolder(folder, subfolder)
Return path to the subfolder. It creates the folder if it doesn’t exist.
thoraxe.utils.species module
species: Util functions to get the species list.
- thoraxe.utils.species.check_species_name(species_name)
Return True if the species_name has the correct format.
The expected name is binomial. Trinomial names aren’t used but they don’t raise an error. If the species_name doesn’t conform the expected format an error is raised.
>>> check_species_name('homo_sapiens') True >>> check_species_name('colobus_angolensis_palliatus') False >>> check_species_name('cricetulus_griseus_chok1gshd') False
- thoraxe.utils.species.get_species_list(specieslist)
Return a list of species names (str).
>>> get_species_list('') >>> get_species_list('homo_sapiens,mus_musculus') ['homo_sapiens', 'mus_musculus']
thoraxe.utils.split module
split: Util functions to split s-exons.
- thoraxe.utils.split.split_exons(exons)
Return a list of the s-exons in the string.
>>> split_exons('3_0/3_1') ['3_0', '3_1'] >>> split_exons('4_0') ['4_0']
- thoraxe.utils.split.split_lengths(lengths, seq)
Return a list of the s-exon lengths, len(seq) if there is not data.
>>> import numpy as np >>> split_lengths('4/2', 'MLGHAC') [4, 2] >>> split_lengths('6', 'MLGHAC') [6] >>> split_lengths('', 'MLGHAC') [6] >>> split_lengths(np.nan, 'MLGHAC') [6]
- thoraxe.utils.split.split_seqs(sequences, seq)
Return a list with the s-exon sequences.
>>> split_seqs('MLGH/AC', 'MLGHAC') ['MLGH', 'AC'] >>> split_seqs('MLGHAC', 'MLGHAC') ['MLGHAC'] >>> split_seqs('', 'MLGHAC') ['MLGHAC']
thoraxe.utils.windows module
Windows: Functions useful for testing and Windows support.
- thoraxe.utils.windows.is_windows()
Return True in Windows.
Module contents
utils: A collection of functions to use in different places of thoraxe.