Changelog
Clinical-Genomics-Lund/jasentool: Changelog
All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
[1.1.0]
Added
minority-reportsubcommand — computes minority base frequency distribution from a pre-computedsamtools mpileupfile (.mpileupor.mpileup.gz), with optional blacklist filteringcreate-blacklistsubcommand — runs mpileup and minority base distribution across a set of BAM files, then aggregates per-position frequencies to produce a minority variant blacklist TSV--plasmidfinderflag increate-yaml— emits PlasmidFinder result path under theplasmidfinderkey--plasmidfinder-genome-hitsflag increate-yaml— emits the path to the PlasmidFinderHit_in_genome_seq.fsaunder theplasmidfinder_genome_hitskey--plasmidfinder-plasmid-seqsflag increate-yaml— emits the path to the PlasmidFinderPlasmid_seqs.fsaunder theplasmidfinder_plasmid_seqskey--shigatyperflag increate-yaml— emits ShigaTyper result path under theshigatyperkey (#47)--samtools-statsand--samtools-bedcovflags increate-yaml— mutually exclusive with--postalnqc
Fixed
Changed
[1.0.0]
Added
Added
plot.pyfor all plotting functionsAdded plotting of
n_missing_locibarplotpost-align-qcsubcommand — computes post-alignment QC metrics from a BAM filecount-readssubcommand — counts reads from FASTQ file(s), outputs JSONAdded readthedocs
Added pytesting
Added
DockerfileAdded
download-ncbisubcommand — downloads genome FASTA and GFF from NCBI Datasets v2 APIAdded
download-bigsdbsubcommand — downloads cgMLST scheme alleles from PubMLST / BIGSdb Pasteur via OAuth1Added
concatenate-filessubcommand — merges multiple YAML files (e.g.versions.yml) into one output YAMLAdded
create-yamlsubcommand — creates a YAML input file for Bonsai upload from sample metadata and analysis result pathsAdded GA workflows for publishing to dockerhub and pypi
Added pytest for
download-ncbisubcommandAdded conda
environment.ymlfor installationAdded
annotate-dellysubcommand — annotates Delly structural-variant VCFs with gene symbols and locus tags from a tabix-indexed BED fileAdded pytests for
post-align-qcandcount-readssubcommands using simulated S. aureus fixture dataAdded S. aureus simulated FASTQ and BAM fixtures (
saureus_sim_R1.fastq.gz,saureus_sim_R2.fastq.gz,saureus_test_1.bam) for testing
Fixed
Fixed cronjob bugs
Fixed cronjob missing execution
Fixed parsing of jasen
.jsonfiles
Changed
Changed cronjob to use environmental variables
Changed default
--remote_hostnamemissing→identify-missingconvert→transform-file-formatfix→reformat-csvconverge→converge-cataloguesAll subcommand argument names changed from underscore style (
--db_name) to hyphen style (--db-name)Removed
insertsubcommandChange sambamba for pysam
Removed picard hs metrics and
--referenceargdownload_ncbi.py→ncbi.py(classDownloadNcbi→NCBI),download_bigsdb.py→bigsdb.py(classDownloadBigsdb→BIGSdb) — renamed to match module naming conventionRemoved
download-krakendbsubcommand; Kraken2 DB building is now documented as a Singularity workflow indocs/usage.mdUtils.download_and_save_fileupgraded with exponential-backoff retry logic (max_retriesparameter);Utils.unzipnow returns bool and handles corrupt archivescount-reads: now reads in binary chunks and counts newlines instead of iterating line-by-line in text mode to reduce overheadannotate-delly: VCF and TabixFile handles now explicitly closed; writer wrapped in try/finally; tabix index existence validated before opening BED file
[0.2.0]
Added
Added tb validation notebook
Added output csv for failed validation samples
Added differential matrix for validation
Added saureus validation notebook
Added
--generate_matrixargument to cliAdded
alter_sample_idfilterAdded summed differential matrix
Added mean coverage plot
Added jitter to mean coverage boxplot
Fixed
Fixed
pyproject.tomlsetuptools bugFixed cgmlst comparison and plots
Fixed
fixsubargFixed TN/TP/FN/FP plots
Changed
Changed id format for fix and missing
Changed matrix to boxplot, jitter, and annotate
[0.1.0] - 2024-05-22
Added
find— query samples from MongoDBinsert— insert JSON results into MongoDBvalidate— compare pipeline outputs against MongoDB recordsmissing— identify samples absent from JASEN results directoryconvert— convert cgMLST target TSV to BED formatfix— reformat BJORN CSV/SH files for JASENconverge— merge WHO, TBdb, and FoHM TB mutation cataloguesqc— compute post-alignment QC from BAMPylint GitHub Actions workflow
pyproject.tomlreplacingsetup.pysequencing_runandclarity_sample_idfields in outputs