All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
- #628, #722 -
ASCATnow use.gcfile - #712, #718 - Added possibilities to run Sarek with
conda - #719 - Annotation documentation
- #719 - Helper script to download
snpeffandVEPcache files - #719 - New
--annotation_cache,--snpEff_cache,--vep_cacheparameters - #719 - Possibility to use cache wen annotating with
snpEffandVEP - #722 - Add path to ASCAT
.gcfile inigenomes.config - #728 - Update
Sarek-datasubmodule with multiple patients TSV file - #732 - Add
cadd_WG_SNVs,cadd_WG_SNVs_tbi,cadd_InDels,cadd_InDels_tbiandcadd_cacheparams - #732 - Add tabix indexed cache for VEP
- #732 - New
DownloadCADDprocess to download CADD files - #732 - Specify values for
cadd_WG_SNVs,cadd_WG_SNVs_tbi,cadd_InDels,cadd_InDels_tbiandcadd_cacheparams inmunin.conffile - #732 - Use
cadd_cacheparam for optional use of CADD VEP plugin inannotate.nf - #732 - VEP cache has now fasta files for
--HGVS - #735 - Added
--exomefor Manta, and for StrelkaBP - #735 - Added Travis CI test for targeted
- #710 - Improve release checklist and script
- #711 - Improve configuration priorities
- #716 - Update paths to containers and iGenomes
- #717 -
checkFileExtensionhas changed tohasExtension, and now only verify if file has extension - #717 -
fastqFilesrenamed toinputFiles - #717 -
mappingstep can now map BAM files too - #717 -
MapReadscan now convert BAM to FASTQ and feed it to BWA on the fly - #717, #732 - Update documentation
- #719 -
snpeffandvepcontainers are now built with conda - #719 -
vepCacheVersionis now defined inconf/genomes.configorconf/igenomes.config - #722 - Add path to ASCAT
.gcfile inigenomes.config - #722 - Update
Sarek-datasubmodule - #723, #725 - Update docs
- #724 - Improved AwsBatch configuration
- #728 - Improved usage of
targetBEDparams - #728 - Strelka Best Practices output is now prefixed with
StrelkaBP_ - #728 - VCFs and Annotated VCFs are now ordered by Patient, then tools
- #732 - Merge
buildContainers.nfandbuildReferences.nfinbuild.nf - #732 - Reduce number of CPUs for
RunVEPto4cf: VEP docs - #732 - Update VEP from
95.1to95.2
- #715 - Remove
defReferencesFilesfunction frombuildReferences.nf - #719 -
snpEffbase container is no longer used - #721 - Remove COSMIC docs
- #728 - Remove
defineDirectoryMap() - #732 - Removed
--databaseoption for VEP cf: VEP docs
- #720 - bamQC is now run on the recalibrated bams, and not after MarkDuplicates
- #726 - Fix Ascat ref file input (one file can't be a set)
- #727 - bamQC outputs are no longer overwritten (name of dir is now the file instead of sample)
- #728 - Fix issue with annotation that was consuming
cachechannels - #728 - Fix multi sample TSV file #691
- #733 - Fix the possibility to specify reference files on the command line
- #671 - New
publishDirModeparam and docs - #673, #675, #676 - Profiles for BinAC and CFC clusters in Tübingen
- #679 - Add container for
CreateIntervalBeds - #692, #697 - Add AWS iGenomes possibilities (within
conf/igenomes.conf) - #694 - Add monochrome and grey logos for light or dark background
- #698 - Add btb profile for munin server
- #702 - Add font-ttf-dejavu-sans-mono
2.37and fontconfig2.12.6to container
- #663 - Update
do_release.shscript - #671 - publishDir modes are now params
- #677, #698, #703 - Update docs
- #678 - Changing VEP to v92 and adjusting CPUs for VEP
- #679 - Update old awsbatch configuration
- #682 - Specifications for memory and cpus for awsbatch
- #693 - Qualimap bamQC is now ran after mapping and after recalibration for better QC
- #700 - Update GATK to
4.0.9.0 - #702 - Update FastQC to
0.11.8 - #705 - Change
--TMP_DIRby--tmp-dirfor GATK4.0.9.0BaseRecalibrator - #706 - Update TravisCI testing
- #665 - Input bam file now has always the same name (whether it is from a single fastq pair or multiple) in the MarkDuplicates process, so metrics too
- #672 - process
PullSingularityContainersfrombuildContainers.nfnow expect a file with the correct.simgextension for singularity images, and no longer the.imgone. - #679 - Add publishDirMode for
germlineVC.nf - #700 - Fix #699 missing DP in the FORMAT column VCFs for MuTect2
- #702 - Fix #701
- #705 - Fix #704
- #613 - Add Issue Templates (bug report and feature request)
- #614 - Add PR Template
- #615 - Add presentation
- #616 - Update documentation
- #620 - Add
tmp/to.gitignore - #625 - Add
pathfindras a submodule - #635 - To process targeted sequencing with a target BED
- #639 - Add a complete example analysis to docs
- #640, #642 - Add helper script for changing version number
- #608 - Update Nextflow required version
- #615 - Use
splitCsvinstead ofreadlines - #616 - Update CHANGELOG
- #621, #638 - Improve install script
- #621, #638 - Simplify tests
- #627, #629, #637 - Refactor docs
- #629 - Refactor config
- #632 - Use 2 threads and 2 cpus FastQC processes
- #637 - Update tool version gathering
- #638 - Use correct
.simgextension for Singularity images - #639 - Smaller refactoring of the docs
- #640 - Update RELEASE_CHECKLIST
- #642 - MultiQC 1.5 -> 1.6
- #642 - Qualimap 2.2.2a -> 2.2.2b
- #642 - Update conda channel order priorities
- #642 - VCFanno 0.2.8 -> 0.3.0
- #642 - VCFtools 0.1.15 -> 0.1.16
- #555 -
snpEffoutput intoVEP - #556 -
StrelkaBest Practices - #563 - Use
SnpEFFreports inMultiQC - #568 -
VCFToolsprocessRunVcftoolsfor QC - #574, #580 - Abstracts for NPMI, JOBIM and EACR25
- #577 - New repository for testing: Sarek-data
- #595 - New library
QCfor functionsbamQC,bcftools,samtoolsStats,vcftools,getVersionBCFtools,getVersionGATK,getVersionManta,getVersionSnpEFF,getVersionStrelka,getVersionVCFtools,getVersionVEP - #595 - New Processes
GetVersionBCFtools,GetVersionGATK,GetVersionManta,GetVersionSnpEFF,GetVersionStrelka,GetVersionVCFtools,GetVersionVEP - #595 - new Python script
bin/scrape_tool_versions.pyinspired by @ewels and @apeltzer - #595 - New QC Process
RunVcftools - #596 - New profile for BinAC cluster
- #597 - New function
sarek_ascii()inSarekUtils - #599, #602 - New Process
CompressVCF - #601, #603 - Container for GATK4
- #606 - Add test data as a submodule from
Sarek-data - #608 - Add documentation on how to install Nextflow on
bianca
- #557, #583, #585, #588 - Update help
- #560 - GitHub langage for the repository is now
Nextflow - #561 -
do_all.shbuild only containers for one genome reference (defaultGRCh38) only - #571 - Only one container for all QC tools
- #582, #587 - Update figures
- #595 - Function
defineDirectoryMap()is now part ofSarekUtils - #595 - Process
GenerateMultiQCconfigreplace by functioncreateMultiQCconfig() - #597 -
extractBams()now takes an extra parameter. - #597 - Move
checkFileExtension(),checkParameterExistence(),checkParameterList(),checkReferenceMap(),checkRefExistence(),extractBams(),extractGenders(),returnFile(),returnStatus()andreturnTSV()functions toSarekUtils - #597 - Reduce data footprint for Process
CreateRecalibrationTable - #597 - Replace depreciated operator
phasebyjoin. - #599 - Merge is tested with
ANNOTATEALL - #604 - Synching
GRCh38wgs_calling_regionsbedfiles - #607 - One container approach
- #607 - Update to GATK4
- #608 - Update Nextflow required version
- #616 - Update CHANGELOG
- #617 - Replace depreciated $name syntax with withName
- #560 - Display message for
repositoryandcontainerPath - #566 -
slurmDownloadprofile - #579, #584 -
Mantaoutput reorganized after modification forStrelka Best Practicesprocess - #585 - Trace file is plain txt
- #590, #593 - Fix Singularity installation in Travis CI testing
- #598, #601 - Fixes for Python script
selectROI.pyto work with CLC viewer
- #607 - Remove Mutect1
- basic wrapper script
- Abstract, posters and figures
- ROI selector and FreeBayes sanitizer scripts
- New logo and icon for the project
- check for existing tumor/normal channel
SarekUtilswithcheckParams(),checkParameterList(),checkParameterExistence()andisAllowedParams()functions- some
runOptionsfordocker(prevent some user right problem) - This
CHANGELOG
CAWis nowSarek- Dissect Workflow in 5 new scripts:
annotate.nf,main.nf,germlineVC.nf,runMultiQC.nfandsomaticVC.nf report.html,timeline.htmlandtrace.htmlare generated inReports/--versionis now used to define the workflow version- most params are now defined in the base.config file instead of in the scripts
- update RELEASE_CHECKLIST.md
checkParams(),checkParameterList(),checkParameterExistence()andisAllowedParams()in script functions are now called withinSarekUtilsnf_required_versionis nowparams.nfRequiredVersion- in
buildReferences.nfscript, channels now begin bych_, and files byf_ - use
PublishDir mode: 'link'instead ofcopy directoryMapnow containsparams.outDir- #539 - use Nextflow support of scratch
- reordered Travis CI tests
- update documentation
MultiQCversion in container from v1.4to v1.5vepgrch37container base image fromrelease_90.6torelease_92vepgrch38container base image fromrelease_90.6torelease_92VEPversion in containers from v90to v91nucleotidesPerSecondis nowparams.nucleotidesPerSecond- default
params.tagis nowlatestinstead of current version, so --tag needs to be specified with the right version to be sure of using thecontainerscorresponding
standardprofileuppmax-localhost.configfile
scripts/skeleton_batch.sh- old data and tsv files
- UPPMAX directories from containers
--stepinannotate.nf,germlineVC.nfandsomatic.nf- some
runOptionsfor Singularity (binding not needed anymore on UPPMAX) downloadprofile
- Zenodo for DOI
- Delivery README
- Document use of the
--sampleDiroption - Contributing Guidelines
- Issue Templates
- Release Checklist
--outDirawsbatchprofileaws-batch.configconfig file--noBAMQCparams (failing sometimes on Bianca)
- Update
Nextflowto0.26.0(new fancy report + AWS Batch) - Extra time on Travis CI testing
- Replace
bundleDirbyparams.genome_base - Update
MultiQCto1.3(MEGAQC FTW) - Move and rename some test files
- Version of COSMIC GRCh37 v83
- Write an error message when
--sampleDirdoes not find any FASTQ files base.configfor ConcatVCF process- File specification for recalibrationReport in RecalibrateBam process (got error on AWS Batch)
- #488 - Better CPU requirements for
ConcatVCF - #489 - Exception handling for
ASCAT - #490 - CPU requirements for
runSingleStrelkaandrunSingleManta
- #357 -
ASCATworks for GRCh38 - #471 - Running
Singularityon/scratch - #475 - 16 cpus for local executor
- #480 - No
tsvfile needed for stepannotate
- #479 - Typo in
uppmax-localhost.config
runascatandrunconvertallelecountscontainers are now replaced byr-basewillmclaren/ensembl-vep:release_90.5is now base forvepgrch37andvepgrch38
vepcontainerstrelka_config.inifile
- #471 - Running
Singularityon /scratch - #472 - Update function to check Nextflow version
- #473 - Remove
returnMin()function
- Fix version for Manuscript
- Singularity possibilities
- Reports made by default
- Intervals file can be a bed file
- Normal sample preprocessing + HaplotypeCaller is possible
- Better Travis CI tests
- Memory requirements
- Docker possibilities