All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
- #774 - Add logo for Danish National Genome Center
- #762 - Polish CHANGELOG + figures
- #766 - Align box description in subway map
- #768 - Use double quotes to fix import of singularity images for deepvariant module
- #770 - Use double quotes to fix import of singularity images for gatk4/cnnscorevariants module
- #771 - update to new modules syntax
3.0.2 - Lájtávrre
Lájtávrre is a lake you can see from the Skierfe mountain, formed by the Rapaätno river.
- #691 - Enable
PROFILE=conda,PROFILE=dockerandPROFILE=singularityfor pytest - #716 - Add documentation for Azure recommended config vm_size
- #752 - Add tracking of all dependencies starting 3.0
- #679 - Back to
dev - #685 - Updating the nf-core modules used by Sarek
- #691 - To run the same pytest as before locally, use
PROFILE=docker - #692 - Use
params.tools=strelkain profiletest - #696 - Adding check of md5-sums in CI-tests
- #719 - Added boxes to subway map
- #720 - Sync
TEMPLATEwithtools2.5 - #723 - Sync
TEMPLATEwithtools2.5.1 - #726 - Adapt resource requests
- #730 - Reduce number of tests
- #731 - Run the somatic test as default on
-profile test_full, the germline can be tested with-profile test_full_germline - #733 - Add description for params.cf_chrom_len
- #734 - nf-core modules update -a
- #736 - More extensive CI for default test
- #742 - Requiring the Haplotypecaller to be specified as one of the tools for joint germline genotyping
- #752 - Code polishing
- #679 - Fixed typos in subway maps
- #681 - Fixed intermediate files published cf #680
- #688 - Fixed VEP plugins issue cf #687
- #689 - Fixed when clause for non
BWA membuilding mapping indexes - #704 - Fixed
cf_ploidyto string instead of number - #705 - Fix publishing for processes in
alignment_to_fastqsubworkflow; prevent tabix computation forknown_snpswhen present; publishumiprocessed files intopreprocessing/umisubdirectory - #706 - Fixed
vep_versionnot found error when running--vep_loftee - #724 - Fixed prettier issue
- #727 - Allow
.listinterval files; removesecondsfrom GRCh38 file to allow--nucleotides_per_secondto be used - #728 - Circumvent issue with controlfreec and length file containing regions not in intervals file
- #729 - Trailing commas in
--tools,--skip_toolsand--use_gatk_sparknow raise failure cf #722 - #741 - Fix prefix for
bcftools sortfor joint germline variant calling - #743 - Remove profile definitions in profile to avoid issues with Tower
- #758 - Fix Zenodo batch
- #760 - Fix CHANGELOG dependencies
- #761 - Fix font in subway map and workflow image
- #742 - Removed some lines from the usage-doc as Sarek no longer support input supplied as a list of multiple csv-files
- #757 - Remove
errorStrategyinconf/modules.config
3.0.1 - Saiva
Saiva is a lake in the Sarek national park, just below the Skierfe mountain.
- #708 - Fixes mpileup bug. Update nf-core module
samtools/mpileupto subset CRAM file by intervals
3.0 - Skierfe
Skierfe is a mountain in the Sarek national park, and the inspiration for the logo.
- #388 - Add cram support + read splitting with
SeqKitfor speedup - #394 - Add
DeepVariant - #411 - cram in csv samplesheet
- #448 - Allow to skip base quality recalibration with
--skip_bqsr - #449 - @FriederikeHanssen is now a
CODEOWNERS - #460 - Add posters
- #463 - Add dark/light logo versions
- #464, #514 - Add
DRAGMAPas a possible aligner - #479 - Add more subworkflows
- #485 -
--skip_qc,--skip_markduplicatesand--skip_bqsris now--skip_tools - #507, #537 - Subway map for building indexes
- #512, #531, #537 - Subway map for pipeline
- #522 - Add QC for vcf files & MultiQC
- #533 - Add param
--only_paired_variant_callingto allow skipping of germline variantcalling for paired samples - #536 - Add
--step markduplicatesto start from duplicate marking,--step prepare_recalibrationnow ONLY starts at processBaseRecalibrator& addingbamandcraminput support for--stepmarkduplicates,prepare_recalibration,recalibrate, andvariant_calling - #538 - Add param
--seq_platform, default:ILLUMINA - #545 - Add modules and subworkflows for
cnvkittumor_only mode - #540 - Add modules and subworkflows for
cnvkitsomatic mode - #557 - Add
Haplotypecallersingle sample mode together withCNNScoreVariantsandFilterVariantTranches - #576 - Add modules and subworkflows for
cnvkitgermline mode - #582 - Added option
--vep_out_formatfor setting the format of the output-file from VEP tojson,taborvcf(default) - #594 - Add parameter
--save_output_as_bamto allow output of result files in BAM format - #595 - Added Haplotypecaller joint germline calling
- #597 - Added tiddit for tumor variant calling
- #600 - Added description for UMI related params in schema
- #604, #617 - Added full size tests WGS 30x NA12878
- #613 - Added params
--dbnsfp_fieldsto allow configuration of fields for thedbnsfpVEPplugin - #613 - Added params
--dbnsfp_consequenceto allow configuration of consequence for thedbnsfpVEPplugin - #613 - Added params
--vep_versionto allow more configuration on the vep container definition - #620 - Added checks for sex information when running a CNV tools
- #623 - Additional checks of data in the input sample sheet
- #629 - Added checks to catch inconsistency between supplied samples and requested tools
- #632 - Added params
--snpeff_versionto allow more configuration on the snpeff container definition - #632 - Added params
--vep_include_fastato use the fasta file for annotation - #639 - Adding genes-txt-file and summary-html-file to the published output from snpEff
- #647 - Update resource requests for preprocessing based on what worked for 5 ICGC matched WGS samples
- #652 - Added full size somatic test profile
- #580 - changed the test_full config to real public WXS data. 1 sample WXS germline, 1 Tumor/Normal pair. https://doi.org/10.1038/sdata.2016.25 and https://doi.org/10.1038/s41587-021-00994-5
- #383, #528 - Update
CHANGELOG - #390 - Update
nextflow_schema.json - #408 - Sync
TEMPLATEwithtools2.0.1 - #416 - Sync
TEMPLATEwithtools2.1 - #417 - Merge
dsl2anddevbranches - #419 - Improve preprocessing
- #420, #455, #459, #633 -
nf-core modules update --all - #427 - Update
DeepVariant - #462 - Update modules and
modules.config - #465 - Improve
test_data.config - #466, #478, #492, #521 - Move some local modules to
nf-core/modules - #466, #485, #492, #494, #515 - Improve preprocessing subworkflows
- #474, #475 - Sync
TEMPLATEwithtools2.2 - #487, #489, #492, #497, #522, #583 - Improve variant calling subworkflows
- #498 - Update docs
- #501 - Sync
TEMPLATEwithtools2.3 - #511 - Sync
TEMPLATEwithtools2.3.2 - #520 - Improve annotation subworkflows
- #537 - Update workflow figure
- #539 - Update
CITATIONS.md - #544 -
Mutect2is no longer compatible with--no_intervals - #551 - Sync
TEMPLATEwithtools2.4 - #562 - Restart from
--step annotateis now also requiring a CSV file - #563 - Updated subway map
- #570 - Extract mpileup into its own subworkflow; zip mpileup files
- #571 - Including and using GATK4's mergeVcfs
- #572 - Adjusted subway map svg for firefox compatibility
- #577 - Update
RELEASE_CHECKLIST - #578 - Updated module deeptools/bamcoverage
- #585 - Remove explicit BAM to CRAM conversion after MarkduplicatesSpark; tool does it internally
- #581 -
TIDDITis updated to3.1.0 - #593 - update
ensembl-vepcache version and module - #600 - Remove
TODOin awsfulltest - #606 - Updated
ASCATto version3.0as module - #608 - Prevent candidate VCFs from getting published in manta
- #618 - Update
multiqcmodule - #618 - Update test yml files
- #620 -
genderis nowsexin the samplesheet - #630 - Update citations file
- #632 - Update
snpEffversion to5.1and cache up to105 - #632 - Update
VEPversion to106.1and cache up to106 - #618 - Update
multiqcmodule update test yml files - #618 - Update test yml files
- #633 - Update
BCFTOOLSversion to1.15.1 - #644 - Use
-Yforbwa-mem(2)and remove-M - #645 - Merge
tests/nextflow.configinconf/test.config - #646 - Update
nextflow_schema.jsonto reflect new parameters and functions, removes--annotation_cache, removes--ascat_chromosomes - #649 - Update, simplify and add more files to all
test_*.ymlfiles - #651 - Added TIDDIT_SOMATIC subworkflow
- #653 - Coherent results subfolder structure between preprocessing, variantcalling and reporting
- #659 - Update usage.md docu section on
How to run ASCAT with WES - #661 - Add cnvkit reference creation to index subway map
- #662 - Add bgzipped and indexed GATKBundle reference files for
GATK.GRCh37and replace germline-resources with GATKBundle one - #663 - Add separate parameters for
ASCATandControlFREECback in - #668 - Update annotation documentation
- #674 - Default value for splitting is
50000000
- #234 - Switching to DSL2
- #234, #238 - Add modules and sub workflow for building indices
- #234, #252, #256, #283, #334 - Update Nextflow
19.10.0->20.11.0-edge - #239 - Restore Sarek ascii art to header
- #241, #248, #250, #257, #259 - Add modules and sub workflow for preprocessing
- #242, #244, #245, #246, #247, #249, #252, #256, #263, #264, #283, #285, #338 - Refactor
dsl2branch - #257 - Use a params modules config file
- #266, #285, #297 - Add modules and sub workflow for variant calling
- #333 - Bump
Sarekversion to3.0dev - #334 - Sync
dsl2anddevbranches - #342 - Update
README.md - #386 - Annotation is back
- #410, #412, #584 - Update
CItests - #418 - Fix
known_siteschannels - #432, #457 - Sort before
tabix index - #454 - Input is optional (can actually be found automatically by
Sarekif previously run) - #463, #468 - Fix
nf-core lint - #513, #527 - CNV is back
- #529 - Do not save
versions.ymlfiles - #524 - Fix intervals usage by counting the actual list of scatter/gather files produced and not overall number of intervals
- #549 - Fix unique lanes required for Freebayes: issue #311, replaces
meta.clone()with actual copy of map to avoid issues with https://nfcore.slack.com/archives/C027CM7P08M/p1644241819942339 - #567 - Fix interval name resolving during scatter/gather by moving logic to modules.config causing name to be correctly resolved on process execution; also fixed duplicate naming when variant callers produce multiple vcf files by adding field
typetometamap - #585 - Fix Spark usage for GATK4 modules
- #587 - Fix issue with VEP extra files
- #581 -
TIDDITis back - #590 - Fix empty folders during scatter/gather
- #592 - Fix optional resources for Mutect2, GetPileupSummaries, and HaplotypeCaller: issue #299, #359, #367
- #598, #614, #626 - Remove WARNING message for config selector not matching
- #599 - Add checks for correct data type for
params.step - #599 - Add checks for no empty
--toolswith--step variant_callingor--step annotate - #600 - Remove
nf-core lintwarnings - #602 - Fixed bug in
alignment_to_fastqand added tests - #609 - Remove unused intervals code, reorganize combined intervals file
- #613 - Fixed filenames for
dbnsfpandSpliceAIVEPplugin - #615 - Fix ASCAT igenomes file paths
- #619 - Fix issue with checking samplesheet content with AWS
- #628 - Fix issue with value converting to string before schema validation
- #628 - Fix dbsnp check issue with
--step annotate - #618 - Fix
bcftools/vcftoolssample labelling in multiqc report - #618 - Fix issue with tiddit #621
- #618 - Fix channel issue with
targets.bedin prepare_intervals - #634 - Fix issue with samtools/mosdepth plots in multiqc_report
- #641 - Fix issue with duplicate substring in tools and skip_tools
- #642 - Only unzip ref files if tool is run, only publish ref files if
--save_referenceand simplify CNKit logic - #650 - Fix intervals checks
- #654 - Allow any step but annotation to start from BAM files
- #655 - Fix
--intervals falselogic & add versioning for local modules - #658 - Fix split fastq names in multiqc-report
- #666 - Simplify multiqc config channel input
- #668 - Add
snpeff_versionandvep_versiontoschema_ignore_paramsto avoid issue when specifying on command line - #669 - Fix path to files when creating csv files
| Dependency | Old version | New version |
|---|---|---|
ascat |
2.5.2 | 3.0.0 |
bcftools |
1.9 | 1.15.1 |
bwa-mem2 |
2.0 | 2.2.1 |
bwa |
0.7.17 | unchanged |
cancerit-allelecount |
4.0.2 | 4.3.0 |
cnvkit |
0.9.6 | 0.9.9 |
control-freec |
11.6 | unchanged |
deepvariant |
added | 1.3.0 |
dragmap |
added | 1.2.1 |
ensembl-vep |
99.2 | 106.1 |
fastp |
added | 0.23.2 |
fastqc |
0.11.9 | unchanged |
fgbio |
1.1.0 | 2.0.2 |
freebayes |
1.3.2 | 1.3.5 |
gatk4 |
4.1.7.0 | 4.2.6.1 |
gawk |
added | 5.1.0 |
genesplicer |
1.0 | removed |
htslib |
1.9 | removed |
llvm-openmp |
8.0.1 | removed |
manta |
1.6.0 | unchanged |
markdown |
3.1.1 | removed |
mosdepth |
0.3.3 | unchanged |
msisensor-pro |
1.1.a | 1.2.0 |
msisensor |
0.5 | removed |
multiqc |
1.8 | 1.13a |
openjdk |
added | 8.0.312 |
openmp |
8.0.1 | removed |
p7zip |
added | 15.09 |
pigz |
2.3.4 | unchanged |
pygments |
2.5.2 | removed |
pymdown-extensions |
6.0 | removed |
qualimap |
2.2.2d | removed |
r-ggplot2 |
3.3.0 | removed |
samblaster |
0.1.24 | 0.1.26 |
samtools |
1.9 | 1.15.1 |
sed |
added | 4.7 |
snpeff |
4.3.1t | 5.1 |
strelka |
2.9.10 | unchanged |
svdb |
added | 2.6.1 |
tabix |
added | 1.11 |
tiddit |
2.7.1 | 3.1.0 |
trim-galore |
0.6.5 | removed |
vcfanno |
0.3.2 | removed |
vcftools |
0.1.16 | unchanged |
- #485 -
--skip_qc,--skip_markduplicatesand--skip_bqsris now--skip_tools - #538 -
--sequencing_centeris now--seq_center - #538 -
--markdup_java_optionshas been removed - #539 -
--annotate_toolshas been removed - #539 -
--cadd_cache,--cadd_indels,--cadd_indels_tbi,--cadd_wg_snvs,--cadd_wg_snvs_tbihave been removed - #539 -
--genesplicerhas been removed - #539 -
conf/genomes.configandparams.genomes_basehave been removed - #562 - Restart from
--step annotatefrom folder is removed. Use acsvfile instead - #571 - Removed the local module
concat_vcf - #605 - Removed Scatter/gather from GATK_SINGLE_SAMPLE_GERMLINE_VARIANT_CALLING, all intervals are processed together
- #643 - Removed Sentieon parameters
2.7.2 - Áhkká
Áhkká is one of the massifs just outside of the Sarek National Park.
- #566 - Fix caching bug affecting a variable number of
MapReadsjobs due to non-deterministic state ofstatusMapduring caching evaluation
2.7.1 - Pårtejekna
Pårtejekna is one of glaciers of the Pårte Massif.
- #353 - Add support for task retries with exit code 247 (exhibited by
Picard MarkDuplicates) - #354 - Add tumor only mode for
Mutect2andMSIsensor - #356 - Add
--cf_contamination_adjustmentparams to adjust contamination withControl-FREEC - #372 - Add
--cf_contaminationparams to specify contamination value withControl-FREEC
- #373 - Sync
TEMPLATEwithtools1.14 - #376 - Better logo on Github dark Mode
- #387 - Fix tables for TSV file content
- #375, #381, #382, #385 - Fix bugs due to
TEMPLATEsync from #373 - #378 - Fix
Sparkrelated issue due toDockersettings innextflow.config
- #368 - Remove social preview image to use GitHub OpenGraph
2.7 - Pårte
Pårte is one of the main massif in the Sarek National Park.
- #145 - Add
UMI annotation and consensusfunctionality toSarek - #230 - Add
ignore_soft_clipped_basesoption forGATK Mutect2#218 - #253 - Add
UMICItesting - #262 - Add
nextflow_schema.json - #237, #282 - Add
--alignerto choose betweenbwaandbwa-mem2 - #294 - Add
Troubleshootingsection todocs/usage.md - #302, #304 - Add WES and tumor-only mode for
Control-FREEC
- #253, #255, #326, #329 - Update docs
- #260, #262, #278, #322 - Sync with
TEMPLATEupdated from nf-core/tools1.10.2 - #262 - Update issue templates to fit the recommended community standards
- #278, #322 - Refactor docs
- #284 - Update F1000Research publication to version 2
- #284 - Update Scilifelab logo
- #317 - Update
README.md(Add: QBiC + Friederike/Gisela) - #320 - Set
MarkDuplicates MAX_RECORDS_IN_RAMto default value
- #229 - Fix
Control-FREECrestart issue #225 - #236 - Fix
GATK Mutect2typo issue #227 - #271 - Fix
ConcatVCF_Mutect2SIGPIPEissue #268 - #272 - Fix annotation
--tools mergeissue - #279 - Fix issue with
--step prepare_recalibration#267 - #280 - Use HTML codes instead of
<and>in docs - #288 - Fix
test_annotationprofile - #289 - Random string added to
extractFastqFromDirto avoid name collition - #290, #323 - Faster solving of
Condaenvironment - #293 - Fix typo issue when printing infos #292
- #309 - Fixed concatenation of many VCF files
- #310 - Fix Github Actions not running after November 16, 2020 (deprecated Github Actions API #739
- #329 - Simplify
Control-FREECusage - #331 - Replace
spreadoperator bycombineto removeNextflowdeprecation warning
- #234 - Removing obsolete script #92
- #262 - Removing deprecated params:
annotateTools,annotateVCF,cadd_InDels,cadd_InDels_tbi,cadd_WG_SNVs,cadd_WG_SNVs_tbi,maxMultiqcEmailFileSize,noGVCF,noReports,noStrelkaBP,nucleotidesPerSecond,publishDirMode,sample,sampleDir,saveGenomeIndex,skipQC,snpEff_cache,targetBed - #262 - Removing warning message about deprecated and obsolete params
- #324 -
--no_gatk_sparkis now removed, use--use_gatk_sparkinstead - #324 -
--no_gvcfis now removed, use--generate_gvcfinstead
2.6.1 - Gådokgaskatjåhkkå
Gådokgaskatjåhkkå is the highest peak in the Piellorieppe massif.
- #208 - Merge changes from the release PR
- #208 - Bump version to
3.0dev - #214 - Update
GATKfrom4.1.6.0to4.1.7.0 - #219 - Added
awsfulltest.ymlGitHub Actions workflow - #222 - Bump version to
2.6.1and minor release - #223 - Apply comments from the release PR
- #211 - Extend timeout for pushing to DockerHub for VEP containers
- #212 - No AWS test on forks
- #214 - Fix channels collision between
FreebayesandGATK Mutect2#200 - #214 - Fix warning Invalid tag value for
CreateIntervalBeds#209 - #214 - Fix
GATK Mutect2issue #210 - #219 - Updated
awstest.ymlGitHub actions workflow - #221 - Fix issue with
tmp_dirinBaseRecalibratorprocess
2.6 - Piellorieppe
Piellorieppe is one of the main massif in the Sarek National Park.
- #76 - Add
GATK Sparkpossibilities to Sarek - #87 - Add
GATK BaseRecalibratorplot toMultiQCreport - #115 - Add @szilvajuhos abstract for ESHG2020
- #117 - Add
Trim Galorepossibilities to Sarek - #141 - Add containers for
WBcel235 - #150, #151, #154 - Add AWS mega test GitHub Actions
- #153 - Add
CNVkitpossibilities to Sarek - #158 - Added
ggplot2version3.3.0 - #163 - Add MSIsensor in tools and container
- #164 - Add
--no_gatk_sparkparams and tests - #167 - Add
--markdup_java_optionsdocumentation - #169 - Add
RELEASE_CHECKLIST.mddocument - #174 - Add
variant_calling.mddocumentation - #175 - Add
Sentieondocumentation - #176 - Add empty
customgenome ingenomes.configto allow genomes that are not inAWS iGenomes - #179, #201 - Add
FreeBayesgermline variant calling - #180 - Now saving Mapped BAMs (and creating TSV) in minimal setting
- #182 - Add possibility to run
HaplotypeCallerwithoutdbsnpso it can be used to actually generate vcfs to build a set of known sites (cf gatkforums) - #195 - Now creating TSV for duplicates marked BAMs in minimal setting
- #195, #202 - Add
--save_bam_mappedparams to save mapped BAMs - #197 - Add step
prepare_recalibrationto allow restart from DuplicatesMarked BAMs - #204 - Add step
Control-FREECto allow restart from pileup files - #205 - Add
--skip_markduplicatesto allow skipping theMarkDuplicatesprocess
- #76 - Use
MarkDuplicatesSparkinstead ofMarkDuplicates - #76 - Use
gatk4-sparkinstead ofgatk4inenvironment.yml - #80 - Re-bump
devbranch - #85 - Use new merged vcf files for known indels to simplify setting up channel
- #104 - Update Figure 1
- #107 - Switch params to snake_case
- #109 - Update publication with F1000Research preprint
- #113 - Move social preview image
- #120 - Sync TEMPLATE
- #121 - Update
MultiQCto1.8 - #126, #131 - Update docs
- #131 - Use
nfcore/base:1.9as base for containers - #131 - Update
Control-FREECto11.5 - #131 - Update
FastQCto0.11.9 - #131 - Update
FreeBayesto1.3.2 - #131 - Update
Mantato1.6.0 - #131 - Update
Qualimapto2.2.2d - #131 - Update
VEPto99.2 - #141 - Update
snpEffcache version from75to87forGRCh37 - #141 - Update
snpEffcache version from86to92forGRCh38 - #141 - Update
VEPdatabases to99 - #143 - Revert
snpEffcache version to75forGRCh37 - #143 - Revert
snpEffcache version to86forGRCh38 - #152, #158, #164, #174, #194, #198, #204 - Update docs
- #164 - Update
gatk4-sparkfrom4.1.4.1to4.1.6.0 - #180, #195 - Improve minimal setting
- #183, #204 - Update
input.mddocumentation - #197 - Output directory
DuplicateMarkedis now replaced byDuplicatesMarked - #204 - Output directory
controlFREECis now replaced byControl-FREEC
- #83 - Fix some typos in
docs/input.md - #107 - Fix linting
- #110 - Fix
snpEffreport issue cf #106 - #126 - Fix
iGenomespaths - #127, #128 - Fix
ASCAT - #129 - Fix issue with Channel
channel ch_software_versions_yaml - #129 - Apply @drpatelh fix for
mardown_to_html.pycompatibility with Python 2 - #129 - Removed
Python3.7.3from conda environment due to incompatibility - #129 - Change ascii characters that were not supported from the
output.mddocs - #140 - Fix extra T/N combinations for
ASCATcf #136 - #141 - Fix
download_cache.nfscript to download cache forsnpEffandVEP - #143 - Fix annotation CI testing with
snpEffandVEP - #144 - Fix CircleCI for building
VEPcontainers - #146 - Fix
--no_intervalsforGATK Mutect2cf #135 - #156 - Fix typos
- #156 - Fix issues with
dbsnpfiles while using onlySentiontools - #158 - Fix typo with
params.snpeff_cacheto decide containers forsnpEff - #164 - Fix issues when running with
Sentieon - #164 - Add more VCFs to annotation
- #167 - Add
--markdup_java_optionsdocumentation to fix #166 - #178 - Fix
Sentieonvariant calling, now using deduped bam files - #188 - Fix input/output channels for process
IndexBamFileto match actual files in themapped.tsvfiles - #189 - Fix
no_intervalsfor processHaplotypeCaller(the file just need to actually exists...) - #197 - Fix issue with
--step recalibrate - #197 - Fix typo in output directory
DuplicateMarked->DuplicatesMarked
- #107 -
--annotateToolsis now deprecated, use--annotate_toolsinstead - #107 -
--cadd_InDelsis now deprecated, use--cadd_indelsinstead - #107 -
--cadd_InDels_tbiis now deprecated, use--cadd_indels_tbiinstead - #107 -
--cadd_WG_SNVsis now deprecated, use--cadd_wg_snvsinstead - #107 -
--cadd_WG_SNVs_tbiis now deprecated, use--cadd_wg_snvs_tbiinstead - #107 -
--maxMultiqcEmailFileSizeis now deprecated, use--max_multiqc_email_sizeinstead - #107 -
--noGVCFis now deprecated, use--no_gvcfinstead - #107 -
--noStrelkaBPis now deprecated, use--no_strelka_bpinstead - #107 -
--nucleotidesPerSecondis now deprecated, use--nucleotides_per_secondinstead - #107 -
--publishDirModeis now deprecated, use--publish_dir_modeinstead - #107 -
--saveGenomeIndexis now deprecated, use--save_referenceinstead - #107 -
--skipQCis now deprecated, use--skip_qcinstead - #107 -
--snpEff_cacheis now deprecated, use--snpeff_cacheinstead - #107 -
--targetBedis now deprecated, use--target_bedinstead
- #107 -
--acLociGCis now removed, use--ac_loci_gcinstead - #107 -
--acLociis now removed, use--ac_lociinstead - #107 -
--bwaIndexis now removed, use--bwainstead - #107 -
--chrDiris now removed, use--chr_dirinstead - #107 -
--chrLengthis now removed, use--chr_lengthinstead - #107 -
--dbsnpIndexis now removed, use--dbsnp_indexinstead - #107 -
--fastaFaiis now removed, use--fasta_faiinstead - #107 -
--genomeDictis now removed, use--dictinstead - #107 -
--genomeFileis now removed, use--fastainstead - #107 -
--genomeIndexis now removed, use--fasta_faiinstead - #107 -
--germlineResourceIndexis now removed, use--germline_resource_indexinstead - #107 -
--germlineResourceis now removed, use--germline_resourceinstead - #107 -
--igenomesIgnoreis now removed, use--igenomes_ignoreinstead - #107 -
--knownIndelsIndexis now removed, use--known_indels_indexinstead - #107 -
--knownIndelsis now removed, use--known_indelsinstead - #107 -
--singleCPUMemis now removed, use--single_cpu_meminstead - #107 -
--snpeffDbis now removed, use--snpeff_dbinstead - #107 -
--vepCacheVersionis now removed, use--vep_cache_versioninstead - #152 - Removed
Jenkinsfile - #169 - Removed omicX from README
- #181 - Remove duplicate code in
nextflow.config
2.5.2 - Jåkkåtjkaskajekna
Jåkkåtjkaskajekna is one of the two glaciers of the Ålkatj Massif.
- #45 - Include Workflow figure in
README.md - #46 - Add location to abstracts
- #52 - Add support for mouse data
GRCm38 - #60 - Add
no_intervalsparams - #60 - Add automatic generation of
intervalsfile withBuildIntervalsprocess - #60 - Add minimal support for minimal genome (only
fasta, orfasta+knownIndels) - #60 - Add new processes (
IndexBamFile,IndexBamRecal) to deal with optional usage of interval files and minimal genome - #60 - Add tests for minimal genome usage
- #60 - Add new minimal genomes (
TAIR10,EB2,UMD3.1,bosTau8,WBcel235,ce10,CanFam3.1,canFam3,GRCz10,danRer10,BDGP6,dm6,EquCab2,equCab2,EB1,Galgal4,galGal4,Gm01,hg38,hg19,Mmul_1,mm10,IRGSP-1.0,CHIMP2.1.4,panTro4,Rnor_6.0,rn6,R64-1-1,sacCer3,EF2,Sbi1,Sscrofa10.2,susScr3,AGPv3) toigenomes.config - #61 - Add params
split_fastq - #61 - Add test
SPLITFASTQ - #66 - Add
Sentieonpossibilities to Sarek
- #54 - Bump version to
2.5.2dev - #60 - Some process (
BaseRecalibrator,ApplyBQSR,Mpileup) have now optional usage of interval files - #60 - Update documentation
- #71 - Update
README - #71 - Update
CHANGELOG - #74 - Update docs
- #74 - Improve CI tests (both Jenkins and GitHub actions tests)
- #74 - Move all CI from
ci-extra.ymltoci.yml
- #46 - Remove mention of old
build.nfscript which was included inmain.nf - #74 - Remove
download_image.shandrun_tests.shscripts - #76 - Remove
runOptions = "-u \$(id -u):\$(id -g)"innextflow.configto enableSparkpossibilities
- #40 - Fix issue with
publishDirModewithintestprofile - #42 - Fix typos, and minor updates in
README.md - #43 - Fix automated
VEPbuilds with circleCI - #54 - Apply fixes from release
2.5.1 - #58 - Fix issue with
.interval_listfile from theGATKbundle #56 that was not recognized in theCreateIntervalsBedprocess - #71 - Fix typos in
CHANGELOG - #73 - Fix issue with label
memory_maxforBaseRecalibratorprocess #72
2.5.1 - Årjep-Ålkatjjekna
Årjep-Ålkatjjekna is one of the two glaciers of the Ålkatj Massif.
- #53 - Release
2.5.1
- #48 - Fix
singularity.autoMountsissue - #49 - Use correct tag for annotation containers
- #50 - Fix paths for scripts
2.5 - Ålkatj
Ålkatj is one of the main massif in the Sarek National Park.
Initial release of nf-core/sarek, created with the nf-core template.
- #2 - Create
nf-core/sarekenvironment.ymlfile - #2, #3, #4, #5, #7, #9, #10, #11, #12 - Add CI for
nf-core/sarek - #3 - Add preprocessing to
nf-core/sarek - #4 - Add variant calling to
nf-core/sarekwithHaplotypeCaller, and single modeMantaandStrelka - #5, #34 - Add variant calling to
nf-core/sarekwithManta,Strelka,Strelka Best Practices,GATK Mutect2,FreeBayes,ASCAT,ControlFREEC - #6 - Add default containers for annotation to
nf-core/sarek - #7 - Add
MultiQC - #7 - Add annotation
- #7 - Add social preview image in
pngandsvgformat - #7, #8, #11, #21 - Add helper script
run_tests.shto run different tests - #7, #8, #9 - Add automatic build of specific containers for annotation for
GRCh37,GRCh38andGRCm38usingCircleCI - #7, #8, #9, #11 - Add helper script
build_reference.shto build small reference from nf-core/test-datasets:sarek - #7, #9, #11, #12 - Add helper script
download_image.shto download containers for testing - #8 - Add test configuration for easier testing
- #9, #11 - Add scripts for
ASCAT - #10 - Add
TIDDITto detect structural variants - #11 - Add automatic build of specific containers for annotation for
CanFam3.1usingCircleCI - #11, #12 - Add posters and abstracts
- #12 - Add helper script
make_snapshot.shto make an archive for usage on a secure cluster - #12 - Add helper scripts
filter_locifile.pyandselectROI.py - #12 - Use
labelfor processes configuration - #13 - Add Citation documentation
- #13 - Add
BamQCprocess - #13 - Add
CompressVCFsnpEffandCompressVCFvepprocesses - #18 - Add
--no-reportsoption for tests + add snpEff,VEP,merge to MULTIPLE test - #18 - Add logo to
MultiQCreport - #18, #29 - Add params
--skipQCto skip specified QC tools - #18 - Add possibility to download other genome for
sareksnpeffandsarekvepcontainers - #20 - Add
markdownlintconfig file - #21 - Add tests for latest
Nextflowversion as well - #21 - Add
genomes.configfor genomes withoutAWS iGenomes - #24 - Added
GATK4 Mutect2calling and filtering - #27, #30 - Use Github actions for CI, linting and branch protection
- #31 - Add
nf-core lint - #31 - Add extra CI to
GitHub Actionsnf-core extra CI - #35 - Building indexes from nf-core/test-datasets:sarek for CI and small tests
- #1, #2, #3, #4, #5, #6, #7, #8, #9, #10, #11, #12, #18, #20, #21, #23, #29 - Update docs
- #4 - Update
cancerit-allelecountfrom2.1.2to4.0.2 - #4 - Update
gatk4from4.1.1.0to4.1.2.0 - #7, #23 -
--sampleDiris now deprecated, use--inputinstead - #7, #23 -
--annotateVCFis now deprecated, use--inputinstead - #8, #12 - Improve helper script
build.nffor downloading and building reference files - #9 -
ApplyBQSRis now parallelized - #9 - Fastq files are named following "${idRun}_R1.fastq.gz" in the
FastQCoutput for easier reporting - #9 - Status is now a map with
idpatient,idsampleas keys (ie:status = statusMap[idPatient, idSample]) - #9 - Use
ensembl-vep95.2instead of96.0 - #11 - Summary HTML from
VEPis now in theReportsdirectory - #12 - Update configuration files
- #12 - Disable
Dockerinsingularityprofile - #12 - Disable
Singularityindockerprofile - #12 - Disable
DockerandSingularityincondaprofile - #12 - Simplify
check_max()function - #13 - Merge
BamQCmappedandBamQCrecalibratedprocesses intoBamQCprocess - #13 - Split
CompressVCFprocess intoCompressVCFsnpEffandCompressVCFvepprocesses - #16 - Make scripts in
bin/andscripts/executable - #18 - Use
--no-reportsfor TravisCI testing - #18 - Add
--no-reportsfor all tests but MULTIPLE in Jenkins - #18, #29 -
--noReportsis now--skipQC all - #18, #21 - Update logo
- #21 - Moved
smallGRCh37path togenomes.config - #23 - Rename
genomeFile,genomeIndexandgenomeDictbyfasta,fastaFaianddict - #23 -
--sampleis now deprecated, use--inputinstead - #23 -
--genomeFileis now deprecated, use--fastainstead - #23 -
--genomeIndexis now deprecated, use--fastaFaiinstead - #23 -
--genomeDictis now deprecated, use--dictinstead - #24 -
AWS iGenomesconfig now contains germline resource forGATK4 Mutect2 - #30 - Simplify code for
MapReadsprocess - #24 -
AWS iGenomesconfig now contains germline resource forGATK4 Mutect2 - #31 - Move extra CI to
GitHub Actionsnf-core extra CI - #32, #33 - Install
ASCATwithcondain theenvironment.ymlfile - #33 - Use
workflow.manifest.versionto specify workflow version in path to scripts forControlFREECandVEPprocesses - #35 - Building indexes is now done in
main.nf - #35 -
build.nfscript now only download cache, so renamed todownloadcache.nf - #35 - Use
tabixinstead ofIGVtoolsto build vcf indexes - #35 - Refactor references handling
- #35 - Use Channel values instead of
referenceMap - #37 - Bump version for Release
- #38 - File names before merge is based on
${idSample}_${idRun}instead of${idRun}
- #9 - Removed
relatedness2graph fromvcftools stats - #13 - Removed
BamQCmappedandBamQCrecalibratedprocesses - #13 - Removed
CompressVCF - #18 - Removed params
--noReports - #24 - Removed
GATK3.X Mutect2 - #31 - Remove extra CI from
Travis CIandGitHub Actionsnf-core CI - #32, #35 - Clean up
environment.ymlfile - #35 - Remove building indexes from
build.nfscript - #35 - Remove helper script
build_reference.sh - #35 - Remove
IGVtools - #35 - Remove
GATK Mutect2fromMULTIPLEtest - #35 - Remove
referenceMapanddefineReferenceMap()and use Channel values instead
- #3 - Fix
Dockerownership - #11 - Fix
MergeMpileupPublishDir - #13 - Fix merge in annotation
- #14 - Fix output name for vcf files
- #16 - Fix path to
Rscript - #18 - Improve cpu usage
- #18 - Use same font for
nf-coreandsarekin ascii art - #20 - Use new logo in README
- #20 - Fix path to references genomes
- #22 - Fix
--singleCPUMemissue - #30 - Fix choice between
inputPairReadsFastQCandinputBAMFastQCchannels - #31 - Fix badges according to nf-core lint
- #31 - Fix
rcolorbrewerversion according to nf-core lint - #33 - Fix MD Linting
- #38 - Avoid collision in
MultiQC - #39 - Fix
ch_dbsnpchannel
- #23 -
--sampleis now deprecated, use--inputinstead - #23 -
--genomeFileis now deprecated, use--fastainstead - #23 -
--genomeIndexis now deprecated, use--fastaFaiinstead - #23 -
--genomeDictis now deprecated, use--dictinstead - #29 -
--noReportsis now deprecated, use--skipQC all
2.3.FIX1 - 2019-03-04
2.3 - Äpar - 2019-02-27
Äpar is one of the main massif in the Sarek National Park.
- #628, #722 -
ASCATnow use.gcfile - #712, #718 - Added possibilities to run Sarek with
conda - #719 - Annotation documentation
- #719 - Helper script to download
snpeffandVEPcache files - #719 - New
--annotation_cache,--snpEff_cache,--vep_cacheparameters - #719 - Possibility to use cache wen annotating with
snpEffandVEP - #722 - Add path to ASCAT
.gcfile inigenomes.config - #728 - Update
Sarek-datasubmodule with multiple patients TSV file - #732 - Add
cadd_WG_SNVs,cadd_WG_SNVs_tbi,cadd_InDels,cadd_InDels_tbiandcadd_cacheparams - #732 - Add
tabixindexed cache forVEP - #732 - New
DownloadCADDprocess to download CADD files - #732 - Specify values for
cadd_WG_SNVs,cadd_WG_SNVs_tbi,cadd_InDels,cadd_InDels_tbiandcadd_cacheparams inmunin.conffile - #732 - Use
cadd_cacheparam for optional use of CADD VEP plugin inannotate.nf - #732 -
VEPcache has now fasta files for--HGVS - #735 - Added
--exomeforManta, and forStrelkaBP - #735 - Added
Travis CItest for targeted
- #710 - Improve release checklist and script
- #711 - Improve configuration priorities
- #716 - Update paths to containers and
AWS iGenomes - #717 -
checkFileExtensionhas changed tohasExtension, and now only verify if file has extension - #717 -
fastqFilesrenamed toinputFiles - #717 -
mappingstep can now map BAM files too - #717 -
MapReadscan now convert BAM to FASTQ and feed it to BWA on the fly - #717, #732 - Update documentation
- #719 -
snpeffandvepcontainers are now built withconda - #719 -
vepCacheVersionis now defined inconf/genomes.configorconf/igenomes.config - #722 - Add path to ASCAT
.gcfile inigenomes.config - #722 - Update
Sarek-datasubmodule - #723, #725 - Update docs
- #724 - Improved
AWS batchconfiguration - #728 - Improved usage of
targetBEDparams - #728 -
StrelkaBest Practices output is now prefixed withStrelkaBP_ - #728 - VCFs and Annotated VCFs are now ordered by Patient, then tools
- #732 - Merge
buildContainers.nfandbuildReferences.nfinbuild.nf - #732 - Reduce number of CPUs for
RunVEPto4cf: VEP docs - #732 - Update
VEPfrom95.1to95.2
- #715 - Remove
defReferencesFilesfunction frombuildReferences.nf - #719 -
snpEffbase container is no longer used - #721 - Remove
COSMICdocs - #728 - Remove
defineDirectoryMap() - #732 - Remove
--databaseoption for VEP cf: VEP docs
- #720 -
bamQCis now run on the recalibrated BAMs, and not afterMarkDuplicates - #726 - Fix
Ascatref file input (one file can't be a set) - #727 -
bamQCoutputs are no longer overwritten (name of dir is now the file instead of sample) - #728 - Fix issue with annotation that was consuming
cachechannels - #728 - Fix multi sample TSV file #691
- #733 - Fix the possibility to specify reference files on the command line
2.2.2 - 2018-12-19
- #671 - New
publishDirModeparam and docs - #673, #675, #676 - Profiles for BinAC and CFC clusters in Tübingen
- #679 - Add container for
CreateIntervalBeds - #692, #697 - Add
AWS iGenomespossibilities (withinconf/igenomes.conf) - #694 - Add monochrome and grey logos for light or dark background
- #698 - Add btb profile for munin server
- #702 - Add
font-ttf-dejavu-sans-mono2.37andfontconfig2.1devto container
- #663 - Update
do_release.shscript - #671 -
publishDirmodes are now params - #677, #698, #703 - Update docs
- #678 - Changing
VEPtov92and adjusting CPUs forVEP - #679 - Update old
awsbatchconfiguration - #682 - Specifications for memory and cpus for
awsbatch - #693 -
Qualimap bamQCis now ran after mapping and after recalibration for better QC - #700 - Update
GATKto4.0.9.0 - #702 - Update
FastQCto0.11.8 - #705 - Change
--TMP_DIRby--tmp-dirforGATK4.0.9.0BaseRecalibrator - #706 - Update
Travis CItesting
- #665 - Input bam file now has always the same name (whether it is from a single fastq pair or multiple) in the
MarkDuplicatesprocess, so metrics too - #672 - Process
PullSingularityContainersfrombuildContainers.nfnow expect a file with the correct.simgextension for singularity images, and no longer the.imgone - #679 - Add
publishDirModeforgermlineVC.nf - #700 - Fix #699 missing DP in the FORMAT column VCFs for Mutect2
- #702 - Fix #701
- #705 - Fix #704
2.2.1 - 2018-10-04
2.2.0 - Skårki - 2018-09-21
Skårki is one of the main massif in the Sarek National Park.
- #613 - Add Issue Templates (bug report and feature request)
- #614 - Add PR Template
- #615 - Add presentation
- #616 - Update documentation
- #620 - Add
tmp/to.gitignore - #625 - Add
pathfindras a submodule - #635 - To process targeted sequencing with a target BED
- #639 - Add a complete example analysis to docs
- #640, #642 - Add helper script for changing version number
- #608 - Update
Nextflowrequired version - #615 - Use
splitCsvinstead ofreadlines - #616 - Update
CHANGELOG - #621, #638 - Improve install script
- #621, #638 - Simplify tests
- #627, #629, #637 - Refactor docs
- #629 - Refactor config
- #632 - Use 2 threads and 2 cpus
FastQCprocesses - #637 - Update tool version gathering
- #638 - Use correct
.simgextension for Singularity images - #639 - Smaller refactoring of the docs
- #640 - Update RELEASE_CHECKLIST
- #642 -
MultiQC1.5 -> 1.6 - #642 -
Qualimap2.2.2a -> 2.2.2b - #642 - Update
condachannel order priorities - #642 -
VCFanno0.2.8 -> 0.3.0 - #642 -
VCFtools0.1.15 -> 0.1.16
2.1.0 - Ruotes - 2018-08-14
Ruotes is one of the main massif in the Sarek National Park.
- #555 -
snpEffoutput intoVEP - #556 -
StrelkaBest Practices - #563 - Use
SnpEFFreports inMultiQC - #568 -
VCFToolsprocessRunVcftoolsfor QC - #574, #580 - Abstracts for
NPMI,JOBIMandEACR25 - #577 - New repository for testing: Sarek-data
- #595 - New library
QCfor functionsbamQC,bcftools,samtoolsStats,vcftools,getVersionBCFtools,getVersionGATK,getVersionManta,getVersionSnpEFF,getVersionStrelka,getVersionVCFtools,getVersionVEP - #595 - New Processes
GetVersionBCFtools,GetVersionGATK,GetVersionManta,GetVersionSnpEFF,GetVersionStrelka,GetVersionVCFtools,GetVersionVEP - #595 - New
Pythonscriptbin/scrape_tool_versions.pyinspired by @ewels and @apeltzer - #595 - New QC Process
RunVcftools - #596 - New profile for
BinACcluster - #597 - New function
sarek_ascii()inSarekUtils - #599, #602 - New Process
CompressVCF - #601, #603 - Container for
GATK4 - #606 - Add test data as a submodule from
Sarek-data - #608 - Add documentation on how to install Nextflow on
bianca
- #557, #583, #585, #588 - Update help
- #560 -
GitHublangage for the repository is nowNextflow - #561 -
do_all.shbuild only containers for one genome reference (defaultGRCh38) only - #571 - Only one container for all QC tools
- #582, #587 - Update figures
- #595 - Function
defineDirectoryMap()is now part ofSarekUtils - #595 - Process
GenerateMultiQCconfigreplace by functioncreateMultiQCconfig() - #597 -
extractBams()now takes an extra parameter - #597 - Move
checkFileExtension(),checkParameterExistence(),checkParameterList(),checkReferenceMap(),checkRefExistence(),extractBams(),extractGenders(),returnFile(),returnStatus()andreturnTSV()functions toSarekUtils - #597 - Reduce data footprint for Process
CreateRecalibrationTable - #597 - Replace deprecated operator
phasebyjoin - #599 - Merge is tested with
ANNOTATEALL - #604 - Synching
GRCh38wgs_calling_regionsbedfiles - #607 - One container approach
- #607 - Update to
GATK4 - #608 - Update
Nextflowrequired version - #616 - Update
CHANGELOG - #617 - Replace deprecated
Nextflow ``$namesyntax withwithName
- #560 - Display message for
repositoryandcontainerPath - #566 -
slurmDownloadprofile - #579, #584 -
Mantaoutput reorganized after modification forStrelka Best Practicesprocess - #585 - Trace file is plain txt
- #590, #593 - Fix
Singularityinstallation inTravis CItesting - #598, #601 - Fixes for
PythonscriptselectROI.pyto work withCLCviewer
- #607 - Remove
Mutect1
2.0.0 - 2018-03-23
First release under the Sarek name, from the National Park in Northern Sweden.
- Basic wrapper script
- Abstract, posters and figures
- ROI selector and
FreeBayessanitizer scripts - New logo and icon for the project
- Check for existing tumor/normal channel
SarekUtilswithcheckParams(),checkParameterList(),checkParameterExistence()andisAllowedParams()functions- Some
runOptionsfordocker(prevent some user right problem) - This
CHANGELOG
CAWis nowSarek- Dissect Workflow in 5 new scripts:
annotate.nf,main.nf,germlineVC.nf,runMultiQC.nfandsomaticVC.nf report.html,timeline.htmlandtrace.htmlare generated inReports/--versionis now used to define the workflow version- Most params are now defined in the
base.configfile instead of in the scripts - Update
RELEASE_CHECKLIST.md checkParams(),checkParameterList(),checkParameterExistence()andisAllowedParams()in script functions are now called withinSarekUtilsnf_required_versionis nowparams.nfRequiredVersion- In
buildReferences.nfscript, channels now begin bych_, and files byf_ - Use
PublishDir mode: 'link'instead ofcopy directoryMapnow containsparams.outDir- #539 - Use Nextflow support of scratch
- Reordered
Travis CItests - Update documentation
MultiQCversion in container from v1.4to v1.5vepgrch37container base image fromrelease_90.6torelease_92vepgrch38container base image fromrelease_90.6torelease_92VEPversion in containers from v90to v91nucleotidesPerSecondis nowparams.nucleotidesPerSecond- Default
params.tagis nowlatestinstead of current version, so--tagneeds to be specified with the right version to be sure of using thecontainerscorresponding
standardprofileuppmax-localhost.configfile
scripts/skeleton_batch.sh- Old data and tsv files
UPPMAXdirectories from containers--stepinannotate.nf,germlineVC.nfandsomatic.nf- Some
runOptionsforSingularity(binding not needed anymore onUPPMAX) downloadprofile
1.2.5 - 2018-01-18
Zenodofor DOI- Delivery README
- Document use of the
--sampleDiroption - Contributing Guidelines
- Issue Templates
- Release Checklist
--outDirawsbatchprofileaws-batch.configconfig file--noBAMQCparams (failing sometimes onBianca)
- Update
Nextflowto0.26.0(new fancy report +AWS Batch) - Extra time on
Travis CItesting - Replace
bundleDirbyparams.genome_base - Update
MultiQCto1.3(MEGAQCFTW) - Move and rename some test files
- Version of
COSMICGRCh37v83 - Write an error message when
--sampleDirdoes not find any FASTQ files base.configforConcatVCFprocess- File specification for
recalibrationReportinRecalibrateBamprocess (got error onAWS Batch)
1.2.4 - 2017-10-27
- #488 - Better CPU requirements for
ConcatVCF - #489 - Exception handling for
ASCAT - #490 - CPU requirements for
runSingleStrelkaandrunSingleManta
1.2.3 - 2017-10-18
- #357 -
ASCATworks forGRCh38 - #471 - Running
Singularityon/scratch - #475 - 16 cpus for local executor
- #480 - No
tsvfile needed for stepannotate
1.2.2 - 2017-10-06
- #479 - Typo in
uppmax-localhost.config
1.2.1 - 2017-10-06
runascatandrunconvertallelecountscontainers are now replaced byr-basewillmclaren/ensembl-vep:release_90.5is now base forvepgrch37andvepgrch38
vepcontainerstrelka_config.inifile
- #471 - Running
Singularityon /scratch - #472 - Update function to check
Nextflowversion - #473 - Remove
returnMin()function
1.2.0 - 2017-10-02
- Fix version for Manuscript
1.1 - 2017-09-15
Singularitypossibilities
- Reports made by default
- Intervals file can be a bed file
- Normal sample preprocessing +
HaplotypeCalleris possible - Better
Travis CItests
- Memory requirements
1.0 - 2017-02-16
Dockerpossibilities