You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+25-38Lines changed: 25 additions & 38 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -109,48 +109,35 @@ template config file to copy and use for your runs.
109
109
110
110
To run the simulator in parallel with the same config file and significantly speed up runtime, please see the [Parallelization](#parallelization) section.
111
111
112
-
reference: full path to a fasta file to generate reads from
113
-
read_len: The length of the reads for the fastq (if using). Integer value, default 101.
114
-
coverage: desired coverage value. Float or int, default = 10
115
-
ploidy: Desired value for ploidy (# of copies of each chromosome in the organism). Default is 2
116
-
paired_ended: If paired-ended reads are desired, set this to True. Setting this to true requires
117
-
either entering values for fragment_mean and fragment_st_dev or entering the path to a
118
-
valid fragment_model.
119
-
fragment_mean: Use with paired-ended reads, set a fragment length mean manually
120
-
fragment_st_dev: use with paired-ended reads, set the standard deviation of the fragment length dataset
112
+
`reference`: full path to a fasta file to generate reads from
113
+
`read_len`: The length of the reads for the fastq (if using). _Integer value, default 101._
114
+
`coverage`: desired coverage value. _Float or int, default = 10_
115
+
`ploidy`: Desired value for ploidy (# of copies of each chromosome in the organism). _Default is 2_
116
+
`paired_ended`: If paired-ended reads are desired, set this to True. Setting this to true requires either entering values for fragment_mean and fragment_st_dev or entering the path to a valid fragment_model.
117
+
`fragment_mean`: Use with paired-ended reads, set a fragment length mean manually
118
+
`fragment_st_dev`: Use with paired-ended reads, set the standard deviation of the fragment length dataset
121
119
122
120
The following values can be set to true or omitted to use defaults. If True, NEAT will produce the file type.
123
121
The default is given:
124
122
125
-
produce_bam: False
126
-
produce_vcf: False
127
-
produce_fastq: True
128
-
129
-
error_model: full path to an error model generated by NEAT. Leave empty to use default model
130
-
(default model based on human, sequenced by Illumina)
131
-
mutation_model: full path to a mutation model generated by NEAT. Leave empty to use a default
132
-
model (default model based on human data sequenced by Illumina)
133
-
fragment_model: full path to fragment length model generate by NEAT. Leave empty to use default model
134
-
(default model based on human data sequenced by Illumina)
135
-
136
-
threads: The number of threads for NEAT to use.
137
-
Note: this feature is not yet fully implemented
138
-
avg_seq_error: average sequencing error rate for the sequencing machine. Use to increase or
139
-
decrease the rate of errors in the reads. Float betwoon 0 and 0.3. Default is set by the error model.
140
-
rescale_qualities: rescale the quality scores to reflect the avg_seq_error rate above. Set True to activate if you
141
-
notice issues with the sequencing error rates in your datatset.
142
-
include_vcf: full path to list of variants in vcf format to include in the simulation. These will be inserted as they
143
-
appear in the input VCF into the final VCF, and the corresponding fastq and bam files, if requested.
144
-
target_bed: full path to list of regions in bed format to target.
145
-
All areas outside these regions will have coverage of 0.
146
-
discard_bed: full path to a list of regions to discard, in BED format.
147
-
mutation_rate: Desired rate of mutation for the dataset. Float between 0.0 and 0.3
148
-
(default is determined by the mutation model)
149
-
mutation_bed: full path to a list of regions with a column describing the mutation rate of that region,
150
-
as a float with values between 0 and 0.3. The mutation rate must be in the third column as, e.g., mut_rate=0.00.
151
-
rng_seed: Manually enter a seed for the random number generator. Used for repeating runs. Must be an integer.
152
-
min_mutations: Set the minimum number of mutations that NEAT should add, per contig. Default is 0. We recommend setting
153
-
this to at least one for small chromosomes, so NEAT will produce at least one mutation per contig. |
123
+
`produce_bam`: False
124
+
`produce_vcf`: False
125
+
`produce_fastq`: True
126
+
127
+
`error_model`: full path to an error model generated by NEAT. Leave empty to use default model _(default model based on human, sequenced by Illumina)_
128
+
`mutation_model`: full path to a mutation model generated by NEAT. Leave empty to use a default model _(default model based on human data sequenced by Illumina)_
129
+
`fragment_model`: full path to fragment length model generate by NEAT. Leave empty to use default model _(default model based on human data sequenced by Illumina)_
130
+
131
+
`threads`: The number of threads for NEAT to use. _Note: this feature is not yet fully implemented_
132
+
`avg_seq_error`: average sequencing error rate for the sequencing machine. Use to increase or decrease the rate of errors in the reads. _Float between 0 and 0.3. Default is set by the error model._
133
+
`rescale_qualities`: rescale the quality scores to reflect the avg_seq_error rate above. Set True to activate if you notice issues with the sequencing error rates in your datatset.
134
+
`include_vcf`: full path to list of variants in vcf format to include in the simulation. These will be inserted as they appear in the input VCF into the final VCF, and the corresponding fastq and bam files, if requested.
135
+
`target_bed`: full path to list of regions in bed format to target. All areas outside these regions will have coverage of 0.
136
+
`discard_bed`: full path to a list of regions to discard, in BED format.
137
+
`mutation_rate`: Desired rate of mutation for the dataset. _Float between 0.0 and 0.3 (default is determined by the mutation model)_
138
+
`mutation_bed`: full path to a list of regions with a column describing the mutation rate of that region, as a float with values between 0 and 0.3. The mutation rate must be in the third column as, e.g., mut_rate=0.00.
139
+
`rng_seed`: Manually enter a seed for the random number generator. Used for repeating runs. _Must be an integer._
140
+
`min_mutations`: Set the minimum number of mutations that NEAT should add, per contig. _Default is 0._ We recommend setting this to at least one for small chromosomes, so NEAT will produce at least one mutation per contig.
0 commit comments