Skip to content

Commit cfdbeda

Browse files
committed
Update README.md
1 parent d444671 commit cfdbeda

1 file changed

Lines changed: 44 additions & 49 deletions

File tree

README.md

Lines changed: 44 additions & 49 deletions
Original file line numberDiff line numberDiff line change
@@ -11,73 +11,68 @@ A tool to check two different EingenStrat databses for shared individuals, and e
1111
```
1212
usage: eigenstrat_database_tools.py (-i <Input file prefix>) (-c <input ind file> | -R | -E) [-L <SAMPLE LIST> | -S Ind [-S Ind2]] [-o <OUTPUT FILE PREFIX>]
1313
14-
A tool to check two different EingenStrat databses for shared individuals, and
15-
extract or remove individuals from an EigenStrat database.
14+
A tool to check two different EingenStrat databses for shared individuals, and extract or remove individuals from an EigenStrat database.
1615
1716
Available options:
1817
-h, --help show this help message and exit
19-
-i <INPUT FILES PREFIX>, --Input <INPUT FILES PREFIX>
20-
The desired input file prefix. Input files are assumed
21-
to be <INPUT PREFIX>.geno, <INPUT PREFIX>.snp and
22-
<INPUT PREFIX>.ind .
18+
-g <GENO FILE NAME>, --genoFn <GENO FILE NAME>
19+
The path to the input geno file.
20+
-s <SNP FILE NAME>, --snpFn <SNP FILE NAME>
21+
The path to the input snp file.
22+
-i <IND FILE NAME>, --indFn <IND FILE NAME>
23+
The path to the input ind file.
2324
-o <OUTPUT FILES PREFIX>, --Output <OUTPUT FILES PREFIX>
24-
The desired output file prefix. Three output files are
25-
created, <OUTPUT FILES PREFIX>.geno , <OUTPUT FILES
26-
PREFIX>.snp and <OUTPUT FILES PREFIX>.ind .
27-
-s <INPUT FILE SUFFIX>, --Suffix <INPUT FILE SUFFIX>
28-
The suffix (if any) that follows .geno/.snp/.ind in
29-
the input files. For example, specifying '-s .txt'
30-
will treat <INPUT PREFIX>.{geno,snp,ind}.txt as the
31-
desired input files.
25+
The desired output file prefix. Three output files are created, <OUTPUT FILES PREFIX>.geno , <OUTPUT FILES PREFIX>.snp and <OUTPUT FILES PREFIX>.ind .
3226
-C <INPUT FILE>, --Check <INPUT FILE>
33-
Check the -i .ind file and the second .ind file for
34-
duplicate individuals. Population assignment and/or
35-
individual sex are not checked, only individual names.
36-
Names are case sensitive.
37-
-E, --Extract Extract the selected individuals from the EigenStrat
38-
database, creating a new set of EigenStrat files with
39-
the selected individuals only.
40-
-R, --Remove Remove the selected individuals from the EigenStrat
41-
database, creating a new set of EigenStrat files
42-
without the selected individuals.
27+
Check the -i .ind file and the second .ind file for duplicate individuals. Population assignment and/or individual sex are not checked, only individual names. Names are case
28+
sensitive.
29+
-E, --Extract Extract the selected individuals from the EigenStrat database, creating a new set of EigenStrat files with the selected individuals only.
30+
-R, --Remove Remove the selected individuals from the EigenStrat database, creating a new set of EigenStrat files without the selected individuals.
4331
-L <INDIVIDUAL LIST FILE>, --SampleList <INDIVIDUAL LIST FILE>
44-
A list of samples to be Extracted (-E) or Removed (-R)
45-
from the database. Can be a list of individual names
46-
(1 per line), or the output of duplicate check (-C).
47-
Required with -E or -R. Mutually exclusive with -S.
32+
A list of samples to be Extracted (-E) or Removed (-R) from the database. Can be a list of individual names (1 per line), or the output of duplicate check (-C). Required with -E
33+
or -R. Mutually exclusive with -S.
4834
-S <INDIVIDUAL>, --Sample <INDIVIDUAL>
49-
A samples to be Extracted (-E) or Removed (-R) from
50-
the database. Can be called multiple times. Required
51-
with -E or -R. Mutually exclusive with -L.
35+
A samples to be Extracted (-E) or Removed (-R) from the database. Can be called multiple times. Required with -E or -R. Mutually exclusive with -L.
5236
-v, --version Print the version and exit.
53-
5437
```
5538
---
5639
# Eigenstrat SNP Coverage
57-
A tool to calculate the numbered of covered and total reads for each individual in an EigenStrat dataset.
40+
A tool to calculate the numbered of covered and total reads for each individual in an EigenStrat dataset.
5841

5942
```
6043
usage: eigenstrat_snp_coverage.py (-i <Input file prefix>) [-s <Input file suffix>] [-o <Output filepath>] [-j | --json]
6144
62-
A tool to check two different EingenStrat databses for shared individuals, and
63-
extract or remove individuals from an EigenStrat database.
45+
A tool to check two different EingenStrat databses for shared individuals, and extract or remove individuals from an EigenStrat database.
6446
6547
Available options:
6648
-h, --help show this help message and exit
67-
-i <INPUT FILES PREFIX>, --Input <INPUT FILES PREFIX>
68-
The desired input file prefix. Input files are assumed
69-
to be <INPUT PREFIX>.geno, <INPUT PREFIX>.snp and
70-
<INPUT PREFIX>.ind .
71-
-s <INPUT FILE SUFFIX>, --Suffix <INPUT FILE SUFFIX>
72-
The suffix (if any) that follows .geno/.snp/.ind in
73-
the input files. For example, specifying '-s .txt'
74-
will treat <INPUT PREFIX>.{geno,snp,ind}.txt as the
75-
desired input files.
49+
-g <GENO FILE NAME>, --genoFn <GENO FILE NAME>
50+
The path to the input geno file.
51+
-s <SNP FILE NAME>, --snpFn <SNP FILE NAME>
52+
The path to the input snp file.
53+
-i <IND FILE NAME>, --indFn <IND FILE NAME>
54+
The path to the input ind file.
7655
-o <OUTPUT FILEPATH>, --Output <OUTPUT FILEPATH>
77-
The filepath where the output table should be saved.
78-
Omit to print to stdout.
79-
-j, --json Create additional json formatted output file named
80-
<OUTPUT FILE>.json . [Default: '<INPUT FILES
81-
PREFIX>_eigenstrat_coverage_mqc.json']
56+
The filepath where the output table should be saved. Omit to print to stdout.
57+
-j <JSON OUTPUT FILEPATH>, --json <JSON OUTPUT FILEPATH>
58+
Create additional json formatted output file named <JSON OUTPUT FILEPATH> .
8259
-v, --version Print the version and exit.
8360
```
61+
# Rename Snps
62+
A tool to rename the SNPs in an EigenStrat snp file based on a reference EigenStrat or plink snp file. Uses the genetic coordinate of SNPs to determine identity.
63+
64+
```
65+
usage: rename_snps.py (-i <INPUT .SNP FILE>) (-n <NAMED SNP FILE>) [-f <NAMED FILE FORMAT>]
66+
67+
Rename the SNPs in an eigenstrat snp file based on a reference eigenstrat or plink snp file. Uses the genetic coordinate of SNPs to determine identity.
68+
69+
Available options:
70+
-h, --help show this help message and exit
71+
-i <INPUT .SNP FILE>, --Input <INPUT .SNP FILE>
72+
The input snp file.
73+
-n <NAMED SNP FILE>, --Name_file <NAMED SNP FILE>
74+
An Eigenstrat or plink formatted snp file with the desired SNP names.
75+
-f <NAMED FILE FORMAT>, --Format <NAMED FILE FORMAT>
76+
The format of the desired snp name file. Can be either 'EIGENSTRAT' or 'PLINK' [default: EIGENSTRAT]
77+
-v, --version Print the version and exit.
78+
```

0 commit comments

Comments
 (0)