Tools for working with genomic and high throughput sequencing data.
Group: SAM/BAM
Sorts a SAM or BAM file. Several sort orders are available:
GroupReadByUmi. Sorts reads by
the earlier un-soft-clipped 5’ coordinate of the read pair, the higher un-soft-clipped 5’ coordinate of the
read pair, the cellular barcode (CB tag), the molecular identifier (MI tag), read name, library,
and if R1 has the lower coordinates of the pair.Uses a temporary directory to buffer sets of sorted reads to disk. The number of reads kept in memory
affects memory use and can be changed with the --max-records-in-ram option. The temporary directory
to use can be set with the fgbio global option --tmp-dir.
An example invocation might look like:
java -Xmx4g -jar fgbio.jar --tmp-dir=/my/big/scratch/volume \
SortBam --input=queryname.bam --sort-order=Coordinate --output coordinate.bam
| Name | Flag | Type | Description | Required? | Max # of Values | Default Value(s) |
|---|---|---|---|---|---|---|
| input | i | PathToBam | Input SAM or BAM. | Required | 1 | |
| output | o | PathToBam | Output SAM or BAM. | Required | 1 | |
| sort-order | s | SamOrder | Order into which to sort the records. | Optional | 1 | Coordinate |
| max-records-in-ram | m | Int | Max records in RAM. | Optional | 1 | 1000000 |