infer command takes a gene-resolved equivalence class count matrix, along with the
equivalence class description file (both output when running
quant with the
and performs inference via the EM to reduce the matrix to a cell-by-gene level count matrix.
This functionality makes it possible to separate the UMI resolution step (which may result in UMIs being assigned to equivalence classes of genes rather than individual genes), and the gene-level estimation step, that attempts to resolve gene-multimapping UMIs.
This command can takes the following options :
-c, --count-mat <eqc-mat>: This provides the path to the (mtx format) matrix of cells by equivalence class counts. Note: It is assumed that the parent directory where
eqc-matis located will also contain a file called
quants_mat_rows.txtcontaining the row names of the matrix and a file called
quants_mat_cols.txtcontaining the column names of the files. The
infercommand will not run if these other input files are absent from the parent directory of
-e, --eq-labels <eq-labels>: This provides the path to the file containing the gene labels of the equivalence class description.
-o, --output-dir <output-dir>: This provides the output file directory the quantification matrix, barcodes (row names), and genes (column names) will be written.
--quant-subset <sfile>: This optional argument provides a file containing list of barcodes to quantify (one barcode per line, written as a string), those not in this list will be ignored during inference and will not appear in the output quantification matrix. If this argument is not provided, then all of the original barcodes will be quantified.
-t, --threads <threads>: This option provides the number of threads to use for processing [default: number of hardware threads].
The output of the
infer command is written in the provided
output-dir. It consists of the cell-by-gene count matrix derived from the input cell-by-equivalence-class count matrix, as well as a
quants_mat_cols.txt file providing the row and column names for the output matrix, respectively.