Help with kallisto-bustools: how to interpret the inspect.json output file

79 views
Skip to first unread message

kt 427

unread,
May 20, 2022, 6:51:43 AM5/20/22
to kallisto and applications
Hi everyone,

I am trying to understand the entries of the `inspect.json` file that is output by the `kb-python` wrapper tool by the Pachter lab. I believe that this file is an output from `bustools`.  I have analysed (performed pseudoalignment and quantification with `kb-python`) the standard PBMC_1K_V3 scRNAseq dataset from 10x chromium.

Here is a screenshot of my `inspect.json` file obtained from running `kb-python` on the PBMC_1K_V3 dataset:
inspect_json.png

My questions are:
1. What do the various keys mean? For example, what does numRecords and numBarcodes mean? I suppose numRecords is the number of entries of the BUS format, but why is numBarcodes so high? The fastq files analysed corresponds to approximately 1000 cells, so why does the number of barcodes (I assume cell barcodes) exceed the number of cells by this much?  I am assuming that number of corrected barcodes = number of cells.


2. How important is this file for checking the performance of kb-python? I can understand the output of the `kb_info.json` and `run_info.json` files as they are self-explanatory, but the contents of `inspect.json` baffles me and I have yet to find any documentation of its contents online.

For your information, I share the cellRanger output [cellRanger_summary] and the `run_info.json` screenshot run_info.png

Thank you for reading and I appreciate your time.

Reply all
Reply to author
Forward
0 new messages