Indexing First Resolution
Input intervals can be organized in Di4's first resolution using a batch index algorithm, which can be executed in single and double pass modes.
Set indexing mode
Use setim single or
setim multi to set indexing mode to single and multi-pass modes
respectively. One can also get the current setting for indexing mode using
getim command. See the following console output as an example.
> setim multi Indexing mode is set to <Multi-pass> indexing > setim single Indexing mode is set to <Single-pass> indexing > getim Indexing mode is set to <Single-pass> indexing
Batch index
Di4 can index data in batch; user specifies the files to be indexed, then Di4BCLI and Di4B parses the files one at a time, and indexes them in Di4.
Double-pass batch index: first pass
To run the first-pass of the double-pass indexing algorithm, use the following command:
batchindex *.narrowPeak
This command parses and indexes all the files with narrowpeaks
extension in WorkingDirectory. Once the operation is finished,
it reports the runtime and the number of indexed intervals as shown in the following example.
[44\45] wgEncodeAwgTfbsSydhK562Stat1Ifng30UniPk
Loaded #i: 2,203 ET: 00:00:00.0172472 Speed: 127,731 #i\sec
Indexed #i: 2,203 ET: 00:00:00.0588744 Speed: 37,419 #i\sec
[45\45] wgEncodeAwgTfbsSydhK562Stat1Ifng6hUniPk
Loaded #i: 2,333 ET: 00:00:00.0111938 Speed: 208,419 #i\sec
Indexed #i: 2,333 ET: 00:00:00.0554094 Speed: 42,105 #i\sec
#indexed intervals: 456,385
Load ET (sec): 2.9263167
Index ET (sec): 8.8259436
Commit ET (sec): 0.0255342
Average indexing speed: 51523.98 #i\sec
-: Done ... Overall ET: 00:00:08.8577204
Double-pass batch index: second pass
Run 2pass command to execute the second pass of double-pass indexing algorithm.
Note, this command is not required if indexing mode was set to single-pass indexing.
> 2pass 2ndPass #b: 746,190 ET: 00:00:01.0253235 Speed: 727,761 #b\sec -: Done ... Overall ET: 00:00:01.0267024