Benchmark package
We prepared a package to benchmark Di4 against Di3, bedtools and BEDOPS. The package can be downloaded from this link. The package contents are as it follows:
-
files\
; this folder contains2000
files (ENCODE narrowPeak) downloaded from ENCODE repository. -
list
; the narrowPeaks infiles
folder are grouped in9
datasets labeled: A1, A2, A3, A4, B1, B2, C1, C2, and C3. The list of narrowPeaks in each of these datasets is given in9
text files underlist
folder, where the text files are named with the label of dataset they belong to. -
copy.py
; this is a python script that takes a dataset label (e.g., A1), and copies all the narrowPeaks belonging to that dataset (as given by text files underlist
folder) fromfiles
folder to a new folder named as dataset label. This script can be executed as the following:python copy.py a1
This script can be downloaded individually from this link.
-
run.py
; this is a python script that runs bedtools and BEDOPS. The syntax to run this script is at it follows:python run.py TOOL_NAME DATASET [--on-the-fly]
where
TOOL_NAME
can be either--bedtools
or--bedops
, andDATASET
is a dataset label (e.g., a1). When the--on-the-fly
flag is set, this script measures runtime considering both preprocessing (sorting data) and processing time; and if this flag is not provided, this script measures runtime considering only processing time. This script can be downloaded individually from this link. -
ref.narrowpeak
; this file is used as a reference for running bedtools and BEDOPSintersect
functions. -
README.pdf
; this files contains a thorough explanation on how to benchmark Di4 against Di3, bedtools and BEDOPS using the provided datasets and scripts. The README file can be downloaded individually from this link.