Benchmark package
We prepared a package to benchmark Di4 against Di3, bedtools and BEDOPS. The package can be downloaded from this link. The package contents are as it follows:
-
files\; this folder contains2000files (ENCODE narrowPeak) downloaded from ENCODE repository. -
list; the narrowPeaks infilesfolder are grouped in9datasets labeled: A1, A2, A3, A4, B1, B2, C1, C2, and C3. The list of narrowPeaks in each of these datasets is given in9text files underlistfolder, where the text files are named with the label of dataset they belong to. -
copy.py; this is a python script that takes a dataset label (e.g., A1), and copies all the narrowPeaks belonging to that dataset (as given by text files underlistfolder) fromfilesfolder to a new folder named as dataset label. This script can be executed as the following:python copy.py a1
This script can be downloaded individually from this link.
-
run.py; this is a python script that runs bedtools and BEDOPS. The syntax to run this script is at it follows:python run.py TOOL_NAME DATASET [--on-the-fly]
where
TOOL_NAMEcan be either--bedtoolsor--bedops, andDATASETis a dataset label (e.g., a1). When the--on-the-flyflag is set, this script measures runtime considering both preprocessing (sorting data) and processing time; and if this flag is not provided, this script measures runtime considering only processing time. This script can be downloaded individually from this link. -
ref.narrowpeak; this file is used as a reference for running bedtools and BEDOPSintersectfunctions. -
README.pdf; this files contains a thorough explanation on how to benchmark Di4 against Di3, bedtools and BEDOPS using the provided datasets and scripts. The README file can be downloaded individually from this link.