HiC-Pro Output folder alreads exists

125 views
Skip to first unread message

Ray Oq

unread,
May 30, 2018, 9:32:40 AM5/30/18
to HiC-Pro
Hi Group,
     I am trying to get HiC-Pro up and running for our Univ project.   I was able to run the complete Test Dataset and receive the 
correct output as per the HiC-Pro website.

I then proceed to run HiC-Pro with our data.   I received the following message after several days of processing:

Run HiC-Pro 2.10.0
--------------------------------------------
Fri May 18 08:29:10 EDT 2018
Bowtie2 alignment step1 ...
/isg/shared/apps/HiC-Pro/2.10.0/scripts/bowtie_wrap.sh -c /home/CAM/aromd/HiCPro_Config_Prod.txt -u >> hicpro.log
--------------------------------------------
Thu May 24 00:59:08 EDT 2018
Bowtie2 alignment step2 ...
/isg/shared/apps/HiC-Pro/2.10.0/scripts/bowtie_wrap.sh -c /home/CAM/aromd/HiCPro_Config_Prod.txt -l >> hicpro.log
/home/CAM/aromd/HiCPro_Prod_008_Output folder alreads exists. Do you want to delete it before running ?(y/n) [n] : 

real 0m0.418s
user 0m0.050s
sys 0m0.090s

My batch file has the following execution statement:

time HiC-Pro -c HiCPro_Config_Prod.txt -i rawdata -o HiCPro_Prod_008_Output

I did notice some additional post with this error message but they did not provide any clarification for my problem.

Any feedback would be appreciated.

Thanks in advance.
Ray

Moshe Olshansky

unread,
Jun 1, 2018, 10:03:51 PM6/1/18
to HiC-Pro
Hi Ray,

If you created HiCPro_Prod_008_Output folder so that HiCPro output goes there, you shouldn't have done this. HiCPro will create that folder. So if this is an empty folder just delete it (rmdir HiCPro_Prod_008_Output) and run HiCPro again.

Hope this helps,
Moshe.

nservant

unread,
Jun 4, 2018, 4:06:33 AM6/4/18
to HiC-Pro
Hi guys,
Could you send me you config file and input folder organization.
I would be interesting to try to reproduce this message ...
Thanks

Ray Oq

unread,
Jun 4, 2018, 4:15:33 PM6/4/18
to HiC-Pro
Hi Nicolas,
     I am away from the office and will post the files on Monday, 6/11.

Thanks for your response.
Ray

nservant

unread,
Jun 5, 2018, 3:47:31 AM6/5/18
to HiC-Pro
Hi Ray
Sure no worries. In practice, I don't think I need the 'real' input files. Their name and how they are organized should be enough.
Best

Ray Oq

unread,
Jun 7, 2018, 4:17:10 PM6/7/18
to HiC-Pro
Hi Nicolas,
    Here is my config file:

# Please change the variable settings below if necessary

#########################################################################
## Paths and Settings  - Do not edit !
#########################################################################

TMP_DIR = tmp
LOGS_DIR = logs
BOWTIE2_OUTPUT_DIR = bowtie_results
MAPC_OUTPUT = hic_results
RAW_DIR = rawdata

#######################################################################
## SYSTEM - PBS - Start Editing Here !!
#######################################################################
N_CPU = 64
LOGFILE = hicpro.log

JOB_NAME = IMR90_split
JOB_MEM = 100gb
JOB_WALLTIME = 25-00:00:00
JOB_QUEUE = general
#########################################################################
## Data
#########################################################################

PAIR1_EXT = _R1
PAIR2_EXT = _R2

#######################################################################
## Alignment options
#######################################################################

FORMAT = phred33
MIN_MAPQ = 0

BOWTIE2_IDX_PATH = /isg/shared/databases/alignerIndex/animal/hg19_ucsc/hg19_bowtie2
BOWTIE2_GLOBAL_OPTIONS = --very-sensitive -L 30 --score-min L,-0.6,-0.2 --end-to-end --reorder
BOWTIE2_LOCAL_OPTIONS =  --very-sensitive -L 20 --score-min L,-0.6,-0.2 --end-to-end --reorder

#######################################################################
## Annotation files
#######################################################################

REFERENCE_GENOME = Human_genome
GENOME_SIZE = chrom_hg19.sizes

#######################################################################
## Allele specific
#######################################################################

ALLELE_SPECIFIC_SNP = 

#######################################################################
## Digestion Hi-C
#######################################################################

GENOME_FRAGMENT = HindIII_resfrag_hg19.bed
LIGATION_SITE = AAGCTAGCTT
MIN_FRAG_SIZE = 100
MAX_FRAG_SIZE = 100000
MIN_INSERT_SIZE = 100
MAX_INSERT_SIZE = 600

#######################################################################
## Hi-C processing
#######################################################################

MIN_CIS_DIST =
GET_ALL_INTERACTION_CLASSES = 1
GET_PROCESS_SAM = 1
RM_SINGLETON = 1
RM_MULTI = 1
RM_DUP = 1

#######################################################################
## Contact Maps
#######################################################################

BIN_SIZE = 500000 1000000
MATRIX_FORMAT = upper

#######################################################################
## ICE Normalization
#######################################################################
MAX_ITER = 100
FILTER_LOW_COUNT_PERC = 0.02
FILTER_HIGH_COUNT_PERC = 0
EPS = 0.1


Here is my batch file:
#!/bin/bash

#SBATCH -J AROMD06032018_008

#SBATCH -N 1

#SBATCH -n 1

#SBATCH -c 16

#SBATCH --mail-user=xxxxx...@uchc.edu

#SBATCH --mail-type=ALL

#SBATCH -o HiCPro-%j.output

##SBATCH -e jobname-%j.error

#SBATCH --partition=general

#SBATCH --mem=64G


# code starts here
module load HiC-Pro

time HiC-Pro -c HiCPro_Config_Prod_060318.txt -i rawdata -o HiCPro_Prod_008_Output_060318


This is the file directory structure:

/home/CAM/aromd/rawdata/S1_008

The S1_008 directory has our fastq files R1 and R2.

Thanks for your feedback in advance.
Ray
Reply all
Reply to author
Forward
0 new messages