-
Notifications
You must be signed in to change notification settings - Fork 2
Description
Hi
I was wondering if you could help me because I have done everything on the install instructions but ExonDel still fails.
I have all required perl modules and both R and samtools are on my path.
The command I am using is:
perl ~/Programs/ExonDel-master/ExonDel.pl -i ExonDel_test.txt -o /home/user/ND-34/ExonDelResults -c /home/user/ND-34/ExonDel_cfg.txt -g /home/user/ND-34/gene_list.txt
here is the output from the run command
[Mon Nov 30 10:41:59 2015] Only the genes in /home/user/Tasks/ND-34/gene_list.txt will be used
[Mon Nov 30 10:41:59 2015] GC adjustment will not be performed, and the constant cutoffs in config file will be used
[Mon Nov 30 10:41:59 2015] If your bed file has fourth column, it should be gene name and can be matched to genes in home/user/Tasks/ND-34/gene_list.txt
[Mon Nov 30 10:41:59 2015] Loading BED file
Use of uninitialized value $in in open at /home/user/Programs/ExonDel-master/ExonDel.pl line 497, <GENELIST> line 7.
here are the content of each file:
ExonDel_cgf.txt
#reference .bed file
bedfile=/home/user/ND-34/genes.bed
#reference .gtf file
refseq=/home/user/ND-34/refseq_no_chr_prefix.txt
#reference .fa file
reffa=/home/user/ND-34/hg19.fa
#config file for ExonDel software
[perl]
#Minimal percent of covered base pairs for each exon
exon_bp_cover_threshold=0.1
#Minimal percent of covered exons for each gene. 1 means 100%
overall_exon_count_threshold=1
#Minimum mapping quality for an alignment to be used
mapQ=20
#Minimum base quality for a base to be considered
baseQ=20
#where the R bin file is
RBin=R
#where the SAMtools bin file is
samtoolsBin=samtools
#name of the log file
logFileName=ExonDel.log
[R]
#Maximal number of exons for the moving-window in exon deletion detection
maxWinLength=9
#Minimum number of exons for the moving-window in exon deletion detection
minWinLength=1
#Minimum number of exons for a gene to be considered in exon deletion detection
minExonNum=10
#T means True and F means False. If only some genes were used (use -g option), GC adjustment will not be performed no matter what adjustGC below was set (As GC adjustment is based on all genes).
adjustGC=F
#If all genes were used (didn't use -g option), cutoff 1,2 will be determined by cutoffQuantile 1,2 in all genes
cutoffQuantile1=0.01
cutoffQuantile2=0.1
#If only some genes were used (use -g option), cutoff 1,2 will be determined by constant cutoff 1,2 below
cutoff1=2
cutoff2=20
here is the list of genes in gene_list.txt
BRCA1
BRCA2
P53
PTEN
ATM
NF1
ATR
here is the list of genes in genes.bed
10 89623194 89731687 PTEN
11 108093558 108239826 ATM
13 32889616 32973809 BRCA2
17 29421944 29704695 NF1
17 41196311 41277500 BRCA1
3 142168076 142297668 ATR
Here are the first few lines of the refseq_no_chr_prefix.txt (RefSeq UCSC track)
This is exactly the same file downloaded from UCSC, I simply removed the chr prefix because all my bam files do not have the prefix. I made sure all my files are not using the chr prefix, so my fasta reference file also does not have prefix.
#bin name chrom strand txStart txEnd cdsStart cdsEnd exonCount exonStarts exonEnds score name2 cdsStartStat cdsEndStat exonFrames
0 NM_032291 1 + 66999638 67216822 67000041 67208778 25 66999638,67091529,67098752,67101626,67105459,67108492,67109226,67126195,67133212,67136677,67137626,67138963,67142686,67145360,67147551,67154830,67155872,67161116,67184976,67194946,67199430,67205017,67206340,67206954,67208755, 67000051,67091593,67098777,67101698,67105516,67108547,67109402,67126207,67133224,67136702,67137678,67139049,67142779,67145435,67148052,67154958,67155999,67161176,67185088,67195102,67199563,67205220,67206405,67207119,67216822, 0 SGIP1 cmpl cmpl 0,1,2,0,0,0,1,0,0,0,1,2,1,1,1,1,0,1,1,2,2,0,2,1,1,
0 NM_001308203 1 + 66999251 67216822 67000041 67208778 22 66999251,66999928,67091529,67098752,67105459,67108492,67109226,67136677,67137626,67138963,67142686,67145360,67154830,67155872,67160121,67184976,67194946,67199430,67205017,67206340,67206954,67208755, 66999355,67000051,67091593,67098777,67105516,67108547,67109402,67136702,67137678,67139049,67142779,67145435,67154958,67155999,67160187,67185088,67195102,67199563,67205220,67206405,67207119,67216822, 0 SGIP1 cmpl cmpl -1,0,1,2,0,0,1,0,1,2,1,1,1,0,1,1,2,2,0,2,1,1,
1 NM_001301825 1 + 33547778 33586132 33547850 33585783 9 33547778,33549554,33557650,33558882,33560148,33562307,33563607,33583502,33585644, 33547955,33549728,33557823,33559017,33560314,33562470,33563780,33583717,33586132, 0 AZIN2 cmpl cmpl 0,0,0,2,2,0,1,0,2,
1 NM_001080397 1 + 8378144 8404227 8378168 8404073 9 8378144,8384365,8385357,8385877,8390268,8395496,8397875,8399552,8403806, 8378246,8384786,8385450,8386102,8390996,8395650,8398052,8399758,8404227, 0 SLC45A1 cmpl cmpl 0,0,1,1,1,0,1,1,0,
my ExonDel_test contains the list of the bam files I want to process
H521_S2_L001.bam
H926_S3_L001.bam
208210423-ADN-3_S7_L001.bam