Skip to content

Can't run ExonDel.. please help #10

@duartemolha

Description

@duartemolha

Hi

I was wondering if you could help me because I have done everything on the install instructions but ExonDel still fails.

I have all required perl modules and both R and samtools are on my path.

The command I am using is:

perl  ~/Programs/ExonDel-master/ExonDel.pl -i ExonDel_test.txt -o /home/user/ND-34/ExonDelResults -c /home/user/ND-34/ExonDel_cfg.txt  -g /home/user/ND-34/gene_list.txt

here is the output from the run command

[Mon Nov 30 10:41:59 2015] Only the genes in /home/user/Tasks/ND-34/gene_list.txt will be used
[Mon Nov 30 10:41:59 2015] GC adjustment will not be performed, and the constant cutoffs in config file will be used
[Mon Nov 30 10:41:59 2015] If your bed file has fourth column, it should be gene name and can be matched to genes in home/user/Tasks/ND-34/gene_list.txt
[Mon Nov 30 10:41:59 2015] Loading BED file
Use of uninitialized value $in in open at /home/user/Programs/ExonDel-master/ExonDel.pl line 497, <GENELIST> line 7.

here are the content of each file:

ExonDel_cgf.txt

#reference .bed file
bedfile=/home/user/ND-34/genes.bed
#reference .gtf file
refseq=/home/user/ND-34/refseq_no_chr_prefix.txt
#reference .fa file
reffa=/home/user/ND-34/hg19.fa
#config file for ExonDel software
[perl]
#Minimal percent of covered base pairs for each exon
exon_bp_cover_threshold=0.1
#Minimal percent of covered exons for each gene. 1 means 100%
overall_exon_count_threshold=1
#Minimum mapping quality for an alignment to be used
mapQ=20
#Minimum base quality for a base to be considered
baseQ=20
#where the R bin file is
RBin=R
#where the SAMtools bin file is
samtoolsBin=samtools
#name of the log file
logFileName=ExonDel.log
[R]
#Maximal number of exons for the moving-window in exon deletion detection
maxWinLength=9
#Minimum number of exons for the moving-window in exon deletion detection
minWinLength=1
#Minimum number of exons for a gene to be considered in exon deletion detection
minExonNum=10
#T means True and F means False. If only some genes were used (use -g option), GC adjustment will not be performed no matter what adjustGC below was set (As GC adjustment is based on all genes).
adjustGC=F
#If all genes were used (didn't use -g option), cutoff 1,2 will be determined by cutoffQuantile 1,2 in all genes
cutoffQuantile1=0.01
cutoffQuantile2=0.1
#If only some genes were used (use -g option), cutoff 1,2 will be determined by constant cutoff 1,2 below
cutoff1=2
cutoff2=20

here is the list of genes in gene_list.txt

BRCA1
BRCA2
P53
PTEN
ATM
NF1
ATR

here is the list of genes in genes.bed

10  89623194    89731687    PTEN
11  108093558   108239826   ATM
13  32889616    32973809    BRCA2
17  29421944    29704695    NF1
17  41196311    41277500    BRCA1
3   142168076   142297668   ATR

Here are the first few lines of the refseq_no_chr_prefix.txt (RefSeq UCSC track)
This is exactly the same file downloaded from UCSC, I simply removed the chr prefix because all my bam files do not have the prefix. I made sure all my files are not using the chr prefix, so my fasta reference file also does not have prefix.

#bin    name    chrom   strand  txStart txEnd   cdsStart    cdsEnd  exonCount   exonStarts  exonEnds    score   name2   cdsStartStat    cdsEndStat  exonFrames
0   NM_032291   1   +   66999638    67216822    67000041    67208778    25  66999638,67091529,67098752,67101626,67105459,67108492,67109226,67126195,67133212,67136677,67137626,67138963,67142686,67145360,67147551,67154830,67155872,67161116,67184976,67194946,67199430,67205017,67206340,67206954,67208755,   67000051,67091593,67098777,67101698,67105516,67108547,67109402,67126207,67133224,67136702,67137678,67139049,67142779,67145435,67148052,67154958,67155999,67161176,67185088,67195102,67199563,67205220,67206405,67207119,67216822,   0   SGIP1   cmpl    cmpl    0,1,2,0,0,0,1,0,0,0,1,2,1,1,1,1,0,1,1,2,2,0,2,1,1,
0   NM_001308203    1   +   66999251    67216822    67000041    67208778    22  66999251,66999928,67091529,67098752,67105459,67108492,67109226,67136677,67137626,67138963,67142686,67145360,67154830,67155872,67160121,67184976,67194946,67199430,67205017,67206340,67206954,67208755,  66999355,67000051,67091593,67098777,67105516,67108547,67109402,67136702,67137678,67139049,67142779,67145435,67154958,67155999,67160187,67185088,67195102,67199563,67205220,67206405,67207119,67216822,  0   SGIP1   cmpl    cmpl    -1,0,1,2,0,0,1,0,1,2,1,1,1,0,1,1,2,2,0,2,1,1,
1   NM_001301825    1   +   33547778    33586132    33547850    33585783    9   33547778,33549554,33557650,33558882,33560148,33562307,33563607,33583502,33585644,   33547955,33549728,33557823,33559017,33560314,33562470,33563780,33583717,33586132,   0   AZIN2   cmpl    cmpl    0,0,0,2,2,0,1,0,2,
1   NM_001080397    1   +   8378144 8404227 8378168 8404073 9   8378144,8384365,8385357,8385877,8390268,8395496,8397875,8399552,8403806,    8378246,8384786,8385450,8386102,8390996,8395650,8398052,8399758,8404227,    0   SLC45A1 cmpl    cmpl    0,0,1,1,1,0,1,1,0,

my ExonDel_test contains the list of the bam files I want to process

H521_S2_L001.bam
H926_S3_L001.bam
208210423-ADN-3_S7_L001.bam

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions