You must read TermsOfUsage before installing or using any of this software.

You should obtain the user's guide from the CDSplotcon webpage, for more 
  information.


INSTALLATION:

First unzip and unpack the distribution file:

gunzip CDSplotcon.tar.gz 
tar xvf CDSplotcon.tar 
cd CDSplotcon

This will give you the files:

aa2codon.dat  EXAMPLE1/   mlrgd.R         README           runmean.cxx
aa.dat        EXAMPLE2/   ntadjust.cxx    redo_mlrgd       run_mlrgd
addgaps.cxx   minmax.cxx  ntunadjust.cxx  redo_plots       TermsOfUsage
codon.dat     mlrgd.cxx   nuc.dat         runclipmean.cxx

Now compile the programmes (replace 'g++' by an appropriate alternative, 
  e.g. 'c++' or 'gcc', if you're using a different C++ compiler):

g++ -o addgaps addgaps.cxx 
g++ -o minmax minmax.cxx
g++ -o mlrgd mlrgd.cxx
g++ -o ntadjust ntadjust.cxx
g++ -o runclipmean runclipmean.cxx
g++ -o runmean runmean.cxx

Then copy these programmes to your ~/bin directory:

mkdir ~/bin
cp addgaps     ~/bin
cp minmax      ~/bin
cp mlrgd       ~/bin
cp ntadjust    ~/bin
cp runclipmean ~/bin
cp runmean     ~/bin

Also make sure that run_mlrgd, redo_mlrgd and redo_plots are executable:

chmod u+x run_mlrgd redo_mlrgd redo_plots


REQUIRED SOFTWARE:

You will also need to download and install the following software if you don't 
already have it:

code2aln     Sequence alignment programme.
             Obtain from http://www.tbi.univie.ac.at/~roman/Code2aln/

seqret       Part of the EMBOSS package.
seqretsplit  Obtain from http://emboss.sourceforge.net/
infoseq 
degapseq 
               
ednaml       Phylogeny programmes in the PHYLIP package.
ednapars     Repackaged as part of the EMBASSY package.  
             Obtain from http://emboss.sourceforge.net/

R            Statistics and graphics package.
             Obtain from http://www.r-project.org/


RUNNING CDSplotcon:

Use the script run_mlrgd.  Open run_mlrgd in a text-editor and edit the 
parameters in the 'Options' section if you like.  Make sure run_mlrgd, 
aa2codon.dat, aa.dat, codon.dat and nuc.dat (and redo_mlrgd, redo_plots, 
if required) are in the working directory, together with the input sequences,
sequence list and ORF annotation.  Then type

./run_mlrgd 'prefix'

where 'prefix'.seqs is the name of the sequences list.


EXAMPLES:

1) Hepatitis B Virus example.

cd EXAMPLE1
./run_mlrgd hbv

Note that a few of the parameters in EXAMPLE1/run_mlrgd differ from those in 
the default run_mlrgd - e.g. the 'circular' genome option is switched on. 

The output file 'hbv.fcm.eps' (view with e.g. 'ggv hbv.fcm.eps', or use
'epstopdf hbv.fcm.eps' then 'xpdf hbv.fcm.pdf') contains the conservation 
plots for the full-coding model.  See the user's guide for details of all the 
other output files.

2) Barley Yellow Dwarf Virus example.

cd EXAMPLE2
./run_mlrgd bydv


Note for MacOS-X users:

I've noticed that MacOS-X tends to have a system-wide maximum
stacksize limit of 65536 kB.  To avoid exceeding this limit (resulting
in a 'Segmentation Fault' error), you may need to edit the lines

#define maxlength 50000
#define maxpairs 200

to 

#define maxlength 10000
#define maxpairs 20

in mlrgd.cxx before compiling.  Unfortunately this will limit the
maximum length of input sequences to 10000 nt and maximum number of
input sequences to 10.