Introduction

TRACTS is a program designed to detect and analyze binary tracts in a DNA sequence. Binary DNA tracts are nucleotide base sequences composed of two nucleotide bases only. Six two-base compositions are possible, and can be organized in two pairs plus a semi pair:

·        oligopurine.oligopyrimidine or R.Y tracts;

·        Keto.imino - G,T on one strand, A,C on the complimentary one - or K.M tracts;

·        Strong (S, - G,C) or weak (W - A,T) tracts; together - S;W tracts.  

The R, Y, K, M, W and S letters are the IUBMB (International Union of Biochemistry and Molecular Biology) specified letters for binary base combinations. A dot (.) is the IUBMB convention for separating bases on complimentary strands. The comma is the IUBMB convention for mixed bases on the same strand  (e.g. GGTTTG). The semicolon is the IUBMB convention for separating autocomplimentary oligonucleotides (For dot, comma and semicolon see reference [1] in the IUBMB recommendation). R.Y and K.M tracts can be run jointly because usually these tracts distribute evenly between strands. S;W are better run separately, as the autocomplimentary Weak and Strong sequences tend to behave quite differently.  An option to run also the other members separately is planned.

The main input file to the program is an annotated sequence in flat DDBJ/EMBL/GenBank format. This input file can be of a gene, a clone, contig, scaffold, chromosome etc. The user can insert the annotation part and the sequence part separately, as detailed below. The user can choose between the “mRNA” and  “CDS” feature keys. In the last case, noncoding exons will be considered part of the intergenic region (i.e. UTR regions will not be identified) and is the preferred key when mRNA data are available. Several additional parameters and options to be described have to be entered.

The program performs the following operations:





l