University Information Technology Services
Research and Academic Computing

GeneIndex

/**********************************************************************
 Copyright 2003, The Trustees of Indiana University. All rights reserved.
 Original author: Huian Li.
 Last modified by Huian Li, June 29, 2005

Filename:       README
**********************************************************************/
This program reads a sequence file and lists, for each word of a given
length, every location where the word occurs and its frequency.

Input: a word length and a sequence filename.
       The sequence file should be either in FASTA format or
       plain sequence text (consisting of A, C, G, or T).
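
The two accepted input forms can be reduced to one plain sequence before
indexing. The following is a minimal sketch of that step, not code taken
from GeneIndex: the function name extract_sequence is hypothetical, and
skipping lines that begin with '>' (FASTA headers) and folding lowercase
bases to uppercase are assumptions about how such input might be handled.

```c
#include <ctype.h>
#include <stddef.h>

/* Hypothetical helper (not part of GeneIndex).
 * Copies the bases A, C, G, T from `text` into `out`, skipping FASTA
 * header lines (lines starting with '>') and any other characters.
 * Lowercase bases are folded to uppercase (an assumption).
 * `out` is always NUL-terminated when out_cap > 0; bases that do not
 * fit are dropped. Returns the number of bases stored. */
size_t extract_sequence(const char *text, char *out, size_t out_cap)
{
    size_t len = 0;
    int at_line_start = 1;  /* next character begins a new line */
    int in_header = 0;      /* currently inside a '>' header line */

    if (out_cap == 0)
        return 0;

    for (const char *p = text; *p != '\0'; p++) {
        if (*p == '\n') {
            at_line_start = 1;
            in_header = 0;
            continue;
        }
        if (at_line_start && *p == '>')
            in_header = 1;
        at_line_start = 0;
        if (in_header)
            continue;

        int c = toupper((unsigned char)*p);
        if ((c == 'A' || c == 'C' || c == 'G' || c == 'T') &&
            len + 1 < out_cap)
            out[len++] = (char)c;
    }
    out[len] = '\0';
    return len;
}
```

With this sketch, a file containing only sequence lines and a FASTA file
with a header line yield the same string of bases.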


Output: each word that occurs in the sequence, together with all of its
        locations in the sequence and its total number of occurrences.
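
To illustrate what this output contains, here is a minimal brute-force
sketch in C. It is not the GeneIndex implementation: the function names
find_word and list_words, the 0-based positions, the counting of
overlapping occurrences, and the printed layout are all assumptions made
for the example.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical helper (not part of GeneIndex).
 * Stores the 0-based start positions of `word` in `seq` into `pos`
 * (at most max_pos entries) and returns the total number of
 * occurrences. Overlapping occurrences are counted. */
size_t find_word(const char *seq, const char *word,
                 size_t *pos, size_t max_pos)
{
    size_t n = strlen(seq), k = strlen(word), count = 0;

    if (k == 0 || k > n)
        return 0;
    for (size_t i = 0; i + k <= n; i++) {
        if (strncmp(seq + i, word, k) == 0) {
            if (count < max_pos)
                pos[count] = i;
            count++;
        }
    }
    return count;
}

/* Hypothetical helper (not part of GeneIndex).
 * Prints each distinct word of length k once, in order of first
 * occurrence, followed by its positions and its total count.
 * This is a simple quadratic scan, chosen for clarity over speed. */
void list_words(const char *seq, size_t k, FILE *out)
{
    size_t n = strlen(seq);

    if (k == 0 || k > n)
        return;
    for (size_t i = 0; i + k <= n; i++) {
        int seen = 0;

        /* Skip words already reported at an earlier position. */
        for (size_t j = 0; j < i; j++) {
            if (strncmp(seq + j, seq + i, k) == 0) {
                seen = 1;
                break;
            }
        }
        if (seen)
            continue;

        size_t count = 0;
        fprintf(out, "%.*s:", (int)k, seq + i);
        for (size_t j = i; j + k <= n; j++) {
            if (strncmp(seq + j, seq + i, k) == 0) {
                fprintf(out, " %zu", j);
                count++;
            }
        }
        fprintf(out, " (total %zu)\n", count);
    }
}
```

For example, list_words("ACGACGT", 3, stdout) reports the word ACG at
positions 0 and 3 with a total of 2, and each of CGA, GAC, and CGT once.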

Compiling GeneIndex:
====================
* Changes to the Makefile may be required. The default Makefile works on
  an IBM SP machine running the AIX operating system.
  For the serial code, choose a C compiler available on your machine;
  for the parallel code, choose a parallel C compiler that supports MPI.

Running GeneIndex:
==================
* Run "geneindex -h" to see help information.