Motivation

Over the years the technology for sequencing of genomic material have advanced greatly. The time and cost of sequencing has decreased while the amount of data produced by sequencing has increased. This trend in decrease in time and cost, while increasing in data has created a greater need for mining algorithms to make sense of the data. Without data mining the data does not yield much, if any useful information.

 

The goal of this project was to identify coexpression of genes and classify organs from tissue samples using data compiled from from the lamprey genome and data mining techinques.