can span tens of megabases and contain potentially hundreds of genes. Furthermore, low replication rates and identification of non-functional markers in most studies makes the search for true genetic signals difficult [9-11]. While there are issues with data reduction or summarization, integration at the level of the gene can be used as a link across a number of commonly used approaches.