Automating Genomic Data Mining via a Sequence-based Matrix Format and Associative Rule Set
2005

Automating Genomic Data Mining

publication Evidence: moderate

Author Information

Author(s): Wren Jonathan D, Johnson David, Gruenwald Le

Primary Institution: The University of Oklahoma

Hypothesis

Can a method be developed to automate the exploration of genomic data for correlations?

Conclusion

The study successfully developed a prototype that detects coinciding genomic features among various elements.

Supporting Evidence

  • The prototype was successful in detecting genomic features.
  • The study highlights the need for automated data mining in genomics.
  • Integration of diverse data types is crucial for effective analysis.

Takeaway

This study created a new way to look at genetic data that helps find patterns and connections automatically, like a smart detective for DNA.

Methodology

A sequence matrix was developed to integrate and compare different genomic annotation sources, and a classification tree was created to guide analyses.

Limitations

The system's performance is limited by the amount of memory available for analysis and the complexity of integrating various data types.

Digital Object Identifier (DOI)

10.1186/1471-2105-6-S2-S2

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication