ParPEST: A Pipeline for Analyzing EST Data
Author Information
Author(s): D'Agostino Nunzio, Aversano Mario, Chiusano Maria Luisa
Primary Institution: Department of Structural and Functional Biology, University 'Federico II', Naples, Italy
Hypothesis
Can a parallel computing pipeline improve the analysis of expressed sequence tags (ESTs)?
Conclusion
The ParPEST pipeline efficiently analyzes EST data and provides a curated information set using a relational database.
Supporting Evidence
- ParPEST integrates multiple steps of EST processing into a single pipeline.
- The pipeline is designed to run on low-cost hardware and is scalable.
- It allows for interactive browsing of results through a web interface.
Takeaway
ParPEST is a tool that helps scientists quickly analyze DNA sequences called ESTs by using many computers at once.
Methodology
The pipeline processes raw EST data, clusters sequences, assembles them into contigs, and performs functional annotation using parallel computing.
Limitations
The only limiting factor for execution is the memory space required for database storage.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website