ParPEST: a pipeline for EST data analysis based on parallel computing
2005

ParPEST: A Pipeline for Analyzing EST Data

publication Evidence: high

Author Information

Author(s): D'Agostino Nunzio, Aversano Mario, Chiusano Maria Luisa

Primary Institution: Department of Structural and Functional Biology, University 'Federico II', Naples, Italy

Hypothesis

Can a parallel computing pipeline improve the analysis of expressed sequence tags (ESTs)?

Conclusion

The ParPEST pipeline efficiently analyzes EST data and provides a curated information set using a relational database.

Supporting Evidence

  • ParPEST integrates multiple steps of EST processing into a single pipeline.
  • The pipeline is designed to run on low-cost hardware and is scalable.
  • It allows for interactive browsing of results through a web interface.

Takeaway

ParPEST is a tool that helps scientists quickly analyze DNA sequences called ESTs by using many computers at once.

Methodology

The pipeline processes raw EST data, clusters sequences, assembles them into contigs, and performs functional annotation using parallel computing.

Limitations

The only limiting factor for execution is the memory space required for database storage.

Digital Object Identifier (DOI)

10.1186/1471-2105-6-S4-S9

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication