Entity Identification in Molecular Biology Using a Stochastic Tagger

Sample size: 5000 publication 10 minutes Evidence: moderate

Author Information

Author(s): Kinoshita Shuhei, Cohen K Bretonnel, Ogren Philip V, Hunter Lawrence

Primary Institution: Center for Computational Pharmacology, University of Colorado School of Medicine

Can a part-of-speech tagger be effectively used for entity identification in molecular biology?

A part-of-speech tagger can be enhanced with post-processing rules to create a competitive entity identification system.

The researchers used a statistical part-of-speech tagger to find gene names in scientific texts, then improved its output by adding rules that correct its mistakes.

The study used a stochastic part-of-speech tagger with post-processing rules to identify gene mentions in biomedical literature.
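The two-stage idea described above (tagger output followed by rule-based correction) can be illustrated with a minimal sketch. It assumes the tagger has already produced (token, tag) pairs and that gene tokens carry a tag named GENE. The tag name, the word list, and both rules are hypothetical and chosen only for illustration; they are not the rules used in the study.

```python
from typing import List, Tuple

TaggedToken = Tuple[str, str]  # (token, tag) as a tagger might emit them
Span = Tuple[int, int]         # (start, end) token indices, end exclusive

GENE_TAG = "GENE"  # hypothetical tag name for gene-mention tokens
# Hypothetical list of words that are unlikely to be a gene name on their own.
GENERIC_WORDS = {"gene", "protein", "receptor", "kinase"}


def extract_mentions(tagged: List[TaggedToken]) -> List[Span]:
    """Group consecutive GENE-tagged tokens into candidate mention spans."""
    spans: List[Span] = []
    start = None
    for i, (_, tag) in enumerate(tagged):
        if tag == GENE_TAG:
            if start is None:
                start = i
        elif start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(tagged)))
    return spans


def post_process(tagged: List[TaggedToken], spans: List[Span]) -> List[Span]:
    """Apply two illustrative correction rules to the candidate spans."""
    fixed: List[Span] = []
    for start, end in spans:
        tokens = [tok for tok, _ in tagged[start:end]]
        # Rule 1 (hypothetical): drop a one-word mention that is a generic term.
        if len(tokens) == 1 and tokens[0].lower() in GENERIC_WORDS:
            continue
        # Rule 2 (hypothetical): if the mention opens a parenthesis and the
        # next token closes it, extend the mention to include that token.
        unbalanced = tokens.count("(") > tokens.count(")")
        if unbalanced and end < len(tagged) and tagged[end][0] == ")":
            end += 1
        fixed.append((start, end))
    return fixed


# Made-up tagger output for one sentence.
sample = [
    ("The", "DT"), ("protein", "GENE"), ("binds", "VBZ"),
    ("p53", "GENE"), ("(", "GENE"), ("TP53", "GENE"), (")", "SYM"), (".", "."),
]
candidates = extract_mentions(sample)
corrected = post_process(sample, candidates)
```

In this made-up sentence the generic one-word mention "protein" is discarded and the mention starting at "p53" is extended to take in the closing parenthesis. Keeping the rules as a separate pass means they can be changed without retraining the tagger.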

The system's performance may be influenced by the specific training data used.

The study did not rigorously compare the performance of different taggers.

p<0.05
