FEBS Letters
Volume 582, Issue 8 , Pages 1171-1177, 9 April 2008

Linking entries in protein interaction database to structured text: The FEBS Letters experiment

  • Arnaud Ceol

      Affiliations

    • Department of Biology, University of Rome, Tor Vergata, Rome, Italy
  • ,
  • Andrew Chatr-Aryamontri

      Affiliations

    • Department of Biology, University of Rome, Tor Vergata, Rome, Italy
  • ,
  • Luana Licata

      Affiliations

    • Department of Biology, University of Rome, Tor Vergata, Rome, Italy
  • ,
  • Gianni Cesareni

      Affiliations

    • Department of Biology, University of Rome, Tor Vergata, Rome, Italy
    • IRCCS Fondazione Santa Lucia, 00143 Rome, Italy
    • Corresponding Author InformationCorresponding author. Address: Department of Biology, University of Rome, Tor Vergata, Rome, Italy.

published online 06 March 2008.

Abstract 

The corpus of the scientific literature has reached such size that a lot of useful data, dispersed throughout millions different articles, are now hard to recover. For instance, many articles in the biological domain describe relationships between entities (gene, proteins, small molecules, etc.) yet this crucial information cannot be efficiently used because of the difficulties in retrieving it automatically from unstructured text. Databases are striving to capture this valuable information and to organize it in a structured format ready for automatic analysis. However, the current database model, based on manual curation, is not sustainable because the limited support is not compatible with complete and accurate coverage of published information. Several proposals have been put forward to increase the efficiency and accuracy of the curation process. Here we present an experiment, designed by the editorial board of FEBS Letters, aimed at integrating each manuscript with a structured summary precisely reporting, with database identifiers and predefined controlled vocabularies, the protein interactions reported in the manuscript. The authors play an important role in this process as they are requested to provide structured information to be appended, in the form of human-readable paragraphs, at the end of traditional summaries. It is envisaged that the structured text will become an integral part of Medline abstracts. In 6 months time the experience gained with this experiment will form the basis for a community discussion to propose a widely accepted strategy for information storage and retrieval.

Keywords: Protein interaction, Database, Information extraction, Network

To access this article, please choose from the options below

Login to an existing account or Register a new account.

  • Purchase this article for 31.50 USD (You must login/register to purchase this article)

    Online access for 24 hours. The PDF version can be downloaded as your permanent record.

  • Subscribe to this title

    Get unlimited online access to this article and all other articles in this title 24/7 for one year.

  • Claim access now

    For current subscribers with Society Membership or Account Number.

  • Visit SciVerse ScienceDirect to see if you have access via your institution.
 

PII: S0014-5793(08)00196-8

doi:10.1016/j.febslet.2008.02.071

FEBS Letters
Volume 582, Issue 8 , Pages 1171-1177, 9 April 2008