TY - JOUR
T1 - Gene characterization index
T2 - Assessing the depth of gene annotation
AU - Kemmer, Danielle
AU - Podowski, Raf M.
AU - Yusuf, Dimas
AU - Brumm, Jochen
AU - Cheung, Warren
AU - Wahlestedt, Claes
AU - Lenhard, Boris
AU - Wasserman, Wyeth W.
PY - 2008/1/23
Y1 - 2008/1/23
N2 - Background. We introduce the Gene Characterization Index, a bioinformatics method scoring the extent to which a protein-encoding gene is functionally described. Inherently a reflection of human perception, the Gene Characterization Index is applied for assessing the characterization status of individual genes, thus serving the advancement of both genome annotation and applied genomics research by rapid and unbiased identification of groups of uncharacterized genes for diverse applications such as directed functional studies and delineation of novel drug targets. Methodology/Principal Findings. The scoring procedure is based on a global survey of researchers, who assigned characterization scores from 1 (poor) to 10 (extensive) for a sample of genes based on major online resources. By evaluating the survey as training data, we developed a bioinformatics procedure to assign gene characterization scores to all genes in the human genome. We analyzed snapshots of functional genome annotation over a period of 6 years to assess temporal changes reflected by the increase of the average Gene Characterization Index. Applying the Gene Characterization Index to genes within pharmaceutically relevant classes, we confirmed known drug targets as high-scoring genes and revealed potentially interesting novel targets with low characterization indexes. Removing known drug targets and genes linked to sequence-related patent filings from the entirety of indexed genes, we identified sets of low-scoring genes particularly suited for further experimental investigation. Conclusions/Significance. The Gene Characterization Index is intended to serve as a tool to the scientific community and granting agencies for focusing resources and efforts on unexplored areas of the genome. The Gene Characterization Index is available from.
AB - Background. We introduce the Gene Characterization Index, a bioinformatics method scoring the extent to which a protein-encoding gene is functionally described. Inherently a reflection of human perception, the Gene Characterization Index is applied for assessing the characterization status of individual genes, thus serving the advancement of both genome annotation and applied genomics research by rapid and unbiased identification of groups of uncharacterized genes for diverse applications such as directed functional studies and delineation of novel drug targets. Methodology/Principal Findings. The scoring procedure is based on a global survey of researchers, who assigned characterization scores from 1 (poor) to 10 (extensive) for a sample of genes based on major online resources. By evaluating the survey as training data, we developed a bioinformatics procedure to assign gene characterization scores to all genes in the human genome. We analyzed snapshots of functional genome annotation over a period of 6 years to assess temporal changes reflected by the increase of the average Gene Characterization Index. Applying the Gene Characterization Index to genes within pharmaceutically relevant classes, we confirmed known drug targets as high-scoring genes and revealed potentially interesting novel targets with low characterization indexes. Removing known drug targets and genes linked to sequence-related patent filings from the entirety of indexed genes, we identified sets of low-scoring genes particularly suited for further experimental investigation. Conclusions/Significance. The Gene Characterization Index is intended to serve as a tool to the scientific community and granting agencies for focusing resources and efforts on unexplored areas of the genome. The Gene Characterization Index is available from.
UR - http://www.scopus.com/inward/record.url?scp=44849123353&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=44849123353&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0001440
DO - 10.1371/journal.pone.0001440
M3 - Article
C2 - 18213364
AN - SCOPUS:44849123353
VL - 3
JO - PLoS One
JF - PLoS One
SN - 1932-6203
IS - 1
M1 - e1440
ER -