An examination of the OMIM database for associating mutation to a consensus reference sequence.

Author: Zuofeng LI ¹ ; Beili YING ; Xingnan LIU ; Xiaoyan ZHANG ; Hong YU
Author Information

1. Shanghai Center for Bioinformation Technology, Shanghai 200235, China. lizuofeng@gmail.com
Publication Type:Journal Article
MeSH: Amino Acid Sequence; Consensus Sequence; Databases, Genetic; Molecular Sequence Data; Point Mutation; Sequence Alignment
From: Protein & Cell 2012;3(3):198-203
CountryChina
Language:English
Abstract: Gene mutation (e.g. substitution, insertion and deletion) and related phenotype information are important biomedical knowledge. Many biomedical databases (e.g. OMIM) incorporate such data. However, few studies have examined the quality of this data. In the current study, we examined the quality of protein single-point mutations in the OMIM and identified whether the corresponding reference sequences align with the mutation positions. Our results show that close to 20% of mutation data cannot be mapped to a single reference sequence. The failed mappings are caused by position conflict, site shifting (peptide, N-terminal methionine) and other types of data error. We propose a preliminary model to resolve such inconsistency in the OMIM database.