Expert manual curation captures a very high level of detail across mutation positions, disease descriptions, and other patient and population data. Manual curation also provides better quality control than systematic approaches. Experienced curators can identify inconsistencies or errors in publications, allowing untrustworthy, incomplete, or nonspecific data sources to be rejected.
We have assembled a list of genes that are somatically mutated and causally implicated in human cancer (Futreal et al., 2004). We call this list the Cancer Gene Census, and it is updated periodically with new genes. From this list we select genes for COSMIC expert curation, with an emphasis on genes for which there are no existing databases. A list of expert-curated genes (also called COSMIC classic genes) can be found at the bottom of this page. The list of expert-curated genes grows with each release, and newly released genes can be found in the release notice alert.
To identify papers reporting somatic mutations, PubMed is searched broadly for relevant mutation data (example search: (ras OR genes, ras) AND human AND mutation). Papers whose abstracts indicate that they include somatic mutation information relating to cancer or pre-cancerous conditions are then selected for curation. After the full text of each paper has been examined, the sample and mutation data are extracted. Papers containing incomplete data (e.g. mutations that are reported but not fully described) or data of insufficient quality (e.g. errors identified in the data) are not fully curated but are added to a list of "additional references containing somatic mutation information".
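The search and triage steps described above can be sketched in Python. This is only an illustration of the decision flow, not the tooling the curators use: in practice the judgements are made by people reading abstracts and full texts. The `Paper` fields, the function names, and the three outcome labels are hypothetical, and the use of the NCBI E-utilities `esearch` endpoint is an assumption (the text says only that PubMed is searched). The sketch builds the search URL but does not send a request.

```python
from dataclasses import dataclass
from urllib.parse import urlencode

# Assumption: the PubMed search is expressed as an NCBI E-utilities esearch call.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_pubmed_query(gene_terms):
    """Join gene synonyms with OR, then require 'human' and 'mutation'."""
    genes = " OR ".join(gene_terms)
    return f"({genes}) AND human AND mutation"


def esearch_url(query, retmax=100):
    """Return an esearch URL for the query (no network request is made here)."""
    params = {"db": "pubmed", "term": query, "retmax": retmax, "retmode": "json"}
    return f"{ESEARCH_URL}?{urlencode(params)}"


@dataclass
class Paper:
    # Hypothetical record of a curator's judgements about one paper.
    pmid: str
    abstract_has_somatic_cancer_data: bool  # judged from the abstract
    mutations_fully_described: bool         # judged from the full text
    data_errors_found: bool                 # judged from the full text


def triage(paper):
    """Mirror the three outcomes described in the text."""
    if not paper.abstract_has_somatic_cancer_data:
        return "not selected"
    if not paper.mutations_fully_described or paper.data_errors_found:
        # Incomplete or low-quality data: listed as a reference, not curated.
        return "additional reference"
    return "fully curated"
```

For example, `build_pubmed_query(["ras", "genes, ras"])` reproduces the example search given above, and a paper whose mutations are reported but not fully described is triaged as an "additional reference" rather than fully curated.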
