Tips for accurate curation
This document explains the intended use of the fields of the template and provides guidance on common problems.
PMID:
Please use the form PMID:38143450
. Please remove the space, if any, between the colon and the number.
Indentifier:
Please use the identifier used in the original publication. If the original publication has a Table, it is usually obvious what identifier to use. In other cases, no explicit identifier is given. Try to extract a short identifier that will be obvious to others who might look at your curation later on. For instance, if the paper is a case report that states this:
Example
Here we present the case of a 28-year-old man with X-linked immunodeficiency with magnesium defect, Epstein–Barr virus (EBV) infection and neoplasia (XMEN) disease.
Then use “28-year-old man” as the identifier (see PMID:38143450).
Disease id and name
Please use the OMIM identifier. This can be taken from the OMIM webpage for the disease. For instance, OMIM:300853 shows Phenotype MIM number 300853. Please enter this as OMIM:300853
. The Phenotype
(i.e., disease) is shown in the table as Immunodeficiency, X-linked, with magnesium defect, Epstein-Barr virus infection and neoplasia
. Please use this label. This will make it much easier for software to correctly identify the disease that is being curated.
Comment
Sometimes it is hard to curate information and it is a good idea to enter some additional information about the case to help others understand the curation. Please use the comment
field for this. The following box shows an example of the kind of comment that might be useful.
Example
incl Table 1 with many neurological symptoms in these two patients + Table 2 with neurological symptoms in pther XMEN patients
Transcript
It can be difficult to choose the optimal transcript, and articles about one and the same disease may even use different transcripts. We try to use the transcript that is preferred by ClinVar. See Varint notation for explanations.
We recommend using VariantValidator to check the variant identifiers used in original publications.
Note that incorrect variant nomenclature is still common in the literature. For instance, c.C414A
should be c.414C>A
.
HPO Term columns
Please use only HPO terms for these columns. Do not use terms from other ontologies or terminologies such as OMIM, ORPHA, or other sources.
Please use exactly the original HPO term label, e.g., do not use the words they used in the original publication, instead use the label found on the HPO website
For instance, if the original publication states recurrent bronchopulmonary infection
, go to the HPO website to determine the corresponding HPO term, which in this case is Recurrent bronchopulmonary infections. If hypogammaglobulinemia
is used in the original publication, choose Decreased circulating antibody level.
This will make the curation unambiguous and reduce potential errors.
New terms
The HPO is still incomplete. If you find a phenotypic abnormality in a publication but cannot find a corresponding HPO term, enter the label for
the term in the first row and NTR
(new term request) in the second row. The HPO team will help to decide if there is an existing term for the abnormality or will create a new term. Following this, the Excel file can be updated.
Scope
The HPO describes phenotypic abnormalities in a clinical context. It does not cover topics such as molecular mechanisms. Terms such as “loss-of function” are not in scope.