Rare Disease analysis in Mondo

To answer the question of ‘How many rare diseases are there?’ we analyzed terms in Mondo to get a total count of Rare Diseases as defined in Mondo Disease Ontology (Mondo).

Methods

This analysis was performed on the Mondo 2019-09-30 release.

1. Get all ‘Disease’ terms from Mondo

First we get all the terms in Mondo that are a descendants of MONDO:0000001 'Disease'.

There are 21633 Mondo disease terms.

2. Identify terms that are ‘rare’

Any disease term in Mondo is considered rare if the term, or its ancestor, has modifier MONDO:0021136 'Rare' in the ontology.

There are 12970 Mondo rare disease terms.

3. Consider terms in ‘gard_rare’ subset

There are 3176 Mondo disease terms that are in gard_rare subset that are yet to be treated as ‘rare’.

We add these classes to our set of Mondo rare disease terms.

This increases the Mondo rare disease term count to 13922.

4. Filter terms that are descendants of ‘disease susceptibility’

We then to filter out terms that are descendants of MONDO:0042489 'disease susceptibility', to avoid counting ambiguous terms that are related to disease susceptibility and not the actual disease itself.

This gives us a list of 13866 Mondo rare disease terms.

But for this analysis, we are interested in terms that are both rare and are leaf nodes in the ontology.

After considering only leaf nodes, we get 10394 as the final count of Mondo rare disease terms.

Results

all-mondo-disease-terms.tsv: As part of our analysis, we generated a TSV containing 21633 Mondo disease terms, each with annotations that signifies whether the term is a rare disease term and whether that term is a leaf node in the ontology.