Skip to content

universalizer

Details

GitHub Knowledge-Graph-Hub/universalizer
Language Python
Description The KG-Hub Universalizer provides functions for knowledge graph cleanup and identifier normalization.

Dependencies

External Dependencies

Package Version
python >=3.9,<3.12
click ^8.1.3
kgx >=2.0.0
oaklib >=0.5.10
prefixmaps >=0.1.5

Documentation

universalizer

The KG-Hub Universalizer provides functions for knowledge graph cleanup and identifier normalization.

Installation

Install with pip:

pip install universalizer

OR

Install with Poetry.

git clone https://github.com/Knowledge-Graph-Hub/universalizer.git
cd universalizer
poetry install

Usage

With KGX format node and edge files in the same directory:

universalizer run path/to/directory

Or, if they're in a single tar.gz file:

universalizer run -c graph.tar.gz

ID and category mapping

SSSOM-format maps are supported. Use a single map file:

univeralizer run -m poro-mp-exact-1.0.sssom.tsv path/to/directory

or a whole directory of them:

univeralizer run -m path/to/mapfiles path/to/directory

To map node categories as well as identifiers, use the -u flag:

univeralizer run -m path/to/mapfiles path/to/directory -u

For SSSOM maps from subject_id to object_id, subject node IDs will be remapped to object IDs.

If the object_category value is specified the node's category ID will be remapped as well.

Note that this will complete node normalization and ID remapping.

Maps should use the normalized form (e.g., specify "FBbt:00005201", not "FBBT:00005201", even if the latter form is in the input graph.)