Given a KG engine, returns a graph representing the diversity of node categories and edge predicates for browsing. The returned graph is guaranteed to contain at least one node of every category, and at least one edge of every predicate. No other guarantees are made: the example graph is not minimal to satisfy these criteria, it is not random or even pseudo-random, and it may not be connected.
example_graph(engine, ...)
A tbl_kgx graph
# Using example KGX file packaged with monarchr
data(eds_marfan_kg)
# prints a readable summary and returns a list of dataframes
g <- eds_marfan_kg |> example_graph()
print(g)
#> # A tbl_graph: 12 nodes and 10 edges
#> #
#> # A directed acyclic multigraph with 3 components
#> #
#> # Node Data: 12 × 16 (active)
#> id pcategory name symbol in_taxon_label description synonym category
#> <chr> <chr> <chr> <chr> <chr> <chr> <list> <list>
#> 1 MONDO:000… biolink:… Marf… NA NA A disorder… <chr> <chr>
#> 2 MONDO:002… biolink:… Ehle… NA NA The Ehlers… <chr> <chr>
#> 3 MONDO:000… biolink:… Ehle… NA NA Ehlers-Dan… <chr> <chr>
#> 4 MONDO:001… biolink:… Ehle… NA NA A form of … <chr> <chr>
#> 5 ZFIN:ZDB-… biolink:… b3ga… NA Danio rerio NA <chr> <chr>
#> 6 HGNC:3603 biolink:… FBN1 FBN1 Homo sapiens NA <chr> <chr>
#> 7 HP:0000974 biolink:… Hype… NA NA A conditio… <chr> <chr>
#> 8 HP:0000007 biolink:… Auto… NA NA A mode of … <chr> <chr>
#> 9 MONDO:002… biolink:… rare NA NA A disease … <chr> <chr>
#> 10 CHEBI:508… biolink:… doxy… NA NA Tetracycli… <chr> <chr>
#> 11 CLINVAR:2… biolink:… NM_0… NA Homo sapiens NA <chr> <chr>
#> 12 CLINVAR:2… biolink:… NM_0… NA Homo sapiens NA <chr> <chr>
#> # ℹ 8 more variables: iri <chr>, xref <list>, namespace <chr>,
#> # provided_by <chr>, in_taxon <chr>, full_name <chr>, type <list>,
#> # has_gene <chr>
#> #
#> # Edge Data: 10 × 25
#> from to subject predicate object primary_knowledge_so…¹ agent_type
#> <int> <int> <chr> <chr> <chr> <chr> <chr>
#> 1 11 1 CLINVAR:200198 biolink:a… MONDO… infores:clingen manual_ag…
#> 2 6 1 HGNC:3603 biolink:c… MONDO… infores:omim manual_ag…
#> 3 6 1 HGNC:3603 biolink:g… MONDO… infores:orphanet manual_ag…
#> # ℹ 7 more rows
#> # ℹ abbreviated name: ¹primary_knowledge_source
#> # ℹ 18 more variables: knowledge_level <chr>, knowledge_source <chr>,
#> # aggregator_knowledge_source <chr>, provided_by <chr>, id <chr>,
#> # category <chr>, original_object <chr>, original_subject <chr>,
#> # frequency_qualifier <chr>, has_evidence <chr>, has_total <dbl>,
#> # has_quotient <dbl>, has_count <dbl>, has_percentage <dbl>, …
# prints a readable summary and returns a list of dataframes
g <- monarch_engine() |> example_graph()
#> Trying to connect to https://neo4j.monarchinitiative.org
#> Connected to https://neo4j.monarchinitiative.org
print(g)
#> # A tbl_graph: 79 nodes and 44 edges
#> #
#> # A directed acyclic simple graph with 37 components
#> #
#> # Node Data: 79 × 22 (active)
#> id category pcategory name symbol in_taxon_label description synonym
#> <chr> <list> <chr> <chr> <chr> <chr> <chr> <list>
#> 1 WB:WBGene… <chr> biolink:… abc-1 abc-1 Caenorhabditi… NA <lgl>
#> 2 GO:0051306 <chr> biolink:… mito… NA NA The proces… <chr>
#> 3 NCBIGene:… <chr> biolink:… CRAB… CRABP2 Bos taurus cellular r… <lgl>
#> 4 FB:FBgn00… <chr> biolink:… fabp fabp Drosophila me… NA <chr>
#> 5 dictyBase… <chr> biolink:… DDB_… DDB_G… Dictyostelium… NA <lgl>
#> 6 GO:0070876 <chr> biolink:… SOSS… NA NA A protein … <lgl>
#> 7 GO:0001556 <chr> biolink:… oocy… NA NA A developm… <lgl>
#> 8 WB:WBGene… <chr> biolink:… acy-4 acy-4 Caenorhabditi… NA <chr>
#> 9 ZFIN:ZDB-… <chr> biolink:… klf2… NA Danio rerio NA <lgl>
#> 10 ZFIN:ZDB-… <chr> biolink:… bns12 NA Danio rerio NA <lgl>
#> # ℹ 69 more rows
#> # ℹ 14 more variables: full_name <chr>, xref <list>, in_taxon <chr>,
#> # namespace <chr>, provided_by <chr>, type <list>, iri <chr>,
#> # related_synonyms <chr>, exact_synonyms <chr>, subsets <chr>,
#> # narrow_synonyms <chr>, has_gene <chr>, broad_synonyms <chr>,
#> # deprecated <lgl>
#> #
#> # Edge Data: 44 × 23
#> from to subject predicate object primary_knowledge_so…¹ knowledge_level
#> <int> <int> <chr> <chr> <chr> <chr> <chr>
#> 1 1 2 WB:WBGene… biolink:… GO:00… infores:wb knowledge_asse…
#> 2 3 4 NCBIGene:… biolink:… FB:FB… infores:panther knowledge_asse…
#> 3 5 6 dictyBase… biolink:… GO:00… infores:go-central knowledge_asse…
#> # ℹ 41 more rows
#> # ℹ abbreviated name: ¹primary_knowledge_source
#> # ℹ 16 more variables: negated <lgl>, species_context_qualifier <chr>,
#> # agent_type <chr>, aggregator_knowledge_source <chr>, has_evidence <chr>,
#> # provided_by <chr>, id <chr>, category <list>, publications <list>,
#> # qualifier <chr>, original_predicate <chr>, qualifiers <list>,
#> # disease_context_qualifier <chr>, original_object <chr>, …