graphit.graph_io package¶

graphit.graph_io.io_adl_format module¶

Reading and writing graphs as adjacency lists (.adl)

Adjacency lists are a simple textual representation of node identifiers and their linkage (adjacency) to one another.

The graph with edges a-b, a-c, d-e can be represented as the following adjacency list (anything following the # in a line is a comment):

a b c # source target target
d e
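To illustrate the format only, the following minimal sketch parses adjacency-list text into a plain dictionary of neighbour sets. It does not use graphit: the function name parse_adl_text and the undirected interpretation of every link are assumptions made for this example.

```python
def parse_adl_text(text):
    """Parse adjacency-list text into {node: set(neighbours)} (undirected)."""
    adjacency = {}
    for line in text.splitlines():
        # Drop everything after '#' (comment) and surrounding whitespace
        line = line.split('#', 1)[0].strip()
        if not line:
            continue
        # First identifier is the source, the remainder are its targets
        source, *targets = line.split()
        adjacency.setdefault(source, set())
        for target in targets:
            # Undirected assumption: register the link in both directions
            adjacency[source].add(target)
            adjacency.setdefault(target, set()).add(source)
    return adjacency


example = "a b c # source target target\nd e\n"
adjacency = parse_adl_text(example)
```

Here adjacency maps 'a' to the set containing 'b' and 'c', and 'd' to the set containing 'e', matching the edges a-b, a-c and d-e.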

graphit.graph_io.io_adl_format.read_adl(adl_file, graph=None)¶

Construct a graph from an adjacency list (ADL)

Note

The directionality of the graph is not defined explicitly in the adjacency list and thus depends on the graph.directional attribute, which is False (undirected) by default.

Parameters

adl_file (File, string, stream or URL) – ADL graph data.
graph (:graphit:Graph) – Graph object to import ADL data in

Returns

Graph object

Return type

:graphit:Graph

graphit.graph_io.io_adl_format.write_adl(graph)¶

Export graph as adjacency list (ADL)

Note

This format does not store graph, node, or edge data.

Parameters: graph (:graphit:Graph) – Graph object to export
Returns: Graph exported as adjacency list
Return type: :py:str
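The export direction can be sketched in a few lines. This is an illustrative stand-in, not the write_adl implementation: the helper name to_adl_text and the use of plain (source, target) pairs as input are assumptions. As the note above states, only identifiers are written and no node or edge data.

```python
def to_adl_text(edges):
    """Serialize (source, target) pairs as adjacency-list lines."""
    targets_by_source = {}
    for source, target in edges:
        # Group all targets under their source, preserving input order
        targets_by_source.setdefault(source, []).append(target)
    lines = [' '.join([source] + targets)
             for source, targets in targets_by_source.items()]
    return '\n'.join(lines)


adl_text = to_adl_text([('a', 'b'), ('a', 'c'), ('d', 'e')])
```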

graphit.graph_io.io_cwl_format module¶

Functions for importing data structures in Common Workflow Language format.

The Common Workflow Language (CWL) is a specification for describing analysis workflows and tools in a way that makes them portable and scalable across a variety of software and hardware environments, from workstations to cluster, cloud, and high performance computing (HPC) environments.

CWL data structures are stored in JSON or YAML format. The graphit CWL parser supports syntax version 1.0.2 as described here:

https://www.commonwl.org/v1.0/

Citation: Peter Amstutz, Michael R. Crusoe, Nebojša Tijanić (editors), Brad Chapman, John Chilton, Michael Heuer, Andrey Kartashov, Dan Leehr, Hervé Ménager, Maya Nedeljkovich, Matt Scales, Stian Soiland-Reyes, Luka Stojanovic (2016): Common Workflow Language, v1.0. Specification, Common Workflow Language working group. https://w3id.org/cwl/v1.0/ doi:10.6084/m9.figshare.3115156.v2
For more information on CWL consult: https://www.commonwl.org
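Because a CWL document is plain JSON or YAML, its workflow steps already form a graph. The sketch below is independent of graphit and only uses the standard json module: it loads a small CWL v1.0 workflow and derives step-to-step links from input sources written as "step/output". The tool file names (tar.cwl, compile.cwl) and the variable names are hypothetical.

```python
import json

# A small CWL v1.0 workflow in JSON form (tool files are hypothetical)
cwl_text = """
{
  "cwlVersion": "v1.0",
  "class": "Workflow",
  "inputs": {"archive": "File"},
  "outputs": {},
  "steps": {
    "untar": {
      "run": "tar.cwl",
      "in": {"tarfile": "archive"},
      "out": ["extracted"]
    },
    "compile": {
      "run": "compile.cwl",
      "in": {"src": "untar/extracted"},
      "out": ["classfile"]
    }
  }
}
"""

workflow = json.loads(cwl_text)

# A source written as "step/output" links an upstream step to this step;
# sources without '/' refer to workflow-level inputs and create no link here
step_edges = []
for step_id, step in workflow["steps"].items():
    for source in step["in"].values():
        if "/" in source:
            upstream = source.split("/", 1)[0]
            step_edges.append((upstream, step_id))
```

This sketch only handles the map form of "steps" and "in" with plain string sources; CWL also allows list forms and expanded input objects, which are not covered.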

graphit.graph_io.io_cwl_format.read_cwl(cwl_file, graph=None, **kwargs)¶

Parse Common Workflow Language data structures to a graph

Additional keyword arguments (kwargs) are passed to read_pydata

Parameters

cwl_file (File, string, stream or URL) – CWL data to parse
graph (:graphit:Graph) – Graph object to import dictionary data in

Returns

GraphAxis object

Return type

:graphit:GraphAxis

graphit.graph_io.io_dot_format module¶

Functions for exporting and importing graphs to and from graph description language (DOT) format

graphit.graph_io.io_dot_format.write_dot(graph, graph_name=None)¶

DOT graphs are either directed (digraph) or undirected; mixed mode is not supported.

Nodes and edges are all exported separately; shorthand notations are not supported. Grouping and subgraphs are not supported. Graph attributes in graph.data, graph.edges and graph.nodes will be exported as DOT directives regardless of whether they are official Graphviz DOT graph directives as listed in the reference documentation:

https://www.graphviz.org/doc/info/attrs.html

DOT reserved rendering keywords that are part of the graph's global attributes in graph.data, or part of the node and edge attributes, are exported as part of the DOT graph.

Parameters

graph (:graphit:Graph) – Graph object to export
graph_name (:py:str) – name of the ‘graph’ or ‘digraph’. Uses the ‘title’ attribute in graph.data by default, else graph_name

Returns

DOT graph representation

Return type

:py:str
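The export rules described above (one statement per node and per edge, 'graph' versus 'digraph', no subgraphs) can be sketched as follows. This is not the write_dot implementation: the function to_dot and its dictionary and tuple inputs are assumptions chosen to keep the example free of graphit.

```python
def to_dot(nodes, edges, directed=False, graph_name='graph'):
    """Serialize nodes and edges as DOT text, one statement per node and edge."""
    keyword = 'digraph' if directed else 'graph'
    connector = '->' if directed else '--'
    lines = ['{0} "{1}" {{'.format(keyword, graph_name)]
    for nid, attributes in nodes.items():
        if attributes:
            # Every attribute is written out, whether or not Graphviz knows it
            attr_text = ','.join('{0}="{1}"'.format(key, value)
                                 for key, value in sorted(attributes.items()))
            lines.append('  "{0}" [{1}];'.format(nid, attr_text))
        else:
            lines.append('  "{0}";'.format(nid))
    for source, target in edges:
        lines.append('  "{0}" {1} "{2}";'.format(source, connector, target))
    lines.append('}')
    return '\n'.join(lines)


dot_text = to_dot({'a': {'color': 'red'}, 'b': {}}, [('a', 'b')],
                  directed=True, graph_name='demo')
```

Quoting every identifier and value keeps the output valid for names containing spaces; escaping of embedded double quotes is left out of this sketch.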

graphit.graph_io.io_dot_format.read_dot(dot, graph=None)¶

Read graph in DOT format

Parameters

dot (File, string, stream or URL) – DOT graph data.
graph (:graphit:Graph) – Graph object to import DOT data in

Returns

Graph object

Return type

:graphit:Graph

graphit.graph_io.io_flattened_data_format module¶

Functions for importing and exporting flattened (dot separated) data structures

graphit.graph_io.io_flattened_data_format.read_flattened()¶

graphit.graph_io.io_flattened_data_format.write_flattened(graph, sep='.', default=None, allow_none=False, **kwargs)¶
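The two functions above are undocumented, so the following is only a sketch of what a flattened, separator-joined data structure looks like, using plain dictionaries. The helper names flatten and unflatten are hypothetical and are not part of graphit; only the sep='.' default mirrors the signature above.

```python
def flatten(data, sep='.', prefix=''):
    """Flatten a nested dictionary into {'a.b.c': value} form."""
    flat = {}
    for key, value in data.items():
        path = prefix + sep + str(key) if prefix else str(key)
        if isinstance(value, dict):
            # Recurse into nested dictionaries, extending the key path
            flat.update(flatten(value, sep=sep, prefix=path))
        else:
            flat[path] = value
    return flat


def unflatten(flat, sep='.'):
    """Rebuild the nested dictionary from separator-joined keys."""
    nested = {}
    for path, value in flat.items():
        *parents, leaf = path.split(sep)
        level = nested
        for key in parents:
            level = level.setdefault(key, {})
        level[leaf] = value
    return nested


original = {'server': {'host': 'localhost', 'port': 8080}, 'debug': True}
flat = flatten(original)
restored = unflatten(flat)
```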

graphit.graph_io.io_gexf_format module¶

Reading and writing graphs in GEXF format.

GEXF (Graph Exchange XML Format) is a language for describing complex network structures, their associated data and dynamics.

Reference and specification:

graphit.graph_io.io_gexf_format.read_gexf(gexf_file, graph=None)¶

Read graphs in GEXF format

Uses the Python built-in etree cElementTree parser to parse the XML document and convert the elements into nodes. The XML element tag becomes the node key, XML text becomes the node value and XML attributes are added to the node as additional attributes.

Parameters

gexf_file (File, string, stream or URL) – XML data to parse
graph (:graphit:Graph) – Graph object to import dictionary data in

Returns

GraphAxis object

Return type

:graphit:GraphAxis
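The element-to-node mapping described for read_gexf (tag becomes key, text becomes value, XML attributes become node attributes) can be sketched with the standard library. This example uses xml.etree.ElementTree, since the separate cElementTree module was removed in Python 3.9. The function name and the 'key'/'value' dictionary layout are assumptions for illustration, not graphit's internal node format.

```python
import xml.etree.ElementTree as ET


def elements_to_nodes(xml_text):
    """Convert every XML element into a dictionary of node attributes."""
    root = ET.fromstring(xml_text)
    nodes = []
    for element in root.iter():
        # Tag becomes the key; stripped text (or None) becomes the value
        node = {'key': element.tag,
                'value': (element.text or '').strip() or None}
        # XML attributes are added as additional node attributes
        node.update(element.attrib)
        nodes.append(node)
    return nodes


nodes = elements_to_nodes('<graph mode="static"><node id="0" label="a"/></graph>')
```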

graphit.graph_io.io_gexf_format.write_gexf(graph, node_tools=<class 'graphit.graph_io.io_gexf_format.GEXFNodeTools'>, edge_tools=<class 'graphit.graph_io.io_gexf_format.GEXFEdgeTools'>)¶

Export a graph to GEXF format

Custom XML serializers may be introduced as a custom NodeTools class using the node_tools attribute. In addition, the graph ORM may be used to inject tailored serialize methods in specific nodes or edges.

Parameters

graph (:graphit:Graph) – Graph to export
node_tools (:graphit:NodeTools) – NodeTools class with node serialize method
edge_tools (:graphit:EdgeTools) – EdgeTools class with edge serialize method

Returns

Graph exported as a hierarchical XML node structure

Return type

:py:str
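For orientation, a minimal GEXF 1.2 document has a gexf root containing a graph element with nodes and edges children. The sketch below builds such a document with the standard library only. It is not the write_gexf implementation and does not involve NodeTools or EdgeTools; the function to_gexf and its inputs are assumptions.

```python
import xml.etree.ElementTree as ET

GEXF_NS = 'http://www.gexf.net/1.2draft'


def to_gexf(nodes, edges, directed=False):
    """Build a minimal GEXF 1.2 document from {id: label} and (source, target)."""
    root = ET.Element('gexf', {'xmlns': GEXF_NS, 'version': '1.2'})
    graph = ET.SubElement(
        root, 'graph',
        {'defaultedgetype': 'directed' if directed else 'undirected'})
    nodes_el = ET.SubElement(graph, 'nodes')
    for nid, label in nodes.items():
        ET.SubElement(nodes_el, 'node', {'id': str(nid), 'label': str(label)})
    edges_el = ET.SubElement(graph, 'edges')
    for eid, (source, target) in enumerate(edges):
        # Edges are enumerated to give each one a unique id
        ET.SubElement(edges_el, 'edge',
                      {'id': str(eid), 'source': str(source), 'target': str(target)})
    return ET.tostring(root, encoding='unicode')


gexf_text = to_gexf({1: 'a', 2: 'b'}, [(1, 2)])
```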

graphit.graph_io.io_gml_format module¶

Functions for exporting and importing graphs to and from graph modelling language (GML) format as described in the online documentation:

https://en.wikipedia.org/wiki/Graph_Modelling_Language
http://www.fim.uni-passau.de/index.php?id=17297&L=1

graphit.graph_io.io_gml_format.read_gml(gml, graph=None)¶

Read graph in GML format

Parameters

gml (File, string, stream or URL) – GML graph data.
graph (:graphit:Graph) – Graph object to import GML data in

Returns

Graph object

Return type

:graphit:Graph

graphit.graph_io.io_gml_format.write_gml(graph, node_tools=None, edge_tools=None)¶

Export a graphit graph to GML format

Export graphit Graph data, nodes and edges in Graph Modelling Language (GML) format. The function replaces the graph NodeTools and EdgeTools with a custom version exposing a serialize method responsible for serializing the node/edge attributes in a GML format. The NodeTools class is also used to export Graph.data attributes.

Custom serializers may be introduced as custom NodeTools or EdgeTools classes using the node_tools and/or edge_tools attributes. In addition, the graph ORM may be used to inject tailored serialize methods in specific nodes or edges.

Parameters

graph (:graphit:Graph) – Graph object to export
node_tools (:graphit:NodeTools) – NodeTools class with node serialize method
edge_tools (:graphit:EdgeTools) – EdgeTools class with edge serialize method

Returns

GML graph representation

Return type

:py:str
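GML is a nested key/value text format with a top-level graph block containing node and edge blocks. The following sketch shows the shape of such output. It is a hand-written serializer for illustration, not write_gml: the function to_gml and its inputs are assumptions, and no custom NodeTools or EdgeTools are involved.

```python
def to_gml(nodes, edges, directed=False):
    """Serialize {id: label} nodes and (source, target) edges as GML text."""
    lines = ['graph [', '  directed {0}'.format(1 if directed else 0)]
    for nid, label in nodes.items():
        lines.extend([
            '  node [',
            '    id {0}'.format(nid),
            '    label "{0}"'.format(label),
            '  ]',
        ])
    for source, target in edges:
        lines.extend([
            '  edge [',
            '    source {0}'.format(source),
            '    target {0}'.format(target),
            '  ]',
        ])
    lines.append(']')
    return '\n'.join(lines)


gml_text = to_gml({1: 'a', 2: 'b'}, [(1, 2)])
```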

graphit.graph_io.io_helpers module¶

graphit.graph_io.io_helpers.initial_node(nodes)¶

Return node ID of node with smallest _id identifier.

Parameters: nodes – graph ‘nodes’ object
Returns: node ID

graphit.graph_io.io_helpers.resolve_root_node(graph)¶

Resolve the node ID of the root node of the graph.

For Graph objects there is no strict concept of a root node, and by default the ‘root’ attribute of the graph is not defined. Here, the root will resolve to the node nid with the smallest _id number, which is usually the first node added when the graph was created.

For GraphAxis objects a root is essential for defining the graph hierarchy, and thus the graph ‘root’ attribute should be defined. If it is not defined, it will also default to the node nid with the smallest _id number. If the user-defined or default root is in the (sub)graph, it is returned. If not, an attempt will be made to resolve it as follows:

If the graph is a single node, its node ID will be root.
If the graph has multiple nodes and the root is defined in the full_graph, return the node ID closest to the root

Parameters: graph – graph to resolve root node for
Returns: root node ID
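The default rule (the node with the smallest _id becomes the root) can be sketched on a plain dictionary. The layout assumed here, node ID mapped to an attribute dictionary holding an '_id' entry, is an assumption for this example, as is the helper name. The fallback that searches the full graph for the node closest to the root is not covered.

```python
def smallest_id_node(nodes):
    """Return the node ID whose attributes carry the smallest '_id' value."""
    return min(nodes, key=lambda nid: nodes[nid]['_id'])


# Node IDs mapped to attribute dictionaries (assumed layout)
nodes = {'c': {'_id': 3}, 'a': {'_id': 1}, 'b': {'_id': 2}}
root = smallest_id_node(nodes)
```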

graphit.graph_io.io_helpers.coarse_type(n)¶

graphit.graph_io.io_helpers.check_graphit_version(file_version)¶

Check if the graph version of the file is (backwards) compatible with the current graphit module version

Parameters: file_version (:py:str) – graphit version to check

graphit.graph_io.io_helpers.open_anything(source, mode='r')¶

Open input available from a file, a Python file like object, standard input, a URL or a string and return a uniform Python file like object with standard methods.

Parameters

source (mixed) – Input as file, Python file like object, standard input, URL or a string
mode (string) – file access mode, defaults to ‘r’

Returns

Python file like object
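The idea of returning one uniform file-like object for different kinds of input can be sketched as below. This is a simplified stand-in and not the open_anything implementation: the name open_source is hypothetical, and URL and standard input handling are deliberately left out.

```python
import io
import os


def open_source(source, mode='r'):
    """Return a file-like object for a file object, a file path or a string."""
    # Already a file-like object: return it unchanged
    if hasattr(source, 'read'):
        return source
    # Path to an existing file on disk
    if isinstance(source, str) and os.path.isfile(source):
        return open(source, mode)
    # Otherwise treat the input as the data itself
    return io.StringIO(str(source))


handle = open_source('a b c\nd e\n')
first_line = handle.readline()
```

Callers can then use read(), readline() and iteration without caring where the data came from.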

class graphit.graph_io.io_helpers.FormatDetect(set_locale='en_US.UTF-8', decimal_point=None, thousands_sep=None)¶

Bases: object

Type cast string or unicode objects to float, integer or boolean.

Uses localization to identify the decimal point and thousands separator.

TODO: comma separated strings fail if one comma

parse(value, target_type=None)¶

Parse an unknown value to a float, integer or boolean; otherwise leave it as unicode.

Parameters

value – value to parse
target_type – type to convert to as ‘integer’, ‘number’, ‘string’, ‘boolean’ or automatic ‘detect’

Returns

parsed value

to_boolean(value)¶

to_detect(value)¶

static to_integer(value)¶

static to_number(value)¶

static to_string(value)¶
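The automatic 'detect' behaviour can be approximated without any locale handling, as in the sketch below. The function detect_type is hypothetical and ignores the decimal_point and thousands_sep settings of FormatDetect. It only shows the order of attempts: boolean, integer, float, then the unchanged string.

```python
def detect_type(value):
    """Cast a string to bool, int or float when possible, else return it as is."""
    text = value.strip()
    # Boolean words are checked first so they are not left as strings
    if text.lower() in ('true', 'false'):
        return text.lower() == 'true'
    try:
        return int(text)
    except ValueError:
        pass
    try:
        return float(text)
    except ValueError:
        # Not a recognised type: leave the original string untouched
        return value


parsed = [detect_type(item) for item in ('42', '3.14', 'True', 'abc')]
```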

class graphit.graph_io.io_helpers.StreamReader(stream)¶

Bases: object

StreamReader class

Extension of the Python file-like object (io class) to read data as flexible streams. Enables a stream to be read by character, or block of characters crossing file lines.

Parameters: stream – textual data that can be parsed as file-like object.

next()¶

Iterator next method

Returns next character in the iterations as long as there are characters left in the file-like object

Raises: StopIteration, if no more characters

read_upto_block(blocks, sep=(' ', '\n'), keep=False)¶

Return characters from active position up to a certain block of characters or the first occurrence of one of multiple blocks. A block is defined as a sequence of characters bounded by separator characters sep usually spaces and newline characters.

Parameters

blocks (:py:str, :py:list, :py:tuple) – block(s) to search for.
sep (:py:tuple, :py:list) – block separation characters
keep (:py:bool) – keep the block to search for as part of the returned string

Returns

tuple of text segment and termination character

Return type

:py:tuple

read_upto_char(chars, keep=False)¶

Return characters from active position up to a certain character or the first occurrence of one of multiple characters.

Parameters

chars (:py:str, :py:list, :py:tuple) – character(s) to search for.
keep (:py:bool) – keep the character to search for as part of the returned string

Returns

tuple of text segment and termination character

Return type

:py:tuple
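Reading up to a terminating character can be sketched on top of io.StringIO. This standalone function mirrors the documented return value (text segment plus termination character) but is not the StreamReader method itself. Returning None as the terminator at the end of the stream is an assumption.

```python
import io


def read_upto_char(stream, chars, keep=False):
    """Read from the current position up to the first character in chars."""
    collected = []
    while True:
        char = stream.read(1)
        if not char:
            # End of stream reached without finding a terminator (assumed behaviour)
            return ''.join(collected), None
        if char in chars:
            if keep:
                collected.append(char)
            return ''.join(collected), char
        collected.append(char)


stream = io.StringIO('key=value;next')
segment, terminator = read_upto_char(stream, ('=', ';'))
second, second_terminator = read_upto_char(stream, ('=', ';'))
```

Because the stream position advances with every call, consecutive calls walk through the text segment by segment, across line boundaries.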

readline()¶: Returns ‘readline’ method of the base file-like object

set_cursor(position)¶

Move the file reader cursor to a new position in the file

Parameters: position (:py:int) – position to move to

slice(start, stop, step=1)¶

Text slice method.

Returns a segment of text defined by a start and stop character position relative to the start of the text.

Parameters

start (:py:int) – start character position
stop (:py:int) – stop character position

Return type

:py:str

tell()¶

Return current position of file cursor

Return type: :py:int

graphit.graph_io.io_jgf_format module¶

Functions for reading and writing graph files in the graphit .jgf JSON format

This is a proprietary format in which the graph meta-data, the nodes, edges and their data dictionaries are stored in JSON format.

graphit.graph_io.io_jgf_format.read_jgf(jgf_format, graph=None)¶

Read JSON graph format (.jgf)

This is a proprietary format in which the graph meta-data, the nodes, edges and their data dictionaries are stored in JSON format.

Format description. Primary key/value pairs:

graph: Graph class meta-data. Serializes all class attributes of type int, float, bool, long, str or unicode.
nodes: Graph node identifiers (keys) and attributes (values)
edges: Graph enumerated edge identifiers
edge_attr: Graph edge attributes

Parameters

jgf_format (:py:str) – JSON encoded graph data to parse
graph (:graphit:Graph) – Graph object to import JGF data in

Returns

Graph object

Return type

Graph or GraphAxis object

graphit.graph_io.io_jgf_format.write_jgf(graph, indent=2, encoding='utf-8', **kwargs)¶

Write JSON graph format

This is a proprietary format in which the graph meta-data, the nodes, edges and their data dictionaries are stored in JSON format.

Format description. Primary key/value pairs:

graph: Graph class meta-data. Serializes all class attributes of type int, float, bool, long, str or unicode.
data: Graph meta-data dictionary
nodes: Graph node identifiers (keys) and attributes (values)
edges: Graph enumerated edge identifiers
edge_attr: Graph edge attributes

Parameters

graph (Graph or GraphAxis object) – graph object to serialize
indent (:py:int) – JSON indentation count
encoding (:py:str) – JSON string encoding
kwargs (:py:dict) – additional data to be stored as file meta data

Returns

JSON encoded graph dictionary

Return type

:py:str
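To make the primary keys concrete, the sketch below assembles a dictionary with the five documented top-level keys and round-trips it through the standard json module. Only the top-level key names come from the description above. Every value shown (attribute names such as 'directed' and 'root', the node and edge layout) is a hypothetical placeholder and may differ from what write_jgf actually emits.

```python
import json

# Top-level keys follow the format description; all values are placeholders
jgf_document = {
    'graph': {'directed': False, 'root': None},
    'data': {'title': 'demo'},
    'nodes': {'1': {'key': 'a'}, '2': {'key': 'b'}},
    'edges': {'0': [1, 2]},
    'edge_attr': {'0': {'weight': 1.0}},
}

jgf_text = json.dumps(jgf_document, indent=2)
restored = json.loads(jgf_text)
```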

graphit.graph_io.io_json_format module¶

Functions for importing and exporting JSON data into a graph data structure

graphit.graph_io.io_json_format.read_json(json_file, graph=None, **kwargs)¶

Parse (hierarchical) JSON data structure to a graph

Use the default Python json parser to parse the JSON file to a dictionary followed by io_dict_format.read_pydata to parse to a graph structure.

Additional keyword arguments (kwargs) are passed to read_pydata

Parameters

json_file (File, string, stream or URL) – json data to parse
graph (:graphit:Graph) – Graph object to import dictionary data in

Returns

GraphAxis object

Return type

:graphit:GraphAxis
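The two-stage approach described for read_json (parse JSON to a dictionary, then convert the dictionary hierarchy to nodes and edges) can be sketched without graphit. The function below is a simplified stand-in for the read_pydata step. The node layout with 'key' and 'value' entries, the artificial 'root' node and the integer node IDs are assumptions for this example.

```python
import json
from itertools import count


def json_to_nodes_edges(json_text):
    """Convert nested JSON objects into nodes and parent-child edges."""
    data = json.loads(json_text)
    nodes, edges = {}, []
    ids = count(1)

    def walk(key, value, parent=None):
        nid = next(ids)
        if isinstance(value, dict):
            # Objects become value-less nodes with one child per member
            nodes[nid] = {'key': key, 'value': None}
            for child_key, child_value in value.items():
                walk(child_key, child_value, parent=nid)
        else:
            # Anything else (including lists) is stored as a leaf value
            nodes[nid] = {'key': key, 'value': value}
        if parent is not None:
            edges.append((parent, nid))
        return nid

    walk('root', data)
    return nodes, edges


nodes, edges = json_to_nodes_edges('{"a": {"b": 1}, "c": 2}')
```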

graphit.graph_io.io_json_format.write_json(graph, default=None, include_root=False, allow_none=True, **kwrags)¶

Export a graph to a (nested) JSON structure

Convert graph representation of the dictionary tree into JSON using a nested or flattened representation of the dictionary hierarchy.

Dictionary keys and values are obtained from the node attributes using key_tag and value_tag. The key_tag is set to graph key_tag by default.

Additional keyword arguments (kwargs) are passed to json.dumps()

Parameters

graph (:graphit:GraphAxis) – Graph object to export
default (mixed) – value to use when node value was not found using value_tag.
include_root (:py:bool) – Include the root node in the hierarchy
root_nid – root node ID in graph hierarchy
allow_none (:py:bool) – allow None values in the output

Return type

:py:json

graphit.graph_io.io_jsonschema_format module¶

Functions for building and validating graphs based on a JSON schema definition. http://json-schema.org

graphit.graph_io.io_jsonschema_format.read_json_schema(schema, graph=None, exclude_args=None, resolve_ref=True)¶

Import hierarchical data structures defined in a JSON schema format

Parameters

schema (dict, file, string, stream or URL) – JSON Schema data format to import
graph (:graphit:Graph) – graph object to import JSON Schema data in
exclude_args (:py:list) – JSON schema arguments to exclude from import
resolve_ref (:py:bool) – Parse JSON schema ‘definitions’

Returns

Graph object

Return type

:graphit:Graph
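What resolving references against the schema 'definitions' involves can be sketched on plain dictionaries. The function resolve_refs below is hypothetical and is not how read_json_schema is implemented. It only handles local references of the form '#/definitions/name', with no cycle detection and no JSON Pointer escape handling.

```python
def resolve_refs(fragment, schema):
    """Replace local '$ref' entries with the schema fragment they point to."""
    if isinstance(fragment, dict):
        ref = fragment.get('$ref')
        if isinstance(ref, str) and ref.startswith('#/'):
            # Follow the path segments from the schema root
            target = schema
            for part in ref[2:].split('/'):
                target = target[part]
            return resolve_refs(target, schema)
        return {key: resolve_refs(value, schema)
                for key, value in fragment.items()}
    if isinstance(fragment, list):
        return [resolve_refs(item, schema) for item in fragment]
    return fragment


schema = {
    'definitions': {'port': {'type': 'integer', 'minimum': 1}},
    'type': 'object',
    'properties': {
        'host': {'type': 'string'},
        'port': {'$ref': '#/definitions/port'},
    },
}
resolved = resolve_refs(schema, schema)
```

After resolution, the 'port' property holds the full definition, so a hierarchy built from 'properties' no longer needs to look anything up.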