Tabular output (indra.assemblers.tsv.assembler)

class indra.assemblers.tsv.assembler.TsvAssembler(statements=None)

Assembles Statements into a set of tabular files for export or curation.

Currently designed for use with “raw” Statements, i.e., Statements with a single evidence entry. Exports Statements into a single tab-separated file with the following columns:

INDEX

A 1-indexed integer identifying the statement.

UUID

The UUID of the Statement.

TYPE

Statement type, given by the name of the class in indra.statements.

STR

String representation of the Statement. Contains most relevant information for curation including any additional statement data beyond the Statement type and Agents.

AG_A_TEXT

For Statements extracted from text, the text in the sentence corresponding to the first agent (i.e., the 'TEXT' entry in the db_refs dictionary). For all other Statements, the Agent name is given. Empty if the Agent is None.

AG_A_LINKS

Groundings for the first agent given as a comma-separated list of identifiers.org links. Empty if the Agent is None.

AG_A_STR

String representation of the first agent, including additional agent context (e.g., modification, mutation, location, and bound conditions). Empty if the Agent is None.

AG_B_TEXT, AG_B_LINKS, AG_B_STR

As above for the second agent. Note that the Agent may be None (and these fields left empty) if the Statement consists only of a single Agent (e.g., SelfModification, ActiveForm, or Translocation statement).

PMID

PMID of the first entry in the evidence list for the Statement.

TEXT

Evidence text for the Statement.

IS_HYP

Whether the Statement represents a “hypothesis”, as flagged by some reading systems and recorded in the evidence.epistemics['hypothesis'] field.

IS_DIRECT

Whether the Statement represents a direct physical interaction, as recorded in the evidence.epistemics['direct'] field.
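As a rough illustration of consuming a file with the columns above, the following sketch parses tab-separated text with Python's standard csv module. The sample row is invented for illustration. The exact text the assembler writes into each field (for example, how IS_HYP and IS_DIRECT are serialized, or the precise form of the identifiers.org links) is an assumption here, not documented behavior, and the helper names read_rows and split_links are hypothetical.

```python
import csv
import io

# Hypothetical sample in the column layout described above; the field
# values are invented and may not match real assembler output exactly.
SAMPLE_TSV = (
    "INDEX\tUUID\tTYPE\tSTR\tAG_A_TEXT\tAG_A_LINKS\tAG_A_STR\t"
    "AG_B_TEXT\tAG_B_LINKS\tAG_B_STR\tPMID\tTEXT\tIS_HYP\tIS_DIRECT\n"
    "1\tabc-123\tPhosphorylation\tPhosphorylation(MAP2K1(), MAPK1())\t"
    "MEK1\thttps://identifiers.org/hgnc:6840,"
    "https://identifiers.org/uniprot:Q02750\t"
    "MAP2K1()\tERK2\thttps://identifiers.org/hgnc:6871\tMAPK1()\t"
    "12345678\tMEK1 phosphorylates ERK2.\tFalse\tTrue\n"
)


def read_rows(tsv_text):
    """Parse tab-separated text into a list of dicts keyed by column name."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return list(reader)


def split_links(cell):
    """Split a comma-separated links cell into a list; empty cell -> []."""
    return [link for link in cell.split(",") if link]


rows = read_rows(SAMPLE_TSV)
first = rows[0]
agent_a_links = split_links(first["AG_A_LINKS"])
```

All values come back as strings, so any conversion of INDEX to an integer or of IS_HYP and IS_DIRECT to booleans is left to the caller.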

In addition, if the add_curation_cols flag is set when calling TsvAssembler.make_model(), the following additional (empty) columns will be added, to be filled out by curators:

AG_A_IDS_CORRECT

Correctness of Agent A grounding.

AG_A_STATE_CORRECT

Correctness of Agent A context (e.g., modification, bound, and other conditions).

AG_B_IDS_CORRECT, AG_B_STATE_CORRECT

As above, for Agent B.

EVENT_CORRECT

Whether the event is supported by the evidence text if the entities (Agents A and B) are considered as placeholders (i.e., ignoring the correctness of their grounding).

RES_CORRECT

For Modification statements, whether the amino acid residue indicated by the Statement is supported by the evidence.

POS_CORRECT

For Modification statements, whether the amino acid position indicated by the Statement is supported by the evidence.

SUBJ_ACT_CORRECT

For Activation/Inhibition Statements, whether the activity indicated for the subject (Agent A) is supported by the evidence.

OBJ_ACT_CORRECT

For Activation/Inhibition Statements, whether the activity indicated for the object (Agent B) is supported by the evidence.

HYP_CORRECT

Whether the Statement is correctly flagged as a hypothesis.

DIRECT_CORRECT

Whether the Statement is correctly flagged as direct.
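Because the curation columns are exported empty, it can be useful to check which rows a curator has not yet filled in. The sketch below does this for a subset of the curation columns using only the standard library. The sample data, the convention of entering 1 for "correct", and the function name uncurated_indices are all assumptions for illustration; the assembler itself does not prescribe what curators enter.

```python
import csv
import io

# Hypothetical partially curated export, reduced to a few columns.
CURATED_TSV = (
    "INDEX\tTYPE\tAG_A_IDS_CORRECT\tEVENT_CORRECT\n"
    "1\tPhosphorylation\t1\t1\n"
    "2\tActivation\t\t\n"
    "3\tComplex\t1\t\n"
)


def uncurated_indices(tsv_text, columns):
    """Return INDEX values of rows where any of the given columns is blank."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    pending = []
    for row in reader:
        # Treat missing, empty, and whitespace-only cells as not yet curated.
        if any(not (row.get(col) or "").strip() for col in columns):
            pending.append(row["INDEX"])
    return pending


pending = uncurated_indices(
    CURATED_TSV, ["AG_A_IDS_CORRECT", "EVENT_CORRECT"]
)
```

Note that some columns, such as RES_CORRECT or SUBJ_ACT_CORRECT, apply only to certain Statement types, so a blank cell there does not necessarily mean the row is unreviewed.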

Parameters

statements (Optional[list[indra.statements.Statement]]) – A list of INDRA Statements to be assembled.

statements

A list of INDRA Statements to be assembled.

Type

list[indra.statements.Statement]

make_model(output_file, add_curation_cols=False, up_only=False)

Export the statements into a tab-separated text file.

Parameters
  • output_file (str) – Name of the output file.

  • add_curation_cols (bool) – Whether to add columns to facilitate statement curation. Default is False (no additional columns).

  • up_only (bool) – Whether to include identifiers.org links only for the UniProt grounding of an agent when one is available. Because most spreadsheets allow only a single hyperlink per cell, this can make it easier to link to UniProt information pages for curation purposes. Default is False.
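A minimal sketch of how the constructor and make_model() fit together, based only on the signatures documented here. The wrapper name export_for_curation is hypothetical, as is the choice to default up_only to True. The import is placed inside the function so the snippet can be loaded without INDRA installed.

```python
def export_for_curation(statements, output_file, up_only=True):
    """Write Statements to a TSV file with the empty curation columns added."""
    # Imported lazily so this sketch can be loaded without INDRA installed.
    from indra.assemblers.tsv.assembler import TsvAssembler

    assembler = TsvAssembler(statements)
    # add_curation_cols=True appends the empty columns for curators;
    # up_only restricts agent links to UniProt when a grounding exists.
    assembler.make_model(
        output_file, add_curation_cols=True, up_only=up_only
    )
    return output_file
```

With INDRA installed, calling export_for_curation(stmts, "curation.tsv") on a list of raw Statements should produce a single file that can be opened in a spreadsheet for curation.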