Tabular output (indra.assemblers.tsv.assembler)
- class indra.assemblers.tsv.assembler.TsvAssembler(statements=None)
Assembles Statements into a set of tabular files for export or curation.
Currently designed for use with “raw” Statements, i.e., Statements with a single evidence entry. Exports Statements into a single tab-separated file with the following columns:
- INDEX
A 1-indexed integer identifying the statement.
- UUID
The UUID of the Statement.
- TYPE
Statement type, given by the name of the class in indra.statements.
- STR
String representation of the Statement. Contains most relevant information for curation including any additional statement data beyond the Statement type and Agents.
- AG_A_TEXT
For Statements extracted from text, the text in the sentence corresponding to the first agent (i.e., the 'TEXT' entry in the db_refs dictionary). For all other Statements, the Agent name is given. Empty field if the Agent is None.
- AG_A_LINKS
Groundings for the first agent given as a comma-separated list of identifiers.org links. Empty if the Agent is None.
- AG_A_STR
String representation of the first agent, including additional agent context (e.g. modification, mutation, location, and bound conditions). Empty if the Agent is None.
- AG_B_TEXT, AG_B_LINKS, AG_B_STR
As above for the second agent. Note that the Agent may be None (and these fields left empty) if the Statement consists only of a single Agent (e.g., SelfModification, ActiveForm, or Translocation statement).
- PMID
PMID of the first entry in the evidence list for the Statement.
- TEXT
Evidence text for the Statement.
- IS_HYP
Whether the Statement represents a “hypothesis”, as flagged by some reading systems and recorded in the evidence.epistemics['hypothesis'] field.
- IS_DIRECT
Whether the Statement represents a direct physical interaction, as recorded in the evidence.epistemics['direct'] field.
In addition, if the add_curation_cols flag is set when calling TsvAssembler.make_model(), the following additional (empty) columns will be added, to be filled out by curators:
- AG_A_IDS_CORRECT
Correctness of Agent A grounding.
- AG_A_STATE_CORRECT
Correctness of Agent A context (e.g., modification, bound, and other conditions).
- AG_B_IDS_CORRECT, AG_B_STATE_CORRECT
As above, for Agent B.
- EVENT_CORRECT
Whether the event is supported by the evidence text if the entities (Agents A and B) are considered as placeholders (i.e., ignoring the correctness of their grounding).
- RES_CORRECT
For Modification statements, whether the amino acid residue indicated by the Statement is supported by the evidence.
- POS_CORRECT
For Modification statements, whether the amino acid position indicated by the Statement is supported by the evidence.
- SUBJ_ACT_CORRECT
For Activation/Inhibition Statements, whether the activity indicated for the subject (Agent A) is supported by the evidence.
- OBJ_ACT_CORRECT
For Activation/Inhibition Statements, whether the activity indicated for the object (Agent B) is supported by the evidence.
- HYP_CORRECT
Whether the Statement is correctly flagged as a hypothesis.
- DIRECT_CORRECT
Whether the Statement is correctly flagged as direct.
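As a rough illustration of how a file with the columns described above might be read back for analysis, the sketch below parses a tab-separated table with the standard library only. The `read_rows` and `split_links` helpers are hypothetical, and the sample row (UUID, PMID, links, evidence text) consists of placeholder values written for this example rather than output produced by TsvAssembler. The exact serialization of the link and boolean fields is an assumption.

```python
import csv
import io

# Column names as listed in the documentation above (without curation columns).
COLUMNS = [
    "INDEX", "UUID", "TYPE", "STR",
    "AG_A_TEXT", "AG_A_LINKS", "AG_A_STR",
    "AG_B_TEXT", "AG_B_LINKS", "AG_B_STR",
    "PMID", "TEXT", "IS_HYP", "IS_DIRECT",
]

# Placeholder row for illustration only; not real TsvAssembler output.
SAMPLE_ROW = [
    "1", "example-uuid", "Phosphorylation",
    "Phosphorylation(MAP2K1(), MAPK1())",
    "MEK1",
    "https://identifiers.org/hgnc:6840, https://identifiers.org/uniprot:Q02750",
    "MAP2K1()",
    "ERK2", "https://identifiers.org/uniprot:P28482", "MAPK1()",
    "12345", "MEK1 phosphorylates ERK2.", "False", "True",
]


def split_links(cell):
    """Split a comma-separated links cell into a list; empty cell gives []."""
    return [link.strip() for link in cell.split(",") if link.strip()]


def read_rows(handle):
    """Read a tab-separated table into dicts, expanding the link columns."""
    reader = csv.DictReader(handle, delimiter="\t")
    rows = []
    for row in reader:
        row["AG_A_LINKS"] = split_links(row["AG_A_LINKS"])
        row["AG_B_LINKS"] = split_links(row["AG_B_LINKS"])
        rows.append(row)
    return rows


sample = "\t".join(COLUMNS) + "\n" + "\t".join(SAMPLE_ROW) + "\n"
rows = read_rows(io.StringIO(sample))
print(rows[0]["TYPE"], rows[0]["AG_A_LINKS"])
```

For a real export, the same `read_rows` helper could be given an open file handle instead of the in-memory sample. Rows for single-agent Statements would then simply yield an empty list for AG_B_LINKS.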
- Parameters
statements (Optional[list[indra.statements.Statement]]) – A list of INDRA Statements to be assembled.
- make_model(output_file, add_curation_cols=False, up_only=False)
Export the statements into a tab-separated text file.
- Parameters
output_file (str) – Name of the output file.
add_curation_cols (bool) – Whether to add columns to facilitate statement curation. Default is False (no additional columns).
up_only (bool) – Whether to include identifiers.org links only for the UniProt grounding of an agent when one is available. Because most spreadsheets allow only a single hyperlink per cell, this can make it easier to link to UniProt information pages for curation purposes. Default is False.
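The effect of up_only can be pictured with a small standalone sketch. It is not the INDRA implementation: the `agent_links` helper, the `PREFIXES` mapping, the use of 'UP' as the db_refs key for UniProt, and the URL pattern are all assumptions made for illustration.

```python
# Assumed URL pattern and namespace-to-prefix mapping (illustrative only).
BASE = "https://identifiers.org/"
PREFIXES = {"UP": "uniprot", "HGNC": "hgnc"}


def agent_links(db_refs, up_only=False):
    """Return identifiers.org-style links for an agent's groundings.

    Hypothetical helper sketching the documented behavior: with
    up_only=True, only the UniProt link is kept when one is available.
    """
    if db_refs is None:
        # Mirrors the "empty if the Agent is None" convention.
        return []
    # The 'TEXT' entry holds the raw entity text, not a grounding.
    groundings = {ns: id_ for ns, id_ in db_refs.items() if ns != "TEXT"}
    if up_only and "UP" in groundings:
        groundings = {"UP": groundings["UP"]}
    return [
        f"{BASE}{PREFIXES.get(ns, ns.lower())}:{id_}"
        for ns, id_ in sorted(groundings.items())
    ]


refs = {"TEXT": "ERK2", "HGNC": "6871", "UP": "P28482"}
all_links = agent_links(refs)
up_links = agent_links(refs, up_only=True)
print(", ".join(all_links))
print(", ".join(up_links))
```

With up_only=True the cell holds a single link, which a spreadsheet can turn into one clickable hyperlink. When no UniProt grounding exists, the sketch falls back to listing all available links.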