Release INDRA v1.17.0 · sorgerlab/indra

Package structure and dependencies

The high-throughput reading tools in indra.tools.reading have been
moved into an independent repository at
https://github.com/indralab/indra_reading.
The obonet package is now a required dependency.

Input processors

New VirHostNet API and processor in indra.sources.virhostnet.
New Hypothes.is API and processor allowing processing
annotations into INDRA Statements in indra.sources.hypothesis.
Integrated new version of Sparser reader with improved and adapted
processor in indra.sources.sparser.
Refactored Eidos API into separate reader, client and server with
significantly extended usage instructions in indra.sources.eidos.
New endpoints in Eidos API implementing biology-oriented processing, as
well as regrounding with respect to a user-supplied ontology in
indra.sources.eidos.
Overhauled Reach API supporting the latest Reach version and prioritizing
using Reach as a (local) web service over JAR-based reading
in indra.sources.reach, also with significantly extended usage
instructions.
Support for extracting Conversion statements from Reach output in
indra.sources.reach.
ISI reader processor in indra.sources.isi no longer requires a Python 2
runtime and the separate nxml2text tool.
Updated RLIMS-P API in indra.sources.rlimsp to work with latest web service at
https://research.bioinformatics.udel.edu/itextmine/api/data/rlims/.

Resources and clients

Overhauled GO resources and client to be based on OBO in
indra.databases.go_client.
MeSH tree numbers, isa relationships and determination of
subtree implemented in indra.databases.mesh_client.
Extended mappings added between MeSH and GO, ChEBI, EFO, DOID, HP.
All resource files updated to latest version in indra.resources.

Core assembly modules

New Pipeline module in indra.pipeline allowing building
and running custom assembly pipelines programmatically or from a JSON file.
UniProt protein features and EFO, DOID, HP ontologies
integrated with name/grounding standardization.
Overhauled integration of Gilda (https://github.com/indralab/gilda) with
indra.preassembler.grounding_mapper allowing selective re-grounding
of statements as well as disambiguation.
New one-statement-per-line JSON output mode in indra.statements.io to
support streaming large INDRA Statement JSONs.

Literature

New bioRxiv/medRxiv client to monitor collections and download PDF/XML/text
content in indra.literature.biorxiv_client.
Significantly improved text extraction from PMC NXML in
indra.literature.pmc_client.

Output assemblers

Refactoried English assembler to allow keeping track of
entity coordinates within each assembled sentence in
indra.assemblers.english.
Improved construction of entity links in HTML assembler
in indra.assemblers.html.
More annotations propagated into PyBEL graph in
indra.assemblers.pybel.
Made the upload and update of CX networks more robust in
indra.assemblers.cx.

Explanation

Tools

Refactored LiveCuration service into a separate module in
indra.tools.live_curation.
Updated outdated assembly approach in GeneNetwork tool in
indra.tools.gene_network.

Other

Provide feedback