bbglab/openvariant

OpenVariant is a Python package for reading, parsing, and operating on multiple input file formats (e.g. tsv, csv, vcf, maf, bed), with customizable output.

Its aim is to manage large amounts of data represented in multiple ways and to build a unified output with a proper annotation file structure. The package is designed to work with any kind of data that can be represented as a table.
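To make the idea of a unified output concrete, here is a minimal sketch that uses only the Python standard library. It is not OpenVariant's API: the `MAPPINGS` dictionary, the `unify` function, the sample inputs, and the unified field names are all hypothetical and chosen for illustration. It shows two tabular inputs with different delimiters and column names being renamed into one common schema.

```python
import csv
import io

# Hypothetical inputs: the same kind of record stored with different
# delimiters and column names (not taken from any real dataset).
MAF_LIKE = "Chromosome\tStart_Position\tReference_Allele\n1\t12345\tA\n"
CSV_LIKE = "chr,pos,ref\n2,67890,G\n"

# Hypothetical mapping: unified field name -> source column, per input format.
MAPPINGS = {
    "maf": {"CHROMOSOME": "Chromosome", "POSITION": "Start_Position", "REF": "Reference_Allele"},
    "csv": {"CHROMOSOME": "chr", "POSITION": "pos", "REF": "ref"},
}


def unify(text, fmt, delimiter):
    """Yield each row of `text` with its columns renamed to the unified fields."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    mapping = MAPPINGS[fmt]
    for row in reader:
        yield {target: row[source] for target, source in mapping.items()}


# Concatenate both sources into a single list that shares one schema.
unified_rows = list(unify(MAF_LIKE, "maf", "\t")) + list(unify(CSV_LIKE, "csv", ","))
for row in unified_rows:
    print(row)
```

In OpenVariant itself, this kind of mapping is described in an annotation file rather than in code; see the documentation for the actual format.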

Documentation: https://openvariant.readthedocs.io

Features

OpenVariant offers a toolkit to transform and operate on the parsed input data. Some of the functionalities that can be applied to the parsed result are:

  • Find files
  • Read and save
  • Cat
  • Group by
  • Count
  • Command-line interface (CLI)
  • Plugins

Check the User guide in OpenVariant's documentation for full information on how it works and how the different functionalities can be applied.
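As a rough picture of what "Count" and "Group by" mean once rows have been parsed, the sketch below performs both operations in plain Python. It does not call OpenVariant: `count_rows`, `group_by`, the `rows` list, and its field names are hypothetical stand-ins, and the package's real functions and signatures are described in the User guide.

```python
from collections import defaultdict

# Hypothetical parsed rows, already in a unified schema.
rows = [
    {"SAMPLE": "S1", "CHROMOSOME": "1", "REF": "A"},
    {"SAMPLE": "S1", "CHROMOSOME": "2", "REF": "G"},
    {"SAMPLE": "S2", "CHROMOSOME": "1", "REF": "C"},
]


def count_rows(rows, where=None):
    """Count rows, optionally keeping only those for which `where(row)` is true."""
    return sum(1 for row in rows if where is None or where(row))


def group_by(rows, key):
    """Group rows into lists keyed by the value of the `key` field."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    return dict(groups)


total = count_rows(rows)
on_chr1 = count_rows(rows, where=lambda r: r["CHROMOSOME"] == "1")
per_sample = {sample: len(group) for sample, group in group_by(rows, "SAMPLE").items()}
print(total, on_chr1, per_sample)
```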

Installation

It requires Python 3 or higher and can be installed from PyPI with:

pip install open-variant

For more details check our Installation section.
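After installing, one way to confirm that the package is visible to the current interpreter is to query its distribution metadata. This is a minimal sketch using the standard library's `importlib.metadata` (available from Python 3.8); the helper name `installed_version` is hypothetical, and the distribution name `open-variant` is taken from the `pip` command above.

```python
from importlib import metadata


def installed_version(distribution):
    """Return the installed version of `distribution`, or None if it is not installed."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("open-variant")
print("open-variant:", version if version else "not installed")
```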

Examples

We provide a variety of examples to help you understand how OpenVariant can be applied. Explore the Examples section in OpenVariant's documentation for more details.

We also provide a small dataset for hands-on use with OpenVariant, allowing users to test the tool's functionalities. It is available on Zenodo and can be downloaded with the following commands:

pip install zenodo_get
zenodo_get 14215914

Contributing

You're welcome to contribute to the code as much as you'd like!

Please review the guidelines outlined in the Contributing document and adhere to the standards of conduct detailed in the Code of Conduct.

License

The software is licensed under BSD-3-Clause.

Reference

If you use OpenVariant in your research, please cite:

David Martínez-Millán, Federica Brando, Miguel L. Grau, Mònica Sánchez-Guixé, Carlos López-Elorduy, Iker Reyes-Salazar, Jordi Deu-Pons, Núria López-Bigas, Abel González-Pérez, OpenVariant: a toolkit to parse and operate multiple input file formats, Bioinformatics, Volume 40, Issue 12, December 2024, btae714, https://doi.org/10.1093/bioinformatics/btae714