OpenVariant is a comprehensive Python package that provides different functionalities to read, parse and operate
different multiple input file formats (e. g. tsv
, csv
, vcf
, maf
, bed
), being able to customize the output.
Its aim is being able to manage a ton of data represented in multiple ways and be able to build an unified output with a proper annotation file structure. This package was thought to work with any kind of data that can be represented as a table.
Documentation: https://openvariant.readthedocs.io
OpenVariant offers a toolkit to transform and operate the parsed input data. We will be able to apply different functionalities on our parsed result some of them are the following ones:
- Find files
- Read and save
- Cat
- Group by
- Count
- Command-line interface (CLI)
- Plugins
Check User guide in OpenVariant's documentation to find all the information about how it works and how can be applied the different functionalities.
It requires Python 3 or higher and can be installed as PyPI package with:
pip install open-variant
For more details check our Installation section.
We provide a variety of examples to help to understand how OpenVariant can be applied. Explore the Examples section in OpenVariant's documentation for more details.
As well, we present a small dataset for hands-on use with OpenVariant, allowing users to test the tool's functionalities. It may be found at Zenodo and it can be downloaded using the following commands:
pip install zenodo_get
zenodo_get 14215914
You're welcome to contribute to the code as much as you'd like!
Please review the guidelines outlined in the Contributing document and adhere to the standards of conduct detailed in the Code of Conduct.
The software is licensed under BSD-3-Clause.
If you use OpenVariant in your research, please cite:
David Martínez-Millán, Federica Brando, Miguel L. Grau, Mònica Sánchez-Guixé, Carlos López-Elorduy, Iker Reyes-Salazar, Jordi Deu-Pons, Núria López-Bigas, Abel González-Pérez, OpenVariant: a toolkit to parse and operate multiple input file formats, Bioinformatics, Volume 40, Issue 12, December 2024, btae714, https://doi.org/10.1093/bioinformatics/btae714