
Product
Introducing Tier 1 Reachability: Precision CVE Triage for Enterprise Teams
Socket’s new Tier 1 Reachability filters out up to 80% of irrelevant CVEs, so security teams can focus on the vulnerabilities that matter.
.. image:: https://rawgit.com/carlosp420/dataset-creator/master/media/logo.svg :width: 240px :align: center :alt: Dataset-creator
.. list-table:: :stub-columns: 1
* - tests
- | |travis| |requires| |coveralls|
* - package
- |version| |wheel| |supported-versions| |supported-implementations|
.. |travis| image:: https://travis-ci.org/carlosp420/dataset-creator.svg?branch=master :alt: Travis-CI Build Status :target: https://travis-ci.org/carlosp420/dataset-creator
.. |requires| image:: https://requires.io/github/carlosp420/dataset-creator/requirements.svg?branch=master :alt: Requirements Status :target: https://requires.io/github/carlosp420/dataset-creator/requirements/?branch=master
.. |coveralls| image:: https://coveralls.io/repos/carlosp420/dataset-creator/badge.svg?branch=master&service=github :alt: Coverage Status :target: https://coveralls.io/r/carlosp420/dataset-creator
.. |version| image:: https://img.shields.io/pypi/v/dataset-creator.svg?style=flat :alt: PyPI Package latest release :target: https://pypi.python.org/pypi/dataset-creator
.. |wheel| image:: https://img.shields.io/pypi/wheel/dataset-creator.svg?style=flat :alt: PyPI Wheel :target: https://pypi.python.org/pypi/dataset-creator
.. |supported-versions| image:: https://img.shields.io/pypi/pyversions/dataset-creator.svg?style=flat :alt: Supported versions :target: https://pypi.python.org/pypi/dataset-creator
.. |supported-implementations| image:: https://img.shields.io/pypi/implementation/dataset-creator.svg?style=flat :alt: Supported implementations :target: https://pypi.python.org/pypi/dataset-creator
dataset-creator.readthedocs.org <http://dataset-creator.readthedocs.org/en/latest/>
_Takes SeqRecordExpanded objects and creates datasets for phylogenetic software such as MrBayes, TNT, BEAST, RAxML, MEGA, etc.
First::
pip install dataset_creator
Then the list of SeqRecordExpanded objects should be sorted by gene_code first then by voucher_code.
.. code-block:: python
>>> from seqrecord_expanded import SeqRecord
>>> from dataset_creator import Dataset
>>>
>>> # `table` is the Translation Table code based on NCBI
>>> seq_record1 = SeqRecord('ACTACCTA', reading_frame=2, gene_code='RpS5',
... table=1, voucher_code='CP100-10',
... taxonomy={'genus': 'Aus', 'species': 'bus'})
>>>
>>> seq_record2 = SeqRecord('ACTACCTA', reading_frame=2, gene_code='RpS5',
... table=1, voucher_code='CP100-10',
... taxonomy={'genus': 'Aus', 'species': 'bus'})
>>>
>>> seq_record3 = SeqRecord('ACTACCTA', reading_frame=2, gene_code='wingless',
... table=1, voucher_code='CP100-10',
... taxonomy={'genus': 'Aus', 'species': 'bus'})
>>>
>>> seq_record4 = SeqRecord('ACTACCTA', reading_frame=2, gene_code='winglesss',
... table=1, voucher_code='CP100-10',
... taxonomy={'genus': 'Aus', 'species': 'bus'})
>>>
>>> seq_records = [
... seq_record1, seq_record2, seq_record3, seq_record4,
... ]
>>> # codon positions can be 1st, 2nd, 3rd, 1st-2nd, ALL (default)
>>> dataset = Dataset(seq_records, format='TNT', partitioning='by codon position',
... codon_positions='ALL')
>>> dataset = Dataset(seq_records, format='PHYLIP', partitioning='1st-2nd, 3rd',
... codon_positions='ALL')
>>> dataset = Dataset(seq_records, format='NEXUS', partitioning='by gene',
... codon_positions='1st')
>>> dataset = Dataset(seq_records, format='NEXUS', partitioning='by gene',
... codon_positions='ALL', aminoacids=True)
>>> # Produce a dataset of degenerated sequences using the 'S' method:
>>> dataset = Dataset(seq_records, format='NEXUS', partitioning='by gene',
... codon_positions='ALL', degenerate='S')
>>> print(dataset.dataset_str)
#NEXUS
blah blah ...
Further documentation can be found at
dataset-creator.readthedocs.org <http://dataset-creator.readthedocs.org/en/latest/>
_
To run the all tests run::
tox
seqrecord-expanded
.Aus_aus_CP100-10 [org=Aus aus] [Specimen-voucher=CP100-10] [note=ArgKin gene, partial cds.] [Lineage=]
taxonomy
field.FAQs
Takes SeqRecordExpanded objects and creates datasets for phylogenetic software
We found that dataset-creator demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Product
Socket’s new Tier 1 Reachability filters out up to 80% of irrelevant CVEs, so security teams can focus on the vulnerabilities that matter.
Research
/Security News
Ongoing npm supply chain attack spreads to DuckDB: multiple packages compromised with the same wallet-drainer malware.
Security News
The MCP Steering Committee has launched the official MCP Registry in preview, a central hub for discovering and publishing MCP servers.