New Case Study:See how Anthropic automated 95% of dependency reviews with Socket.Learn More
Socket
Sign inDemoInstall
Socket

tfkit

Package Overview
Dependencies
Maintainers
1
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

tfkit

Transformers kit - Multi-task QA/Tagging/Multi-label Multi-Class Classification/Generation with BERT/ALBERT/T5/BERT

  • 0.8.20
  • PyPI
  • Socket score

Maintainers
1




PyPI Download Build Last Commit CodeFactor Visitor

What is it

TFKit is a tool kit mainly for language generation.
It leverages the use of transformers on many tasks with different models in this all-in-one framework.
All you need is a little change of config.

Task Supported

With transformer models - BERT/ALBERT/T5/BART......

Text Generation:memo: seq2seq language model
Text Generation:pen: causal language model
Text Generation:printer: once generation model / once generation model with ctc loss
Text Generation:pencil: onebyone generation model

Getting Started

Learn more from the document.

How To Use

Step 0: Install

Simple installation from PyPI

pip install git+https://github.com/voidful/TFkit.git@refactor-dataset

Step 1: Prepare dataset in csv format

Task format

input, target

Step 2: Train model

tfkit-train \
--task clas \
--config xlm-roberta-base \
--train training_data.csv \
--test testing_data.csv \
--lr 4e-5 \
--maxlen 384 \
--epoch 10 \
--savedir roberta_sentiment_classificer

Step 3: Evaluate

tfkit-eval \
--task roberta_sentiment_classificer/1.pt \
--metric clas \
--valid testing_data.csv

Advanced features

Multi-task training
tfkit-train \
  --task clas clas \
  --config xlm-roberta-base \
  --train training_data_taskA.csv training_data_taskB.csv \
  --test testing_data_taskA.csv testing_data_taskB.csv \
  --lr 4e-5 \
  --maxlen 384 \
  --epoch 10 \
  --savedir roberta_sentiment_classificer_multi_task

Not maintained task

Due to time constraints, the following tasks are temporarily not supported

Classification:label: multi-class and multi-label classification
Question Answering:page_with_curl: extractive qa
Question Answering:radio_button: multiple-choice qa
Tagging:eye_speech_bubble: sequence level tagging / sequence level with crf
Self-supervise Learning:diving_mask: mask language model

Supplement

  • transformers models list: you can find any pretrained models here
  • nlprep: download and preprocessing data in one line
  • nlp2go: create demo api as quickly as possible.

Contributing

Thanks for your interest.There are many ways to contribute to this project. Get started here.

License PyPI - License

Icons reference

Icons modify from Freepik from www.flaticon.com
Icons modify from Nikita Golubev from www.flaticon.com

Keywords

FAQs


Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap
  • Changelog

Packages

npm

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc