Security News
Research
Data Theft Repackaged: A Case Study in Malicious Wrapper Packages on npm
The Socket Research Team breaks down a malicious wrapper package that uses obfuscation to harvest credentials and exfiltrate sensitive data.
This project is a c++ implementation with a python wrapper of the Succinct BWT-Based Sequence Prediction
model.
Subseq is a sequence prediction model in a finite alphabet. It is a lossless model (does not discard information while training) and utilizes the succinct Wavelet Tree data structure and the Burrows-Wheeler Transform to compactly store and efficiently access training sequences for prediction.
This implementation is based on the following research paper:
Subseq is published on pypi. pip install subseq
should be enough.
You can test the model with the following code:
from subseq.subseq import Subseq
model = Subseq(1)
model.fit([['hello', 'world']])
model.predict(['hello'])
# Output: ['world']
The model can be trained with the fit
method.
Subseq has only 1 meta parameter that need to be tuned. threshold_query
, the number of similar queries that needs to be retrieved to make a confident prediction.
A threshold_query
at 0 does not limit the number of query.
The benchmark has been made on the FIFA dataset, the data can be found on the SPMF website.
Details on the benchmark can be found here.
FAQs
Succinct BWT-Based SequencePrediction
We found that subseq demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Research
The Socket Research Team breaks down a malicious wrapper package that uses obfuscation to harvest credentials and exfiltrate sensitive data.
Research
Security News
Attackers used a malicious npm package typosquatting a popular ESLint plugin to steal sensitive data, execute commands, and exploit developer systems.
Security News
The Ultralytics' PyPI Package was compromised four times in one weekend through GitHub Actions cache poisoning and failure to rotate previously compromised API tokens.