Feature extraction is done with a combination of on-chain data and OCR using the tesseract.js library. Classification is done with naive bayes and a hand-picked set of spam and ham cNFTs.

Live Example

You can try a live (slow + heavily rate limited) example of the library running on AWS Lambda here:

https://api.filtoor.xyz/classify?address=A1xhLVywcq6SeZnmRG1pUzoSWxVMpS6J5ShEbt3smQJr

Try a new cNFT by replacing the address={...} parameter. The classifier will either spit out "spam" or "ham" (or "error" if something went wrong).

If you'd like to use this API in your production project, please DM me to get set up!

Installation

First, install the library:

npm i cnft-spam-filter

then import the requisite function:

const { extractAndClassify } = require("cnft-spam-filter")

import { extractAndClassify } from "cnft-spam-filter"

Finally, call the function wherever you want to classify:

const classification = await extractAndClassify(assetId, rpcUrl);

Note that you'll need to bring your own rpcUrl that supports the DAS api--I recommend Helius for their generous free plan https://www.helius.dev/.

Examples

You can find a few lightweight examples of how to use the library in different environments in the /examples folder of the repository.

cnft-spam-filter aims to be portable, so you can run it in pretty much any environment that you want.

Training

You can train your own model and pass it to classify(tokens, model). Code for this is in the /train folder.

You'll see spam_ids.json and ham_ids.json there; these are the cNFTs used to train the model.

Testing

You can test the accuracy of a model using the code in the /test folder. Make sure that your training set and test set do not overlap.

Usage in Production

If you want to use cnft-spam-filter in production, we recommend setting up a caching layer so that you don't have to analyze each cNFT multiple times. This should be done at your own app level: you can use redis, a database, localstorage--whatever you want.

Contributing

Feel free to open pull requests to contribute if you think this is interesting! I will try to get to them as best as I can. There are definitely some tasks that need to be implemented.

License

All code is released under the MIT license -- go crazy.

Solana asset donations are appreciated but not required by any means:

filtoor.sol

Keywords

solana

FAQs

What is cnft-spam-filter?

Is cnft-spam-filter popular?

Is cnft-spam-filter well maintained?

Package last updated on 07 Feb 2024

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install