semantic-chunking
Advanced tools
Changelog
[2.3.0] - 2024-11-11
transformers.js
from v2 to v3onnxEmbeddingModelQuantized
(boolean) to dtype
('p32', 'p16', 'q8', 'q4')dtype
optionChangelog
[2.2.4] - 2024-11-08
Changelog
[2.2.1] - 2024-11-06
Changelog
[2.2.0] - 2024-11-05
excludeChunkPrefixInResults
option to chunkit
and cramit
functions
Changelog
[2.1.4] - 2024-03-01
cramit
example script to use updated document object input format.Changelog
[2.1.3] - 2024-11-04
cramit
function to properly pack sentences up to maxTokenSizeChangelog
[2.1.2] - 2024-11-04
Changelog
[2.1.1] - 2024-11-01
Changelog
[2.1.0] - 2024-11-01
document_id
: Timestamp in milliseconds when processing starteddocument_name
: Original document name or ""number_of_chunks
: Total number of chunks for the documentchunk_number
: Current chunk number (1-based)model_name
: Name of the embedding model usedis_model_quantized
: Whether the model is quantized