
Research
Two Malicious Rust Crates Impersonate Popular Logger to Steal Wallet Keys
Socket uncovers malicious Rust crates impersonating fast_log to steal Solana and Ethereum wallet keys from source code.
aibrix-kvcache
Advanced tools
AIBrix KV cache offloading framework provides several common functionalities for cross-engine KV reuse use cases:
Tensor Parallelism Aware Management: When inference engine (e.g., vLLM) uses tensor parallelism, each participating engine instance fetches KV tensors independently of the cache backend. In case of cache misses, before proceeding with prefill computation, participants must align the potentially different number of KV tensors fetched from the external KV cache service to ensure a consistent view.
Embedded Cache w/ CPU Memory: To meet performance requirements, it's common to have a small CPU memory-based cache embedded in the engine to avoid frequently accessing remote cache backends.
Selective KV Cache Offloading: Enables fine-grained control over offloading strategies and thus is crucial in optimizing performance across diverse deployment environments:
AIBrix KV cache offloading framework can be installed by pip
.
pip install aibrix-kvcache
We welcome contributions from the community! Check out our contributing guidelines to see how you can make a difference.
# This may take several minutes
pip install -e .
Before contribute your code, please run the following commands to ensure that your code passes the tests and linting checks.
# install dependencies
pip install -r requirements/build.txt -r requirements/dev.txt -r requirements/core.txt
# linting, formatting and type checking
bash ./scripts/format.sh
AI Runtime is licensed under the APACHE License.
FAQs
AIBrix KV Cache offloading framework for cross-engine KV reuse
We found that aibrix-kvcache demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 0 open source maintainers collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Research
Socket uncovers malicious Rust crates impersonating fast_log to steal Solana and Ethereum wallet keys from source code.
Research
A malicious package uses a QR code as steganography in an innovative technique.
Research
/Security News
Socket identified 80 fake candidates targeting engineering roles, including suspected North Korean operators, exposing the new reality of hiring as a security function.