
Security News
PolinRider: North Korea-Linked Supply Chain Campaign Expands Across Open Source Ecosystems
PolinRider expands across npm, Packagist, Go modules, and Chrome extensions, using hidden loaders to target developer environments.
AI SDK middleware for idempotent LLM calls. Wrap once, every retry is free.
import { openai } from '@ai-sdk/openai';
import { generateText, wrapLanguageModel } from 'ai';
import { idempotencyMiddleware } from '@oncely/ai';
// ❌ Before: Every call costs money
const model = openai('gpt-4-turbo');
await generateText({ model, prompt: 'Hello' }); // API call → $0.01
await generateText({ model, prompt: 'Hello' }); // API call → $0.01
await generateText({ model, prompt: 'Hello' }); // API call → $0.01
// Total: $0.03 for the same response 3x 💸
// ✅ After: Add one wrapper
const idempotentModel = wrapLanguageModel({
model: openai('gpt-4-turbo'),
middleware: idempotencyMiddleware(),
});
await generateText({ model: idempotentModel, prompt: 'Hello' }); // API call → $0.01
await generateText({ model: idempotentModel, prompt: 'Hello' }); // Cache hit → $0.00 ✨
await generateText({ model: idempotentModel, prompt: 'Hello' }); // Cache hit → $0.00 ✨
// Total: $0.01 — saved 66% 🎉
npm install @oncely/ai @oncely/core ai
For production, add a storage adapter:
npm install @oncely/redis ioredis # Standard Redis
npm install @oncely/upstash # Serverless (Upstash, Vercel KV)
import { wrapLanguageModel } from 'ai';
import { idempotencyMiddleware } from '@oncely/ai';
const model = wrapLanguageModel({
model: yourModel,
middleware: idempotencyMiddleware(),
});
import { wrapLanguageModel } from 'ai';
import { idempotencyMiddleware } from '@oncely/ai';
import { redis } from '@oncely/redis';
const model = wrapLanguageModel({
model: openai('gpt-4-turbo'),
middleware: idempotencyMiddleware({
storage: redis(),
ttl: '5m',
}),
});
import { idempotencyMiddleware } from '@oncely/ai';
import { upstash } from '@oncely/upstash';
const model = wrapLanguageModel({
model: anthropic('claude-3-opus'),
middleware: idempotencyMiddleware({
storage: upstash(),
ttl: '10m',
}),
});
By default, the middleware generates a cache key by hashing:
Same inputs = same key = cached response.
Pass an explicit key via providerOptions:
const result = await generateText({
model,
prompt: 'Hello',
providerOptions: {
oncely: { key: 'user-123-greeting' },
},
});
const model = wrapLanguageModel({
model: yourModel,
middleware: idempotencyMiddleware({
getKey: (params) => {
// Your custom key logic
return `custom:${hashObject(params.prompt)}`;
},
}),
});
| Option | Type | Default | Description |
|---|---|---|---|
storage | StorageAdapter | MemoryStorage | Storage backend |
ttl | string | number | '5m' | Cache duration |
getKey | (params) => string | Auto-hash | Custom key generation |
includeModelInKey | boolean | true | Include model ID in cache key |
onHit | (key, response) => void | — | Callback on cache hit |
onMiss | (key) => void | — | Callback on cache miss |
Override settings per-request via providerOptions.oncely:
const result = await generateText({
model,
prompt: 'Hello',
providerOptions: {
oncely: {
key: 'explicit-key', // Override auto-generated key
ttl: '1h', // Override TTL for this request
skip: true, // Skip idempotency entirely
},
},
});
The middleware works with any AI SDK provider:
@ai-sdk/openai)@ai-sdk/anthropic)@ai-sdk/google)@ai-sdk/mistral)@ai-sdk/cohere)Works with both generateText and streamText:
const result = await streamText({
model,
prompt: 'Write a poem',
});
// Cached streams are replayed from storage
for await (const chunk of result.textStream) {
console.log(chunk);
}
import { wrapLanguageModel } from 'ai';
import { idempotencyMiddleware } from '@oncely/ai';
const model = wrapLanguageModel({
model: openai('gpt-4-turbo'),
middleware: [
idempotencyMiddleware({ storage: redis() }),
loggingMiddleware(),
rateLimitMiddleware(),
],
});
app.post('/api/chat', async (c) => {
const { message, userId } = await c.req.json();
// User can spam the send button — you only pay once
const { text } = await generateText({
model: idempotentModel,
prompt: message,
providerOptions: {
oncely: { key: `chat:${userId}:${hash(message)}` },
},
});
return c.json({ response: text });
});
const sendEmail = tool({
description: 'Send an email',
parameters: z.object({ to: z.string(), subject: z.string(), body: z.string() }),
execute: async (params) => {
// Without idempotency: retry = duplicate email sent
// With idempotency: retry = cached result, no duplicate
return await emailService.send(params);
},
});
const { text } = await generateText({
model: idempotentModel,
tools: { sendEmail },
prompt: task,
providerOptions: {
oncely: { key: `agent:${taskId}` }, // Entire agent run is idempotent
},
});
for (const item of items) {
// Crash at item 50/100? Restart and items 1-49 are instant from cache
const { text } = await generateText({
model: idempotentModel,
prompt: `Summarize: ${item.content}`,
providerOptions: {
oncely: { key: `batch:${item.id}` },
},
});
}
let tokensSaved = 0;
const model = wrapLanguageModel({
model: openai('gpt-4-turbo'),
middleware: idempotencyMiddleware({
storage: redis(),
onHit: (key, response) => {
tokensSaved += response.data?.usage?.totalTokens ?? 0;
console.log(`💰 Saved ${tokensSaved} tokens so far`);
},
}),
});
MIT
FAQs
AI SDK middleware for oncely idempotency - prevent duplicate LLM calls
We found that @oncely/ai demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
PolinRider expands across npm, Packagist, Go modules, and Chrome extensions, using hidden loaders to target developer environments.

Security News
Open source attacks are accelerating as AI coding agents pull in dependencies faster, with less human review.

Research
/Security News
Malicious Chrome and Firefox extensions posed as free VPNs while stealing clipboard data through later extension updates.