
weavebot-core
Generic content processing framework for web scraping and AI extraction.
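@weavebot/core is a lightweight, plugin-based framework for extracting structured data from web content. It supplies the pipeline infrastructure (schema registration, processor chaining, scraper plugins, and a storage interface) and leaves the site-specific and backend-specific logic to you, so you can build custom content processing pipelines.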
npm install @weavebot/core
import ContentProcessor, {
  createWebScraper,
  createAIExtractor,
  SchemaRegistry
} from '@weavebot/core';
import { z } from 'zod';

// Create processor instance
const processor = new ContentProcessor();

// Register your schema
const ArticleSchema = z.object({
  title: z.string(),
  author: z.string(),
  content: z.string(),
  publishedAt: z.date()
});
processor.registerSchema('article', ArticleSchema);

// Set up processors
const scraper = createWebScraper();
const extractor = createAIExtractor({
  provider: 'openai',
  apiKey: process.env.OPENAI_API_KEY
});

// Register extraction configuration
extractor.registerExtractor('article', {
  schema: ArticleSchema,
  systemPrompt: 'Extract article information from the content',
  userPromptTemplate: 'Extract article from: {{content}}'
});

processor.addProcessor('web-scraper', scraper);
processor.addProcessor('ai-extractor', extractor);

// Process a URL
const result = await processor.process({
  type: 'url',
  data: 'https://example.com/article',
  schema: 'article'
});
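The `userPromptTemplate` above contains a `{{content}}` placeholder. The README does not say how the framework fills it in, so the following is only a minimal sketch of one plausible substitution step. `renderTemplate` is a hypothetical helper written for this illustration and is not part of @weavebot/core.

```typescript
// Hypothetical helper: NOT part of @weavebot/core. It only illustrates how a
// `{{name}}` placeholder in a prompt template could be filled in.
function renderTemplate(template: string, values: Record<string, string>): string {
  // A replacer function is used so that `$` sequences inside the scraped
  // content are inserted literally instead of being read as replacement patterns.
  return template.replace(/\{\{\s*(\w+)\s*\}\}/g, (match: string, key: string) =>
    // Unknown placeholders are left untouched so missing values are easy to spot.
    Object.prototype.hasOwnProperty.call(values, key) ? values[key] : match
  );
}

const prompt = renderTemplate('Extract article from: {{content}}', {
  content: '<h1>Hello</h1><p>By Jane Doe</p>',
});
console.log(prompt);
```

Leaving unknown placeholders in place is a design choice for this sketch: a stray `{{author}}` in the final prompt is easier to notice than a silently empty string.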
Create platform-specific plugins for the web scraper:
import { WebScraperPlugin } from '@weavebot/core';

class MyPlatformPlugin implements WebScraperPlugin {
  name = 'my-platform';

  canHandle(url: string): boolean {
    return url.includes('myplatform.com');
  }

  getConfig(url: string) {
    return {
      strategy: 'spa',
      waitSelectors: ['.content-loaded'],
      timeout: 10000
    };
  }
}

scraper.registerPlugin(new MyPlatformPlugin());
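Note that `url.includes('myplatform.com')` also matches URLs that merely mention the domain, for example in a query string. The sketch below shows a stricter hostname check and one way a scraper might pick a plugin for a URL. The `PluginLike` and `ScrapeConfig` interfaces, the `resolveConfig` function, and the `'static'` fallback strategy are assumptions made for this illustration. The real `WebScraperPlugin` type and the framework's dispatch logic may differ.

```typescript
// Local stand-ins for illustration; the real WebScraperPlugin type in
// @weavebot/core may have more members or different config fields.
interface ScrapeConfig {
  strategy: string;
  waitSelectors?: string[];
  timeout?: number;
}

interface PluginLike {
  name: string;
  canHandle(url: string): boolean;
  getConfig(url: string): ScrapeConfig;
}

// Matches on the parsed hostname, so 'https://other.com/?ref=myplatform.com'
// is not claimed by this plugin the way a substring check would claim it.
class HostnamePlugin implements PluginLike {
  name = 'my-platform';

  canHandle(url: string): boolean {
    try {
      const host = new URL(url).hostname;
      return host === 'myplatform.com' || host.endsWith('.myplatform.com');
    } catch {
      return false; // not a valid absolute URL
    }
  }

  getConfig(_url: string): ScrapeConfig {
    return { strategy: 'spa', waitSelectors: ['.content-loaded'], timeout: 10000 };
  }
}

// Hypothetical dispatch: the first plugin that accepts the URL wins,
// otherwise an assumed default configuration is returned.
const DEFAULT_CONFIG: ScrapeConfig = { strategy: 'static' };

function resolveConfig(plugins: PluginLike[], url: string): ScrapeConfig {
  const plugin = plugins.find((p) => p.canHandle(url));
  return plugin ? plugin.getConfig(url) : DEFAULT_CONFIG;
}
```

With first-match dispatch, registration order matters: a broad plugin registered early would shadow a more specific one registered later.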
Implement the generic storage interface for your backend:
import { StorageAdapter } from '@weavebot/core';

class MyStorageAdapter implements StorageAdapter {
  async initialize(config) { /* ... */ }
  async create(collection, data) { /* ... */ }
  async read(collection, id) { /* ... */ }
  async update(collection, id, data) { /* ... */ }
  async delete(collection, id) { /* ... */ }
  async query(collection, filter) { /* ... */ }
}

processor.addStorage('my-storage', new MyStorageAdapter());
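To make the six methods concrete, here is a minimal in-memory sketch that could serve as a test double. The README gives only the method names, so the parameter and return types below are assumptions made for this illustration. They are declared in a local `StorageAdapterLike` interface rather than imported from @weavebot/core.

```typescript
// Assumed shapes: the source lists only the six method names, so these
// parameter and return types are guesses made for illustration.
type StoredRecord = Record<string, unknown> & { id: string };
type Fields = Record<string, unknown>;

interface StorageAdapterLike {
  initialize(config?: unknown): Promise<void>;
  create(collection: string, data: Fields): Promise<StoredRecord>;
  read(collection: string, id: string): Promise<StoredRecord | undefined>;
  update(collection: string, id: string, data: Fields): Promise<StoredRecord | undefined>;
  delete(collection: string, id: string): Promise<boolean>;
  query(collection: string, filter: Fields): Promise<StoredRecord[]>;
}

class InMemoryStorageAdapter implements StorageAdapterLike {
  private collections = new Map<string, Map<string, StoredRecord>>();
  private nextId = 1;

  // Resets all state; the config argument is ignored in this sketch.
  async initialize(_config?: unknown): Promise<void> {
    this.collections.clear();
    this.nextId = 1;
  }

  // Returns the map for a collection, creating it on first use.
  private table(collection: string): Map<string, StoredRecord> {
    let table = this.collections.get(collection);
    if (!table) {
      table = new Map<string, StoredRecord>();
      this.collections.set(collection, table);
    }
    return table;
  }

  async create(collection: string, data: Fields): Promise<StoredRecord> {
    const record: StoredRecord = { ...data, id: String(this.nextId++) };
    this.table(collection).set(record.id, record);
    return record;
  }

  async read(collection: string, id: string): Promise<StoredRecord | undefined> {
    return this.table(collection).get(id);
  }

  // Shallow-merges new fields into the existing record; the id is preserved.
  async update(collection: string, id: string, data: Fields): Promise<StoredRecord | undefined> {
    const existing = this.table(collection).get(id);
    if (!existing) return undefined;
    const updated: StoredRecord = { ...existing, ...data, id };
    this.table(collection).set(id, updated);
    return updated;
  }

  async delete(collection: string, id: string): Promise<boolean> {
    return this.table(collection).delete(id);
  }

  // Returns records whose fields strictly equal every entry in the filter.
  async query(collection: string, filter: Fields): Promise<StoredRecord[]> {
    return Array.from(this.table(collection).values()).filter((record) =>
      Object.keys(filter).every((key) => record[key] === filter[key])
    );
  }
}
```

A real backend would replace the maps with database calls. Keeping the methods `async` here means callers written against this double should not need to change when that happens.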
For complete documentation, visit the GitHub repository.
License: MIT