IMG.LY AI Generation Utilities
A powerful toolkit for implementing AI generation providers in CreativeEditor SDK.
Overview
Note: This package is only relevant if you need to create new AI providers or extend existing functionality. For simple integration of AI features, use the @imgly/plugin-ai-apps-web package instead.
This package provides the foundation for creating AI generation plugins for CreativeEditor SDK. It offers a standardized interface for implementing AI generation providers that can create images, videos, audio, or text assets. The package includes utilities for handling:
- Provider registration and initialization
- User interface generation
- Global action registry for quick actions and plugin actions
- Type-safe quick action definitions
- Cross-plugin action support
Getting Started
Installation
npm install @imgly/plugin-ai-generation-web
Creating a Custom Provider
The core of this package is the Provider interface, which defines the contract for AI generation providers. Here's how to implement a basic provider:
import {
  Provider,
  ImageOutput,
  initializeProvider,
  loggingMiddleware,
  CommonProviderConfiguration
} from '@imgly/plugin-ai-generation-web';

// MyInputType, myAIApi, and myApiSchema are placeholders for your own
// input type, API client, and OpenAPI document.

// Provider configuration: the common options plus provider-specific ones
interface MyProviderConfiguration
  extends CommonProviderConfiguration<MyInputType, ImageOutput> {
  // Provider-specific option used in this example
  baseURL?: string;
}

function createMyImageProvider(
  config: MyProviderConfiguration
): Provider<'image', MyInputType, ImageOutput> {
  return {
    id: 'my-image-provider',
    kind: 'image',
    // Setup function called when the provider is loaded
    initialize: async ({ engine, cesdk }) => {
      myAIApi.configure({
        apiKey: 'YOUR_API_KEY',
        headers: config.headers
      });
    },
    input: {
      // Input panel generated from an OpenAPI schema
      panel: {
        type: 'schema',
        document: myApiSchema,
        inputReference: '#/components/schemas/GenerationInput',
        getBlockInput: async (input) => ({
          image: { width: 1024, height: 1024 }
        })
      },
      // Quick actions this provider supports, and how their inputs
      // map to the provider's input
      quickActions: {
        supported: {
          'ly.img.editImage': {
            mapInput: (input) => ({
              prompt: input.prompt,
              image_url: input.uri
            })
          },
          'ly.img.styleTransfer': {
            mapInput: (input) => ({
              prompt: input.style,
              image_url: input.uri
            })
          }
        }
      }
    },
    output: {
      abortable: true,
      history: '@imgly/indexedDB',
      middleware: [loggingMiddleware()],
      notification: {
        success: {
          show: true,
          message: 'Generation successful!'
        }
      },
      generate: async (input, { abortSignal, engine }) => {
        // abortSignal fires when the user cancels; pass it on if your
        // API client supports cancellation
        const response = await myAIApi.generateImage(input, {
          headers: config.headers
        });
        return {
          kind: 'image',
          url: response.imageUrl
        };
      }
    }
  };
}

// Create the provider with its configuration
const myImageProvider = createMyImageProvider({
  proxyUrl: 'http://your-proxy-server.com/api/proxy',
  headers: {
    'x-client-version': '1.0.0',
    'x-request-source': 'cesdk-plugin'
  },
  debug: false,
  middleware: [loggingMiddleware()],
  baseURL: 'https://assets.example.com'
});
Action Registry
The package includes a global ActionRegistry for managing quick actions and plugin actions. To register a new action:
import { ActionRegistry } from '@imgly/plugin-ai-generation-web';

const registry = ActionRegistry.get();
const unregister = registry.register({
  id: 'my-quick-action',
  type: 'quick',
  kind: 'image',
  label: 'My Quick Action',
  enable: true,
  render: (context) => {
    context.builder.Button('my-button', {
      label: 'Generate',
      onClick: async () => {
        await context.generate({ prompt: 'Hello world' });
      }
    });
  }
});
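The register call above returns a function that removes the action again. A minimal sketch of that pattern follows; TinyRegistry and ActionLike are hypothetical local stand-ins written for illustration, not the package's ActionRegistry.

```typescript
// Hypothetical stand-in for illustration; not the package's ActionRegistry.
interface ActionLike {
  id: string;
  kind: 'image' | 'video' | 'audio' | 'text';
}

class TinyRegistry {
  private actions = new Map<string, ActionLike>();

  // Registers an action and returns a function that removes it again.
  register(action: ActionLike): () => void {
    this.actions.set(action.id, action);
    return () => {
      this.actions.delete(action.id);
    };
  }

  has(id: string): boolean {
    return this.actions.has(id);
  }
}

const tinyRegistry = new TinyRegistry();
const unregisterAction = tinyRegistry.register({
  id: 'my-quick-action',
  kind: 'image'
});
const registeredBefore = tinyRegistry.has('my-quick-action');

// Calling the returned function undoes the registration.
unregisterAction();
const registeredAfter = tinyRegistry.has('my-quick-action');
```

Keeping the returned function around is useful when a plugin is disposed and its actions should disappear with it.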
Provider Interface
The Provider interface is generic and type-safe. It supports four output kinds ('image', 'video', 'audio', and 'text'):
interface Provider<K extends OutputKind, I, O extends Output, C = O> { ... }
Common Provider Configuration
All providers should extend the CommonProviderConfiguration interface, which provides standardized configuration options:
interface CommonProviderConfiguration<I, O extends Output> {
  proxyUrl: string;
  debug?: boolean;
  middleware?: Middleware<I, O>[];
  headers?: Record<string, string>;
}
The headers property allows you to include custom HTTP headers in all API requests made by your provider. This is useful for:
- Adding custom client identification headers
- Including version information
- Passing through metadata required by your API
- Adding correlation IDs for request tracing
Implementation Note: When implementing your provider's generate function, merge the custom headers with any headers your API requires:
const response = await fetch(apiUrl, {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    'Authorization': `Bearer ${apiKey}`,
    // Spread last, so custom headers override the defaults above
    ...config.headers
  },
  body: JSON.stringify(requestData)
});
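The merge order in the note above matters: whatever is spread last wins when two headers share a key. A small sketch of that behaviour follows; buildHeaders and the key value are hypothetical and exist only for this example.

```typescript
// Hypothetical helper for illustration; buildHeaders is not part of the package.
function buildHeaders(
  apiKey: string,
  customHeaders: Record<string, string> = {}
): Record<string, string> {
  return {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${apiKey}`,
    // Spread last, so custom headers win on a key collision.
    ...customHeaders
  };
}

const mergedHeaders = buildHeaders('test-key', {
  'x-client-version': '1.0.0',
  'Content-Type': 'application/json; charset=utf-8'
});
```

Plain object keys are case-sensitive, so a custom 'content-type' would not replace 'Content-Type' here. If callers may pass arbitrary casing, normalizing the keys before merging is one option.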
Key Provider Options
- id: Unique identifier for your provider
- kind: Type of asset generated ('image', 'video', 'audio', 'text')
- name: Optional human-readable name
- initialize: Setup function called when the provider is loaded
- input: Configuration for input UI and parameters
- output: Configuration for generation and result handling
Provider Output Options
The output property has several important options:
- generate: Main function that performs the actual generation
- history: Asset storage strategy (false, '@imgly/local', '@imgly/indexedDB', or custom ID)
- abortable: Whether generation can be cancelled by the user
- middleware: Array of middleware functions for pre/post-processing
- notification: Success and error notification configuration
- generationHintText: Text to display below the generation button
Notification Configuration
The notification system allows fine-grained control over success and error messages:
notification: {
  success: {
    show: true,
    message: 'Generation successful!',
    action: {
      label: 'View',
      onClick: (context) => { }
    },
    duration: 'short'
  },
  error: {
    show: true,
    message: 'Generation failed',
  }
}
Streaming Generation
The generate function can return a simple output object or an AsyncGenerator for streaming results:
// Option 1: return a single result
generate: async (input, options) => {
  const result = await api.generateImage(input);
  return { kind: 'image', url: result.url };
}

// Option 2: stream partial results
generate: async function* (input, options) {
  const stream = api.streamGenerationResult(input);
  let inferredText: string = '';
  // Yield the accumulated text after each chunk
  for await (const chunk of stream) {
    inferredText += chunk;
    yield { kind: 'text', text: inferredText };
  }
  // The return value is the final result
  return { kind: 'text', text: inferredText };
}
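To make the streaming variant concrete, here is a self-contained sketch of the same accumulate-and-yield pattern together with a consumer. The names fakeStream, generateText, and collect are hypothetical, and the local TextOutput type only mirrors the shape used above.

```typescript
// Local stand-in for the text output shape used in this section.
type TextOutput = { kind: 'text'; text: string };

// Hypothetical chunk source standing in for api.streamGenerationResult.
async function* fakeStream(): AsyncGenerator<string> {
  for (const chunk of ['Hello', ', ', 'world']) {
    yield chunk;
  }
}

// Yields the accumulated text after every chunk, then returns the final text.
async function* generateText(): AsyncGenerator<TextOutput, TextOutput> {
  let inferredText = '';
  for await (const chunk of fakeStream()) {
    inferredText += chunk;
    yield { kind: 'text', text: inferredText };
  }
  return { kind: 'text', text: inferredText };
}

// Drives the generator by hand so the return value is not lost.
async function collect(): Promise<{ partials: string[]; final: string }> {
  const partials: string[] = [];
  const iterator = generateText();
  let step = await iterator.next();
  while (!step.done) {
    partials.push(step.value.text);
    step = await iterator.next();
  }
  return { partials, final: step.value.text };
}
```

A for await loop discards a generator's return value, which is why collect calls next() directly. Each yielded value carries the full text so far rather than only the newest chunk, so a consumer can simply display the latest value.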
Generation Hint Text
The generationHintText property allows providers to display helpful information below the generation button:
generationHintText: "Generation may take up to a minute. You can close this panel and will be notified when ready."
Input Panel Types
The package supports two approaches for creating input panels:
1. Schema-based Input Panels
The schema type uses an OpenAPI specification to declaratively define your input form.
input: {
  panel: {
    type: 'schema',
    document: myOpenAPISchema,
    inputReference: '#/components/schemas/GenerationInput',
    orderExtensionKeyword: 'x-order-properties',
    getBlockInput: async (input) => ({
      image: { width: 1024, height: 1024 }
    }),
    renderCustomProperty: {
      imageUrl: (context, property) => {
        const valueState = context.state('imageUrl', '');
        context.builder.TextInput('imageUrl', {
          inputLabel: 'Image URL',
          ...valueState
        });
        // Return a getter for the property's current value. The object
        // literal must be wrapped in parentheses to be returned.
        return () => ({ id: property.id, type: 'string', value: valueState.value });
      }
    }
  }
}
OpenAPI Schema Example
{
  "openapi": "3.0.0",
  "components": {
    "schemas": {
      "GenerationInput": {
        "type": "object",
        "required": ["prompt"],
        "properties": {
          "prompt": {
            "type": "string",
            "title": "Prompt",
            "description": "Describe what you want to generate",
            "x-imgly-builder": {
              "component": "TextArea"
            }
          },
          "width": {
            "type": "integer",
            "title": "Width",
            "default": 1024,
            "enum": [512, 1024, 2048],
            "x-imgly-builder": {
              "component": "Select"
            }
          }
        },
        "x-order-properties": ["prompt", "width"]
      }
    }
  }
}
Benefits of Schema-based Input
- Built-in validation based on schema constraints
- AI providers like fal.ai provide schemas for their models
- Automatic UI component generation based on property types
- Extensions like x-imgly-builder to specify component types
- Property ordering via orderExtensionKeyword
- Customizable property rendering with renderCustomProperty
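As an illustration of property ordering, the sketch below reads an ordering keyword such as x-order-properties from a schema object and returns the property names in that order. This is one plausible rule written for this example, not necessarily what the package does internally: unknown names in the list are ignored and unlisted properties are appended in declaration order. ObjectSchema and orderedPropertyKeys are hypothetical names.

```typescript
// Local, simplified schema type; real OpenAPI schemas carry more fields.
interface ObjectSchema {
  properties: Record<string, unknown>;
  [extension: string]: unknown;
}

// Returns property names ordered by the given extension keyword.
// Assumption for this sketch: unlisted properties keep their declared
// order and come after the listed ones.
function orderedPropertyKeys(schema: ObjectSchema, orderKeyword: string): string[] {
  const declared = Object.keys(schema.properties);
  const order = schema[orderKeyword];
  if (!Array.isArray(order)) {
    return declared;
  }
  const listed = order.filter(
    (key): key is string => typeof key === 'string' && declared.includes(key)
  );
  const rest = declared.filter((key) => !listed.includes(key));
  return [...listed, ...rest];
}

const exampleSchema: ObjectSchema = {
  properties: { width: {}, prompt: {}, seed: {} },
  'x-order-properties': ['prompt', 'width', 'missing']
};

const ordered = orderedPropertyKeys(exampleSchema, 'x-order-properties');
const unordered = orderedPropertyKeys({ properties: { b: {}, a: {} } }, 'x-order-properties');
```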
2. Custom Input Panels
The custom type gives you complete control over UI components. For more details on how to build custom panels and see all available builder components, refer to the Create a Custom Panel guide.
input: {
  panel: {
    type: 'custom',
    render: (context, options) => {
      const promptState = context.state('prompt', '');
      context.builder.TextArea('prompt', {
        inputLabel: 'Prompt',
        ...promptState
      });
      const widthState = context.state('width', 1024);
      context.builder.Select('width', {
        inputLabel: 'Width',
        options: [
          { value: 512, label: '512px' },
          { value: 1024, label: '1024px' },
          { value: 2048, label: '2048px' }
        ],
        ...widthState
      });
      return {
        getInput: () => ({
          prompt: promptState.value,
          width: widthState.value
        }),
        getBlockInput: () => ({
          image: {
            width: widthState.value,
            height: widthState.value,
            label: `AI Image: ${promptState.value.substring(0, 20)}...`
          }
        })
      };
    }
  }
}
Benefits of Custom Input Panels
- Complete control over UI components and layout
- Complex logic between fields (dependencies, conditionals)
- Dynamic UI that changes based on user interactions
Panel User Flow Options
Both panel types accept additional configuration:
panel: {
  type: 'schema',
  userFlow: 'placeholder',
  includeHistoryLibrary: true
}
The getBlockInput Function
The getBlockInput function is crucial for both panel types. It converts your input into the parameters needed to create a block in CreativeEditor SDK.
What It Does
- Defines dimensions, duration, and appearance of asset blocks
- Creates placeholders before generation completes
- Maps your AI provider's inputs to standardized block parameters
Required Return Values by Output Kind
Each output kind requires specific parameters:
For Images
getBlockInput: async (input) => ({
  image: {
    width: 1024,
    height: 1024,
    label: 'My Image'
  }
});
For Videos
getBlockInput: async (input) => ({
  video: {
    width: 1280,
    height: 720,
    duration: 10,
    label: 'My Video'
  }
});
For Audio
getBlockInput: async (input) => ({
  audio: {
    duration: 30,
    thumbnailUrl: 'path/to/img.jpg',
    label: 'My Audio'
  }
});
For Text
getBlockInput: async (input) => ({
  text: {
    length: 250,
    label: 'My Text'
  }
});
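The examples above return fixed values. In practice the block parameters usually depend on the input. The sketch below derives image dimensions from an aspect ratio field; MyImageInput, its aspect_ratio field, the size table, and toImageBlockInput are all assumptions made for this example.

```typescript
// Hypothetical provider input; your provider's input shape will differ.
interface MyImageInput {
  prompt: string;
  aspect_ratio: '1:1' | '16:9' | '9:16';
}

// Local copy of the image block parameters shown above.
interface ImageBlockInput {
  image: { width: number; height: number; label?: string };
}

// Assumed pixel sizes per aspect ratio, chosen for illustration only.
const SIZES: Record<MyImageInput['aspect_ratio'], { width: number; height: number }> = {
  '1:1': { width: 1024, height: 1024 },
  '16:9': { width: 1280, height: 720 },
  '9:16': { width: 720, height: 1280 }
};

// Maps the provider input to block parameters, truncating long prompts
// to 20 characters for the label.
function toImageBlockInput(input: MyImageInput): ImageBlockInput {
  const { width, height } = SIZES[input.aspect_ratio];
  const label =
    input.prompt.length > 20 ? `${input.prompt.slice(0, 20)}...` : input.prompt;
  return { image: { width, height, label } };
}

const blockInput = toImageBlockInput({
  prompt: 'A lighthouse at dusk in heavy fog',
  aspect_ratio: '16:9'
});
```

A getBlockInput implementation could return the result of such a helper directly, so that the placeholder block already has the size of the image that will eventually replace it.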
Quick Actions
Quick Actions provide context-aware AI generation capabilities directly in CreativeEditor SDK's canvas menu. Unlike panels (which appear in the side panel), quick actions appear when users select elements on the canvas.
Available Quick Action IDs
Here are all the quick action IDs that can be used in the supported field of your provider configuration:
Image Quick Actions
- ly.img.artistTransfer: Transform image in the style of famous artists
  - Input: { artist: string, uri: string }
- ly.img.combineImages: Combine multiple images with instructions
  - Input: { prompt: string, uris: string[], exportFromBlockIds: number[] }
- ly.img.createVariant: Create a variation of the image
  - Input: { prompt: string, uri: string }
- ly.img.editImage: Change image based on description
  - Input: { prompt: string, uri: string }
- ly.img.remixPage: Convert the page into a single image
  - Input: { prompt: string, uri: string }
- ly.img.remixPageWithPrompt: Remix the page with custom instructions
  - Input: { prompt: string, uri: string }
- ly.img.styleTransfer: Transform image into different art styles
  - Input: { style: string, uri: string }
- ly.img.swapBackground: Change the background of the image
  - Input: { prompt: string, uri: string }
- ly.img.gpt-image-1.changeStyleLibrary: Apply different art styles (GPT-specific)
  - Input: { prompt: string, uri: string }
Text Quick Actions
- ly.img.changeTextTo: Change text to a different format or style
  - Input: { prompt: string, customPrompt: string }
- ly.img.changeTone: Change the tone of the text
  - Input: { prompt: string, type: string }
- ly.img.fix: Fix spelling and grammar
  - Input: { prompt: string }
- ly.img.improve: Improve writing quality
  - Input: { prompt: string }
- ly.img.longer: Make text longer
  - Input: { prompt: string }
- ly.img.shorter: Make text shorter
  - Input: { prompt: string }
- ly.img.translate: Translate text to different languages
  - Input: { prompt: string, language: string }
Video Quick Actions
- ly.img.createVideo: Opens the image2video generation panel with the current image
Provider Quick Action Support
Providers declare which quick actions they support and how to map quick action inputs to provider inputs:
const myProvider = {
  input: {
    quickActions: {
      supported: {
        'ly.img.editImage': {
          mapInput: (quickActionInput) => ({
            prompt: quickActionInput.prompt,
            image_url: quickActionInput.uri
          })
        },
        'ly.img.styleTransfer': {
          mapInput: (quickActionInput) => ({
            style: quickActionInput.style,
            image_url: quickActionInput.uri
          })
        }
      }
    }
  }
};
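The mapping can also be typed so that each mapInput receives the input shape of its own quick action. The following sketch uses local types only: QuickActionInputs copies two of the input shapes listed earlier, while ProviderInput, SupportedMap, and the prompt wording are hypothetical.

```typescript
// Local copies of two quick action input shapes from the list above.
interface QuickActionInputs {
  'ly.img.editImage': { prompt: string; uri: string };
  'ly.img.styleTransfer': { style: string; uri: string };
}

// Hypothetical provider input for this example.
interface ProviderInput {
  prompt: string;
  image_url: string;
}

// Each supported entry maps its own quick action input to the provider input.
type SupportedMap = {
  [K in keyof QuickActionInputs]?: {
    mapInput: (input: QuickActionInputs[K]) => ProviderInput;
  };
};

const supported: SupportedMap = {
  'ly.img.editImage': {
    mapInput: (input) => ({ prompt: input.prompt, image_url: input.uri })
  },
  'ly.img.styleTransfer': {
    // This provider has no style field, so the style is folded into the prompt.
    mapInput: (input) => ({
      prompt: `Render in ${input.style} style`,
      image_url: input.uri
    })
  }
};

const mapped = supported['ly.img.styleTransfer']?.mapInput({
  style: 'watercolor',
  uri: 'https://example.com/cat.png'
});
```

With this typing, accessing input.style inside the editImage mapping would be a compile-time error, because that quick action does not provide a style.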
Quick Action Expanded View
Quick actions can have two rendering modes:
- Collapsed View: Shows as a simple button in the quick action menu alongside other actions
- Expanded View: Takes over the entire menu space, hiding other actions while the user interacts with this specific action
The expanded view is useful for quick actions that need user input (like text prompts). When a quick action is expanded, the complete menu is replaced with the expanded interface, and other menu items are not shown until the user either completes the action or cancels back to the collapsed view.
render: ({ builder, isExpanded, toggleExpand }) => {
  if (isExpanded) {
    builder.TextArea('prompt', { });
    builder.ButtonRow('actions', { });
  } else {
    builder.Button('expand', {
      label: 'Edit Image...',
      onClick: toggleExpand
    });
  }
}
Using Your Provider
Once you've created your provider, you need to initialize it with CreativeEditor SDK and integrate it into the UI.
Initializing Your Provider
Use the initializeProvider function to register your provider:
import { initializeProvider } from '@imgly/plugin-ai-generation-web';

// createMyImageProvider is the factory defined in "Creating a Custom Provider"
const myProvider = createMyImageProvider({
  proxyUrl: 'http://your-proxy-server.com/api/proxy',
  headers: {
    'x-custom-header': 'value',
    'x-client-version': '1.0.0'
  }
});

function setupMyProvider(cesdk) {
  const result = initializeProvider(
    myProvider,
    {
      engine: cesdk.engine,
      cesdk
    },
    {
      debug: false,
      dryRun: false
    }
  );
  return result;
}
Panel IDs and Registration
When a provider is initialized, it automatically registers panels with specific IDs:
ly.img.ai.{provider-id}
For example:
- A provider with ID my-image-provider registers a panel with ID ly.img.ai.my-image-provider
- A provider with ID fal-ai/recraft-v3 registers a panel with ID ly.img.ai.fal-ai/recraft-v3
You can programmatically get a panel ID using the getPanelId function:
import { getPanelId } from '@imgly/plugin-ai-generation-web';
const panelId = getPanelId('my-image-provider');
cesdk.ui.openPanel(panelId);
Quick actions are automatically registered in canvas menus with these IDs:
ly.img.ai.{kind}.canvasMenu
For example:
- Image quick actions: ly.img.ai.image.canvasMenu
- Video quick actions: ly.img.ai.video.canvasMenu
- Audio quick actions: ly.img.ai.audio.canvasMenu
- Text quick actions: ly.img.ai.text.canvasMenu
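Both ID patterns described in this section are plain string templates. The sketch below spells them out with two hypothetical helpers, panelIdFor and canvasMenuIdFor, which are written for illustration only; in real code, prefer the exported getPanelId for panel IDs.

```typescript
type OutputKind = 'image' | 'video' | 'audio' | 'text';

// Hypothetical helper mirroring the documented pattern ly.img.ai.{provider-id}.
function panelIdFor(providerId: string): string {
  return `ly.img.ai.${providerId}`;
}

// Hypothetical helper mirroring the documented pattern ly.img.ai.{kind}.canvasMenu.
function canvasMenuIdFor(kind: OutputKind): string {
  return `ly.img.ai.${kind}.canvasMenu`;
}

const examplePanelId = panelIdFor('fal-ai/recraft-v3');
const kinds: OutputKind[] = ['image', 'video', 'audio', 'text'];
const canvasMenuIds = kinds.map(canvasMenuIdFor);
```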
Using with Existing AI Generation Plugins
IMG.LY offers several pre-built AI generation packages that work with this base plugin:
import CreativeEditorSDK from '@cesdk/cesdk-js';
import ImageGeneration from '@imgly/plugin-ai-image-generation-web';
import FalAiImage from '@imgly/plugin-ai-image-generation-web/fal-ai';
import VideoGeneration from '@imgly/plugin-ai-video-generation-web';
import FalAiVideo from '@imgly/plugin-ai-video-generation-web/fal-ai';

CreativeEditorSDK.create(domElement, {
  license: 'your-license-key'
}).then(async (cesdk) => {
  await cesdk.addDefaultAssetSources();
  cesdk.addPlugin(
    ImageGeneration({
      text2image: FalAiImage.RecraftV3({
        proxyUrl: 'http://your-proxy-server.com/api/proxy'
      }),
      image2image: FalAiImage.GeminiFlashEdit({
        proxyUrl: 'http://your-proxy-server.com/api/proxy'
      })
    })
  );
  cesdk.addPlugin(
    VideoGeneration({
      text2video: FalAiVideo.MinimaxVideo01Live({
        proxyUrl: 'http://your-proxy-server.com/api/proxy'
      })
    })
  );
  cesdk.ui.setCanvasMenuOrder([
    'ly.img.ai.image.canvasMenu',
    'ly.img.ai.video.canvasMenu',
    ...cesdk.ui.getCanvasMenuOrder()
  ]);
});
Advanced Features
Middleware
The package includes a middleware system to augment the generation flow:
Rate Limiting Middleware
import { rateLimitMiddleware } from '@imgly/plugin-ai-generation-web';

const rateLimit = rateLimitMiddleware({
  maxRequests: 10,
  timeWindowMs: 60000,
  onRateLimitExceeded: (input, options, info) => {
    console.log(
      `Rate limit exceeded: ${info.currentCount}/${info.maxRequests}`
    );
    return false;
  }
});

const provider = {
  output: {
    middleware: [rateLimit]
  }
};
Note: This middleware provides client-side rate limiting for UI purposes only. Always implement proper server-side rate limiting and authentication for production APIs.
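To show what maxRequests and timeWindowMs mean in practice, here is a minimal sliding-window limiter. It is a sketch written for this explanation and not the package's rateLimitMiddleware; createRateLimiter and the injectable clock are assumptions that keep the example deterministic.

```typescript
// Hypothetical sliding-window limiter; not the package's implementation.
function createRateLimiter(
  maxRequests: number,
  timeWindowMs: number,
  now: () => number = Date.now
): () => boolean {
  let timestamps: number[] = [];
  return function tryAcquire(): boolean {
    const current = now();
    // Keep only the requests that are still inside the time window.
    timestamps = timestamps.filter((time) => current - time < timeWindowMs);
    if (timestamps.length >= maxRequests) {
      return false;
    }
    timestamps.push(current);
    return true;
  };
}

// Drive the limiter with a fake clock: two requests per 1000 ms.
let fakeTime = 0;
const tryAcquire = createRateLimiter(2, 1000, () => fakeTime);
const results: boolean[] = [];
results.push(tryAcquire()); // first request at t = 0
results.push(tryAcquire()); // second request at t = 0
results.push(tryAcquire()); // third request at t = 0 exceeds the limit
fakeTime = 1000;
results.push(tryAcquire()); // the earlier requests have left the window
```

Because the state lives in the browser, reloading the page resets it, which is one reason the note above asks for server-side limits as well.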
Upload Middleware
The uploadMiddleware allows you to upload generated content to your own servers:
import { uploadMiddleware } from '@imgly/plugin-ai-generation-web';

const upload = uploadMiddleware(async (output) => {
  const response = await fetch('https://your-api.example.com/upload', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify(output)
  });
  const result = await response.json();
  return {
    ...output,
    url: result.url
  };
});

const provider = {
  output: {
    middleware: [upload]
  }
};
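Conceptually, each middleware wraps the generation call and can act before or after it. The sketch below shows that wrapping with a deliberately simplified middleware shape, (input, next); the package's actual middleware signature may carry more arguments, and compose, record, and rehost are hypothetical names.

```typescript
// Simplified local types; assumptions for this sketch only.
type ImageResult = { kind: 'image'; url: string };
type Generate<I> = (input: I) => Promise<ImageResult>;
type SimpleMiddleware<I> = (input: I, next: Generate<I>) => Promise<ImageResult>;

// Wraps generate so that the first middleware in the array runs outermost.
function compose<I>(middleware: SimpleMiddleware<I>[], generate: Generate<I>): Generate<I> {
  return middleware.reduceRight<Generate<I>>(
    (next, current) => (input) => current(input, next),
    generate
  );
}

type PromptInput = { prompt: string };
const calls: string[] = [];

// Records when generation starts and finishes.
const record: SimpleMiddleware<PromptInput> = async (input, next) => {
  calls.push('before');
  const output = await next(input);
  calls.push('after');
  return output;
};

// Rewrites the result URL, similar in spirit to re-hosting an upload.
const rehost: SimpleMiddleware<PromptInput> = async (input, next) => {
  const output = await next(input);
  return {
    ...output,
    url: output.url.replace('https://provider.example.com', 'https://assets.example.com')
  };
};

const run = compose<PromptInput>([record, rehost], async (input) => ({
  kind: 'image',
  url: `https://provider.example.com/${encodeURIComponent(input.prompt)}.png`
}));
```

Calling run({ prompt: 'cat' }) resolves to an output whose URL points at assets.example.com, because rehost post-processes the result before record sees it.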
Provider Registry
The ProviderRegistry is a global singleton that manages all registered providers:
import { ProviderRegistry } from '@imgly/plugin-ai-generation-web';
const registry = ProviderRegistry.get();
const allProviders = registry.getAll();
const imageProviders = registry.getByKind('image');
const myProvider = registry.getById('my-provider-id');
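The lookups above can be pictured with a small singleton registry. This is a hypothetical sketch for illustration, not the package's ProviderRegistry; TinyProviderRegistry, ProviderLike, and the register method are assumptions.

```typescript
type OutputKind = 'image' | 'video' | 'audio' | 'text';

// Minimal provider shape used by this sketch.
interface ProviderLike {
  id: string;
  kind: OutputKind;
}

// Hypothetical singleton registry; not the package's ProviderRegistry.
class TinyProviderRegistry {
  private static instance: TinyProviderRegistry | undefined;
  private providers = new Map<string, ProviderLike>();

  // Always returns the same shared instance.
  static get(): TinyProviderRegistry {
    if (!TinyProviderRegistry.instance) {
      TinyProviderRegistry.instance = new TinyProviderRegistry();
    }
    return TinyProviderRegistry.instance;
  }

  register(provider: ProviderLike): void {
    this.providers.set(provider.id, provider);
  }

  getAll(): ProviderLike[] {
    return Array.from(this.providers.values());
  }

  getByKind(kind: OutputKind): ProviderLike[] {
    return this.getAll().filter((provider) => provider.kind === kind);
  }

  getById(id: string): ProviderLike | undefined {
    return this.providers.get(id);
  }
}

TinyProviderRegistry.get().register({ id: 'my-image-provider', kind: 'image' });
TinyProviderRegistry.get().register({ id: 'my-text-provider', kind: 'text' });

const sameInstance = TinyProviderRegistry.get() === TinyProviderRegistry.get();
const imageProviderIds = TinyProviderRegistry.get()
  .getByKind('image')
  .map((provider) => provider.id);
const missingProvider = TinyProviderRegistry.get().getById('unknown');
```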
TypeScript Support
This package is fully typed with TypeScript, so editors can offer autocompletion and type checking during development:
- Generic Provider Types: Strongly typed providers with input/output validation
- Quick Action Types: Type-safe quick action definitions with proper input mapping
- Registry Types: Fully typed action and provider registries
- Middleware Types: Typed middleware functions for better composition
API Reference
Core Exports
export { Provider, ImageOutput, VideoOutput, AudioOutput, TextOutput } from './core/provider';
export { ActionRegistry, QuickActionDefinition, PluginActionDefinition } from './core/ActionRegistry';
export { ProviderRegistry } from './core/ProviderRegistry';
export { initializeProvider, initializeProviders } from './providers/';
export { loggingMiddleware, rateLimitMiddleware, uploadMiddleware } from './middleware/';
export { getPanelId, enableQuickActionForImageFill } from './utils/';
Common Types
interface CommonProviderConfiguration<I, O extends Output> {
  proxyUrl: string;
  debug?: boolean;
  middleware?: Middleware<I, O>[];
  headers?: Record<string, string>;
}

interface QuickActionDefinition<Q extends Record<string, any>> {
  id: string;
  type: 'quick';
  kind: OutputKind;
  label?: string;
  enable: boolean | ((context: { engine: CreativeEngine }) => boolean);
  render: (context: QuickActionRenderContext<Q>) => void;
}