English | 中文
WordCanvas
Introduction
This project is a text image rendering tool based on Pillow, designed specifically for random image generation.
By adding a variety of parameter settings, users can flexibly adjust input text, font styles, colors, and other attributes to achieve large-scale random generation of text images. Whether addressing issues such as data scarcity, class imbalance, or enhancing image diversity, WordCanvas provides a simple and efficient solution, offering a solid data foundation for deep learning training.
Technical Documentation
For installation and usage instructions, please refer to WordCanvas Documents.
There you will find detailed information about the project.
Installation
Install via PyPI
-
Install wordcanvas-docsaid
:
pip install wordcanvas-docsaid
-
Verify installation:
python -c "import wordcanvas; print(wordcanvas.__version__)"
-
If you see the version number, the installation is successful.
Install from GitHub
-
Clone the project from GitHub:
git clone https://github.com/DocsaidLab/WordCanvas.git
-
Install the wheel package:
pip install wheel setuptools
-
Build the whl file:
cd WordCanvas
python setup.py bdist_wheel
-
Install the whl file:
pip install dist/wordcanvas_docsaid-*-py3-none-any.whl
Quick Start
The hardest part is getting started, so we need a simple beginning.
Start with a String
Simply provide a basic declaration and you're ready to go.
from wordcanvas import WordCanvas
gen = WordCanvas(return_infos=True)
Using all default settings, you can directly call the function to generate a text image.
text = "你好!Hello, World!"
img, infos = gen(text)
print(img.shape)
print(infos)
data:image/s3,"s3://crabby-images/05ebc/05ebc0f6c105097a71e8fc6418184266574c7b52" alt="sample1"
[!TIP]
In default mode, the output image size depends on:
- Font size: The default is 64. As the font size increases, the image size will also increase.
- Text length: The longer the text, the wider the image will be, with the exact length determined by
pillow
. - The output information
infos
contains all drawing parameters, including text, background color, text color, etc. - To output only the image, set
return_infos=False
, which is the default setting.
Specify a Specific Font
You can specify your preferred font using the font
parameter.
from wordcanvas import WordCanvas
gen = WordCanvas(
font_path="/path/to/your/font/OcrB-Regular.ttf"
)
text = 'Hello, World!'
img = gen(text)
data:image/s3,"s3://crabby-images/23c93/23c934c933e727067ba9b76a9b80a5a073a03f7d" alt="sample14"
If the font does not support the input text, tofu characters will appear.
text = 'Hello, 中文!'
img = gen(text)
data:image/s3,"s3://crabby-images/46fa2/46fa2185b8a3855724e71106ee89ce3503408b51" alt="sample15"
[!TIP]
How to Check if the Font Supports Characters:
I currently don’t have this requirement, so I’ve left a basic method. This is a simple check that only checks one character at a time, so you need to iterate through all the characters. If you have other requirements, feel free to expand on this.
from wordcanvas import is_character_supported, load_ttfont
target_text = 'Hello, 中文!'
font = load_ttfont("/path/to/your/font/OcrB-Regular.ttf")
for c in target_text:
status = is_character_supported(font, c)
Set Image Size
You can adjust the image size using the output_size
parameter.
from wordcanvas import WordCanvas
gen = WordCanvas(output_size=(64, 1024))
img = gen(text)
print(img.shape)
data:image/s3,"s3://crabby-images/941ad/941ade80382f36e4137e32972f59d925710fe9aa" alt="sample4"
When the specified size is smaller than the text image size, the text image will be automatically scaled.
In other words, the text will be squeezed together, becoming a narrow rectangle, for example:
from wordcanvas import WordCanvas
text = '你好' * 10
gen = WordCanvas(output_size=(64, 512))
img = gen(text)
data:image/s3,"s3://crabby-images/f7137/f7137ae0ac2a3e56d2bf3bbbc110520c16a8ca22" alt="sample8"
Adjust Background Color
You can adjust the background color using the background_color
parameter.
from wordcanvas import WordCanvas
gen = WordCanvas(background_color=(255, 0, 0))
img = gen(text)
data:image/s3,"s3://crabby-images/7f44d/7f44dd82e16d7d778c0e6deb63e990f777db7c38" alt="sample2"
Adjust Text Color
You can adjust the text color using the text_color
parameter.
from wordcanvas import WordCanvas
gen = WordCanvas(text_color=(0, 255, 0))
img = gen(text)
data:image/s3,"s3://crabby-images/572df/572df8e57d2b1eb5067d4fef4326b671e8b0d24b" alt="sample3"
Adjust Text Alignment
[!WARNING]
Do you remember the image size mentioned earlier?
By default, setting text alignment does not have any effect. When drawing the image, there must be extra space in the text image to see the alignment effect.
You can adjust the text alignment using the align_mode
parameter.
from wordcanvas import AlignMode, WordCanvas
gen = WordCanvas(
output_size=(64, 1024),
align_mode=AlignMode.Center
)
text = '你好! Hello, World!'
img = gen(text)
-
Center alignment: AlignMode.Center
data:image/s3,"s3://crabby-images/9af8f/9af8fbf455dcfc4e63cfee8606e01749926388f8" alt="sample5"
-
Right alignment: AlignMode.Right
data:image/s3,"s3://crabby-images/c8d7d/c8d7da5fc980091da86b957467d17e18c1e3e9d8" alt="sample6"
-
Left alignment: AlignMode.Left
data:image/s3,"s3://crabby-images/941ad/941ade80382f36e4137e32972f59d925710fe9aa" alt="sample4"
-
Scatter alignment: AlignMode.Scatter
data:image/s3,"s3://crabby-images/0d66b/0d66be77eeed20d84538f44168a89cbcac9ecc03" alt="sample7"
[!TIP]
In scatter alignment mode, not every character will be spread out, but words will be spread as a unit. In Chinese, the unit of a word is a character; in English, the unit of a word is a space.
For example, the input text "你好! Hello, World!" will be split into:
- ["你", "好", "!", "Hello,", "World!"]
Spaces are ignored, and scatter alignment is applied.
Additionally, when the input text can only be split into a single word, scatter alignment for Chinese words is equivalent to center alignment, and English words will be split into individual characters for scatter alignment.
The logic we use for this is:
def split_text(text: str):
""" Split text into a list of characters. """
pattern = r"[a-zA-Z0-9\p{P}\p{S}]+|."
matches = regex.findall(pattern, text)
matches = [m for m in matches if not regex.match(r'\p{Z}', m)]
if len(matches) == 1:
matches = list(text)
return matches
[!WARNING]
This is a very simple implementation and may not meet all requirements. If you have a more complete solution for string splitting, feel free to provide it.
Adjust Text Direction
You can adjust the text direction using the direction
parameter.
-
Output horizontal text
from wordcanvas import AlignMode, WordCanvas
text = '你好!'
gen = WordCanvas(direction='ltr')
img = gen(text)
data:image/s3,"s3://crabby-images/99539/99539eb590583c3de111bcfe6ba94e49fa9e6672" alt="sample9"
-
Output vertical text
from wordcanvas import AlignMode, WordCanvas
text = '你好!'
gen = WordCanvas(direction='ttb')
img = gen(text)
data:image/s3,"s3://crabby-images/85837/858375b4b0ed8aac2abf588eb3f406087e46cbc6" alt="sample10"
-
Output vertical text with scatter alignment
from wordcanvas import AlignMode, WordCanvas
text = '你好!'
gen = WordCanvas(
direction='ttb',
align_mode=AlignMode.Scatter,
output_size=(64, 512)
)
img = gen(text)
data:image/s3,"s3://crabby-images/c95c4/c95c4161b446954bb8f5be68d95854a017a82b1a" alt="sample11"
Adjust Output Direction
You can adjust the output direction using the output_direction
parameter.
[!TIP]
When to use this parameter: When you choose "Output vertical text" but still want to view the text image horizontally, you can use this parameter.
-
Vertical text, horizontal output
from wordcanvas import OutputDirection, WordCanvas
gen = WordCanvas(
direction='ttb',
output_direction=OutputDirection.Horizontal
)
text = '你好!'
img = gen(text)
data:image/s3,"s3://crabby-images/943f3/943f3ebeafb30a5885b57d7800eaa2be0535dc72" alt="sample12"
-
Horizontal text, vertical output
from wordcanvas import OutputDirection, WordCanvas
gen = WordCanvas(
direction='ltr',
output_direction=OutputDirection.Vertical
)
text = '你好!'
img = gen(text)
data:image/s3,"s3://crabby-images/ab58b/ab58be27a9f8fb3f3de3cefb22fa1785bce3d9fb" alt="sample13"
Squeeze Text
In some scenarios, the text might need to be specially squeezed. You can use the text_aspect_ratio
parameter to adjust this.
from wordcanvas import WordCanvas
gen = WordCanvas(
text_aspect_ratio=0.25,
output_size=(32, 1024),
)
text="壓扁測試"
img = gen(text)
data:image/s3,"s3://crabby-images/b9de8/b9de867019189d16c560ea95f1290152fc2f66e3" alt="sample16"
[!IMPORTANT]
It is important to note that if the squeezed text size exceeds the output_size
, the image will go through an automatic scaling process. Therefore, you might end up squeezing the text, but it will be scaled back to its original size, and nothing will appear to have happened.
Text Stroke
You can adjust the width of the text stroke using the stroke_width
parameter.
from wordcanvas import WordCanvas
gen = WordCanvas(
font_size=64,
text_color=(0, 0, 255),
background_color=(255, 0, 0),
stroke_width=2,
stroke_fill=(0, 255, 0),
)
text="文字外框測試"
img = gen(text)
data:image/s3,"s3://crabby-images/9c77f/9c77ff1b11ac453cf6cf7f39ec07830b12b50793" alt="sample29"
[!WARNING]
Using stroke_width
may result in an OSError: array allocation size too large error with certain text.
Using `stroke_width` may cause an OSError: array allocation size too large error with certain text.
This is a known issue with the `Pillow` library (see https://github.com/python-pillow/Pillow/issues/7287) and cannot be resolved directly.
We found in testing that using stroke_width
in Pillow
may intermittently result in an OSError
. This is a known issue with Pillow
, and we have linked the relevant issue in the warning. You can click the link to view it.
Multi-line Text
You can use the \n
newline character to create multi-line text.
from wordcanvas import WordCanvas
gen = WordCanvas()
text = '你好!\nHello, World!'
img = gen(text)
data:image/s3,"s3://crabby-images/9ac6c/9ac6cad634a856b82a08e82a9854c9d95aa7057a" alt="sample30"
In the case of multi-line text, you can combine it with most of the features mentioned above, for example:
from wordcanvas import WordCanvas, AlignMode
gen = WordCanvas(
text_color=(0, 0, 255),
output_size=(128, 512),
background_color=(0, 0, 0),
align_mode=AlignMode.Center,
stroke_width=2,
stroke_fill=(0, 255, 0),
)
text = '你好!\nHello, World!'
img = gen(text)
data:image/s3,"s3://crabby-images/2c650/2c650ffca98893ecad44ea89b761033c09e236b9" alt="sample31"
[!WARNING]
The following situations do not support multi-line text:
-
align_mode
does not support AlignMode.Scatter
gen = WordCanvas(align_mode=AlignMode.Scatter)
-
direction
does not support ttb
gen = WordCanvas(direction='ttb')
If you need these features, please avoid using multi-line text.
Dashboard
The basic functionality is more or less as described.
Finally, let's introduce the dashboard feature.
from wordcanvas import WordCanvas
gen = WordCanvas()
print(gen)
You can also directly output without using print
, as we have implemented the __repr__
method.
Once output, you will see a simple dashboard.
data:image/s3,"s3://crabby-images/350b6/350b686131e9232c71abf21ef783496595b80531" alt="dashboard"
Here you can see:
- The first column is Property, which lists all the setting parameters.
- The second column is Current Value, showing the current value of the parameter.
- The third column is SetMethod, which shows how the parameter is set.
- Parameters set with
set
can be directly modified; - Parameters set with
reinit
require reinitialization of the WordCanvas
object.
- The fourth column is DType, which shows the data type of the parameter.
- The fifth column is Description, which describes the parameter. (This column is not shown in the image above to save space.)
Most parameters can be directly set, meaning that when you need to modify output characteristics, you don't need to create a new object. Just modify the settings directly. Parameters that require reinit
usually involve the initialization of font formats, such as text height (font_size
) and others.
gen.output_size = (64, 1024)
gen.text_color = (0, 255, 0)
gen.align_mode = AlignMode.Center
gen.direction = 'ltr'
gen.output_direction = OutputDirection.Horizontal
After setting, you can directly call the function to get the new text image. Additionally, if you modify parameters related to reinit
, you will receive the corresponding error:
[!CAUTION]
Of course, you can still forcefully modify the parameters, but as a fellow Python user, I can't stop you:
gen._font_size = 128
However, this will cause errors when generating the image later!
Don't insist; just reinitialize the object.
Summary
Many features haven't been mentioned, but the basic functionality has been covered.
This concludes the basic usage of the project. For more detailed information and usage methods, please refer directly to WordCanvas Advanced Usage.
Citation
If you find our work helpful, please cite the following:
@misc{yuan2024wordcanvas,
author = {Ze Yuan},
title = {WordCanvas},
year = {2024},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/DocsaidLab/WordCanvas}}
}