Batch-generating font images: training data for a font-identification model

Published by 陈作立的博客 (Chen Zuoli's blog) on 2024-12-09

Original URL: https://www.cnblogs.com/chenzuoli/p/18596308

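Text comes in many typefaces, and the operating system gets them from font files. When a task calls for images of text rendered in specific fonts, how can we generate them quickly?

This post shows how to use the font files that ship with the operating system, together with Python's Pillow package, to generate font images in bulk.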

Font file paths on each operating system
Windows, Linux, and macOS:

import os
import sys

dirs = []
if sys.platform == "win32":
    # check the windows font repository
    # NOTE: must use uppercase WINDIR, to work around bugs in
    # 1.5.2's os.environ.get()
    windir = os.environ.get("WINDIR")
    if windir:
        dirs.append(os.path.join(windir, "fonts"))
elif sys.platform in ("linux", "linux2"):
    data_home = os.environ.get("XDG_DATA_HOME")
    if not data_home:
        # The freedesktop spec defines the following default directory for
        # when XDG_DATA_HOME is unset or empty. This user-level directory
        # takes precedence over system-level directories.
        data_home = os.path.expanduser("~/.local/share")
    xdg_dirs = [data_home]

    data_dirs = os.environ.get("XDG_DATA_DIRS")
    if not data_dirs:
        # Similarly, defaults are defined for the system-level directories
        data_dirs = "/usr/local/share:/usr/share"
    xdg_dirs += data_dirs.split(":")

    dirs += [os.path.join(xdg_dir, "fonts") for xdg_dir in xdg_dirs]
elif sys.platform == "darwin":
    dirs += [
        "/Library/Fonts",
        "/System/Library/Fonts",
        os.path.expanduser("~/Library/Fonts"),
    ]
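
Once `dirs` is built, the font files inside it still have to be collected. The following is a minimal sketch, not the original author's code: `find_font_files` is a hypothetical helper that walks each directory recursively (Linux distributions commonly nest fonts in subfolders), skips directories that do not exist, and compares extensions case-insensitively.

```python
import os


def find_font_files(font_dirs, extensions=(".ttf", ".ttc")):
    """Return a sorted list of (font_path, font_name) pairs found under font_dirs.

    Hypothetical helper: missing directories are skipped, subdirectories
    are searched, and extensions are compared case-insensitively.
    """
    found = []
    for font_dir in font_dirs:
        if not os.path.isdir(font_dir):
            continue  # e.g. ~/.local/share/fonts may not exist
        for root, _subdirs, files in os.walk(font_dir):
            for file_name in files:
                stem, ext = os.path.splitext(file_name)
                if ext.lower() in extensions:
                    found.append((os.path.join(root, file_name), stem))
    return sorted(found)
```

Calling `find_font_files(dirs)` with the list built above would return pairs in the same `(font_path, font_name)` shape that the generation code in the next section iterates over.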

Generating images with Pillow


import os
import random

import nltk
from PIL import Image, ImageDraw, ImageFont

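# Download the necessary data from nltk
nltk.download('inaugural')

# Settings used by gen_images() below. The post does not show their
# definitions, so the values here are assumptions; adjust them as needed.
IMAGES_PER_FONT = 10            # the loop comment in gen_images() mentions 10
FONT_EXCEPTS = set()            # font names to skip; assumed empty
GEN_IMAGES_DIR = "font_images"  # assumed output directory
os.makedirs(GEN_IMAGES_DIR, exist_ok=True)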

def wrap_text(text, line_length=4):
    """Wraps the provided text every 'line_length' words."""
    words = text.split()
    return "\n".join([" ".join(words[i:i + line_length]) for i in range(0, len(words), line_length)])


def random_prose_text(line_length=4):
    """Returns a random 800-character snippet from the NLTK inaugural corpus."""
    corpus = nltk.corpus.inaugural.raw()
    start = random.randint(0, len(corpus) - 800)
    end = start + 800
    return wrap_text(corpus[start:end], line_length=line_length)


def gen_images():
    # get font name and font files
    font_files = []
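    for font_dir in dirs:
        # Skip font directories that do not exist on this machine
        if not os.path.isdir(font_dir):
            continue
        for font_file in os.listdir(font_dir):
            # Match the extension case-insensitively (e.g. a file named FONT.TTF)
            if font_file.lower().endswith(('.ttf', '.ttc')):
                font_path = os.path.join(font_dir, font_file)
                font_name = font_file.split('.')[0]
                font_files.append((font_path, font_name))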

    # Generate images for each font file
    for font_path, font_name in font_files:
        # Output the font name so we can see the progress
        print(font_path, font_name)

        # Counter for the image filename
        j = 0
        for i in range(IMAGES_PER_FONT):  # Generate 50 images per font - reduced to 10 for now to make things faster
            # Random font size
            font_size = random.choice(range(18, 72))

            if font_path.endswith('.ttc'):
                # ttc fonts have multiple fonts in one file, so we need to specify which one we want
                font = ImageFont.truetype(font_path, font_size, index=0)
            elif font_name in FONT_EXCEPTS:
                continue
            else:
                # ttf fonts have only one font in the file
                font = ImageFont.truetype(font_path, font_size)

            # Determine the number of words that will fit on a line
            font_avg_char_width = font.getbbox('x')[2]
            words_per_line = int(800 / (font_avg_char_width * 5))
            prose_sample = random_prose_text(line_length=words_per_line)

            # print("generate font image: " + str(prose_sample))
            for text in [prose_sample]:
                img = Image.new('RGB', (800, 400), color="white")  # Canvas size
                draw = ImageDraw.Draw(img)

                # Random offsets, but ensuring that text isn't too far off the canvas
                offset_x = random.randint(-20, 10)
                offset_y = random.randint(-20, 10)

                # vary the line height
                line_height = random.uniform(0, 1.25) * font_size
                draw.text((offset_x, offset_y), text, fill="black", font=font, spacing=line_height)

                j += 1
                output_file = os.path.join(GEN_IMAGES_DIR, f"{font_name}_{j}.png")
                img.save(output_file)
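
The line-fitting step in `gen_images` can be isolated to make the arithmetic easier to check. The sketch below restates it with the standard library only. `words_that_fit` is a hypothetical name, and the assumption of roughly five characters per word mirrors the `* 5` factor in the code above. The `max(1, ...)` guard and the zero-width check are additions: without them, a very wide glyph would give a line length of 0, which makes `range()` in `wrap_text` raise a `ValueError`, and a zero-width glyph would cause a division by zero.

```python
def words_that_fit(canvas_width, char_width, chars_per_word=5):
    """Estimate how many words fit on one line of the canvas."""
    if char_width <= 0:
        return 1  # degenerate glyph width: fall back to one word per line
    return max(1, int(canvas_width / (char_width * chars_per_word)))


def wrap_text(text, line_length=4):
    """Wrap text every line_length words (same logic as in the post)."""
    words = text.split()
    return "\n".join(
        " ".join(words[i:i + line_length])
        for i in range(0, len(words), line_length)
    )


# A 20-pixel-wide 'x' on an 800-pixel canvas: 800 / (20 * 5) = 8 words per line.
per_line = words_that_fit(800, 20)
sample = wrap_text("one two three four five six seven eight nine ten", per_line)
print(per_line)  # 8
print(sample)    # first line holds eight words, second line holds "nine ten"
```

In `gen_images`, the glyph width comes from `font.getbbox('x')[2]`, so larger font sizes produce fewer words per line on the same 800-pixel canvas.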

The full source code is available here:
https://github.com/chenzuoli/font-identifier

This code is based on the open-source project: https://huggingface.co/gaborcselle/font-identifier

That's all for now; this post will continue to be updated.

Writing down problems is itself a form of practice.

