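Motivation

There are plenty of posts online about turning text into video, but they usually depend on large models such as Stable Diffusion (for example, link), which makes them awkward to deploy and use. So I decided to build an article-to-video tool on Python and open-source libraries that can easily convert an illustrated web article into a video.

The code is on GitHub: github.com/sim4d/text2…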

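Steps

Fetch the article content with requests
Generate the narration audio and subtitles with edge-tts
Scrape the article's images with BeautifulSoup
Use moviepy to join the images into a video and sync it with the audio and subtitles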
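
The last step has to decide how long each image stays on screen. The following is a minimal sketch of one possible rule, assuming the images are shown in order and the narration length is split evenly among them. The repository may use a different rule, and `plan_slides` is a hypothetical helper name, not a function from the project.

```python
from typing import List, Tuple


def plan_slides(image_paths: List[str], audio_seconds: float) -> List[Tuple[str, float, float]]:
    """Return (path, start, duration) for each image, splitting the audio evenly."""
    if not image_paths:
        raise ValueError("need at least one image")
    if audio_seconds <= 0:
        raise ValueError("audio length must be positive")
    per_image = audio_seconds / len(image_paths)
    return [(path, i * per_image, per_image) for i, path in enumerate(image_paths)]


if __name__ == "__main__":
    # Hypothetical file names and a 90-second narration.
    for path, start, duration in plan_slides(["a.png", "b.png", "c.png"], 90.0):
        print(f"{path}: starts at {start:.1f}s, shown for {duration:.1f}s")
```

Each tuple maps onto one moviepy `ImageClip` with that duration, and the clips can then be joined with `concatenate_videoclips` before the audio is attached.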

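Project features

Built entirely on Python and third-party libraries, with no dependence on large models such as stable-diffusion or midjourney.
The implementation uses WeChat Official Account articles as its example, so the lookups in its functions (such as get_title and get_wechat_article in make_audio.py) rely on how those articles are marked up. For other kinds of articles, the tags and classes being queried may need adjusting.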
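
To illustrate what "adjusting the tags and classes" can look like, here is a small sketch that keeps the selectors as parameters. It uses the standard library's `html.parser` in place of BeautifulSoup so that it runs without extra packages. The names `ArticleParser` and `parse_article` are hypothetical, and the defaults (`h1`, `rich_media_title`, `data-src`) are assumptions about WeChat's markup rather than values taken from the repository.

```python
from html.parser import HTMLParser
from typing import List


class ArticleParser(HTMLParser):
    """Collect the title text and image URLs from article HTML."""

    def __init__(self, title_tag: str = "h1", title_class: str = "rich_media_title",
                 image_attr: str = "data-src") -> None:
        super().__init__()
        self.title_tag = title_tag
        self.title_class = title_class
        self.image_attr = image_attr
        self.title_parts: List[str] = []
        self.images: List[str] = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        classes = (attr.get("class") or "").split()
        if tag == self.title_tag and self.title_class in classes:
            self._in_title = True
        elif tag == "img":
            # Fall back to plain src when the lazy-load attribute is absent.
            url = attr.get(self.image_attr) or attr.get("src")
            if url:
                self.images.append(url)

    def handle_endtag(self, tag):
        if tag == self.title_tag:
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title_parts.append(data)

    @property
    def title(self) -> str:
        return "".join(self.title_parts).strip()


def parse_article(html: str, **selectors: str) -> ArticleParser:
    """Parse html with the given selectors and return the filled parser."""
    parser = ArticleParser(**selectors)
    parser.feed(html)
    parser.close()
    return parser


if __name__ == "__main__":
    sample = '<h1 class="rich_media_title"> Demo title </h1><img data-src="https://example.com/1.png">'
    page = parse_article(sample)
    print(page.title, page.images)
    # A different site: pass its own tag, class, and image attribute.
    blog = parse_article('<h2 class="post-title">Hi</h2><img src="x.jpg">',
                         title_tag="h2", title_class="post-title", image_attr="src")
    print(blog.title, blog.images)
```

Supporting another site then means changing three arguments rather than editing the parsing code.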

Development environment

Windows 11, WSL2 + Rocky 9.3, Python 3.12

Install Rocky Linux 9.3 for Windows Subsystem for Linux 2 (WSL2) from PowerShell; refer to link

wget -Uri https://dl.rockylinux.org/pub/rocky/9/images/x86_64/Rocky-9-Container-Base.latest.x86_64.tar.xz -OutFile ./Rocky-9-Container-Base.latest.x86_64.tar.xz
mkdir wsl-rocky
wsl --import rocky9 ./wsl-rocky ./Rocky-9-Container-Base.latest.x86_64.tar.xz --version 2
wsl -d rocky9

Update the system and add a regular user

# dnf install epel-release
# dnf update && dnf upgrade
#
# dnf install sudo
# adduser wsl
# passwd wsl
# usermod -aG wheel wsl
# exit

Enter Rocky 9 as the regular user

wsl -d rocky9 -u wsl

Install pip, ImageMagick, git, and vim

sudo dnf install python3-pip
sudo dnf install ImageMagick
sudo dnf install git vim

Prepare the sandbox (your SSH public key must already be added to your GitHub account settings)

cd ~/
mkdir sandbox
cd sandbox
git clone git@github.com:sim4d/text2video.git text2video

Install dependencies

cd ~/sandbox/text2video
python3 -m venv my_venv
source ./my_venv/bin/activate
pip3 install -r requirements.txt

Set the url, then run text2video.py

python3 text2video.py

References

zulko.github.io/moviepy/ref…
github.com/rany2/edge-…

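Other issues

1. When edge-tts generates VTT subtitles, it can only split them on word boundaries

In the finished video the subtitles look cluttered (see figure).

Solution

Vendor the edge-tts project into the repository and patch it to split on sentence boundaries instead.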

Patch diff

$ diff -u communicate.py-original communicate.py
--- communicate.py-original     2024-05-12 16:28:58.031623420 +0800
+++ communicate.py      2024-05-13 00:20:53.785773846 +0800
@@ -331,7 +331,7 @@
                 "Content-Type:application/json; charset=utf-8\r\n"
                 "Path:speech.config\r\n\r\n"
                 '{"context":{"synthesis":{"audio":{"metadataoptions":{'
-                '"sentenceBoundaryEnabled":false,"wordBoundaryEnabled":true},'
+                '"sentenceBoundaryEnabled":true,"wordBoundaryEnabled":false},'
                 '"outputFormat":"audio-24khz-48kbitrate-mono-mp3"'
                 "}}}}\r\n"
             )
@@ -359,7 +359,7 @@
         def parse_metadata() -> Dict[str, Any]:
             for meta_obj in json.loads(data)["Metadata"]:
                 meta_type = meta_obj["Type"]
-                if meta_type == "WordBoundary":
+                if meta_type in ("WordBoundary", "SentenceBoundary"):
                     current_offset = meta_obj["Data"]["Offset"] + offset_compensation
                     current_duration = meta_obj["Data"]["Duration"]
                     return {
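
With sentence boundaries enabled, each metadata event should describe a whole sentence together with its offset and duration, so one event can become one subtitle cue. The sketch below shows that conversion in isolation. It assumes offsets and durations are expressed in 100-nanosecond units, and `format_timestamp` and `build_vtt` are hypothetical helpers, not part of edge-tts.

```python
from typing import Iterable, Tuple

TICKS_PER_SECOND = 10_000_000  # assumption: boundaries are reported in 100 ns units


def format_timestamp(ticks: int) -> str:
    """Format a tick count as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = ticks * 1000 // TICKS_PER_SECOND
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    seconds, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}.{millis:03d}"


def build_vtt(boundaries: Iterable[Tuple[int, int, str]]) -> str:
    """Turn (offset, duration, text) sentence boundaries into a WebVTT document."""
    lines = ["WEBVTT", ""]
    for offset, duration, text in boundaries:
        start = format_timestamp(offset)
        end = format_timestamp(offset + duration)
        lines += [f"{start} --> {end}", text.strip(), ""]
    return "\n".join(lines)


if __name__ == "__main__":
    # Two made-up sentences: 1.5 s and 2 s long.
    print(build_vtt([(0, 15_000_000, "First sentence."),
                     (15_000_000, 20_000_000, "Second sentence.")]))
```

Because every cue spans a full sentence, the on-screen text changes only when the narration moves to the next sentence.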

Subtitles after the patch

2. The same code fails with the following error on WSL + Ubuntu. Switching to Ubuntu 24.04 made no difference; it only worked after moving to Rocky 9.3.

Traceback (most recent call last):
  File "/home/wsl/sandbox/text2video/text2video.py", line 116, in <module>
    main(url, font_path)
  File "/home/wsl/sandbox/text2video/text2video.py", line 107, in main
    generate_video(images_dir, audio_path, vtt_path, font_path, output_path, front_txt, title_txt)
  File "/home/wsl/sandbox/text2video/text2video.py", line 36, in generate_video
    front_clip = mp.TextClip(front_txt, color='black', bg_color='white', font=font_path, align='West', kerning=5, fontsize=18)
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/wsl/sandbox/text2video/my_venv/lib/python3.12/site-packages/moviepy/video/VideoClip.py", line 1146, in __init__
    raise IOError(error)
OSError: MoviePy Error: creation of None failed because of the following error:

convert-im6.q16: attempt to perform an operation not allowed by the security policy `@/tmp/tmpn53ke08b.txt' @ error/property.c/InterpretImageProperties/3771.
convert-im6.q16: label expected `@/tmp/tmpn53ke08b.txt' @ error/annotate.c/GetMultilineTypeMetrics/782.
convert-im6.q16: no images defined `PNG32:/tmp/tmp3vlxrrq6.png' @ error/convert.c/ConvertImageCommand/3234.
.

.This error can be due to the fact that ImageMagick is not installed on your computer, or (for Windows users) that you didn't specify the path to the ImageMagick binary in file conf.py, or that the path you specified is incorrect

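Solution

This may be specific to the combination of Windows, Python, and ImageMagick. The error text itself says that ImageMagick's security policy refused the `@/tmp/….txt` argument that MoviePy passed to `convert`. This link seems to describe a similar problem that was solved.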
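
One way to investigate is to list the path rules in ImageMagick's policy file that grant no rights. This is only a diagnostic sketch. It assumes the Ubuntu package keeps its policy at /etc/ImageMagick-6/policy.xml and expresses the restriction as a `path` policy with `rights="none"`; neither is confirmed by this post, and `blocking_path_policies` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET
from typing import List, Union


def blocking_path_policies(policy_xml: Union[str, bytes]) -> List[str]:
    """Return the patterns of path policies that grant no rights."""
    root = ET.fromstring(policy_xml)
    return [
        policy.get("pattern", "")
        for policy in root.iter("policy")
        if policy.get("domain") == "path" and policy.get("rights") == "none"
    ]


if __name__ == "__main__":
    # Assumed location of the ImageMagick 6 policy file on Debian/Ubuntu.
    path = "/etc/ImageMagick-6/policy.xml"
    try:
        with open(path, "rb") as handle:
            patterns = blocking_path_policies(handle.read())
    except FileNotFoundError:
        print(f"{path} not found; ImageMagick may keep its policy elsewhere")
    else:
        print("blocked path patterns:", patterns or "none")
```

A pattern such as `@*` in the output would match the `@/tmp/….txt` argument shown in the error, which would make that rule a likely cause.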

Author: Simford

Link: https://juejin.cn/post/7368637177428426752

Source: Juejin (稀土掘金)

Title: An article-to-video tool built with Python (基于 Python 构建的文章转视频神器)

