需求

網上有不少文稿轉視訊的文章，但是往往依賴于Stable Diffusion等大模型（比如link），部署和使用都不太友善。于是就打算基于 Python 和開源庫，做一個文章轉視訊的工具，友善地将圖文并茂的網頁文章轉換成視訊。

代碼已經上傳到GitHub： github.com/sim4d/text2…

具體步驟

通過 requests 擷取文章内容
借 edge-tts 生成語音及字幕
利用 BeautifulSoap 抓取文章圖像
再借助 moviepy 将圖像拼接為視訊，并比對語音與字幕

項目特點

完全基于 Python 及其第三方應用庫，無需依賴于stable-diffusion/midjourney等大模型。
實作以微信公衆号文章為例，函數的具體查詢是基于公衆号文章的特點 (如 make_audio.py 裡的 get_title / get_wechat_article 等函數)。若用于其它類型文章，可能需要調整查詢的tag和class資訊

開發環境

Windows 11, WSL2 + Rocky 9.3，Python 3.12

安裝 Rocky Linux 9.3 for Windows Subsystem for Linux 2 (WSL2), refer to link

wget -Uri https://dl.rockylinux.org/pub/rocky/9/images/x86_64/Rocky-9-Container-Base.latest.x86_64.tar.xz -OutFile ./Rocky-9-Container-Base.latest.x86_64.tar.xz
mkdir wsl-rocky
wsl --import rocky9 ./wsl-rocky ./Rocky-9-Container-Base.latest.x86_64.tar.xz --version 2
wsl -d rocky9

更新系統，添加普通使用者

# dnf install epel-release
# dnf update && dnf upgrade
#
# dnf install sudo
# adduser wsl
# passwd wsl
# usermod -aG wheel wsl
# exit

以普通使用者進入 Rocky9

wsl -d rocky9 -u wsl

安裝 pip 和 ImageMagick

sudo dnf install python3-pip
sudo dnf install ImageMagick
sudo dnf install git vim

準備 sandbox (要求 pub key 已經在 GitHub 賬戶 Profile 設定好)

cd ~/
mkdir sandbox
cd sandbox
git clone [email protected]:sim4d/text2video.git text2video

安裝依賴

cd ~/sandbox/text2video
python3 -m venv my_venv
source ./my_venv/bin/activate
pip3 install -r requirements.txt

設定url，運作text2video.py

python3 text2video.py

References

zulko.github.io/moviepy/ref…
github.com/rany2/edge-…

其它問題

1. edge-tts 生成 vtt 字幕時，隻能以詞語為 Boundary

做成視訊後，字幕看起來就很亂。如圖

解決方案

把 edge-tts 項目導入進來，并打上patch，改為以句子為 Boundary

Patch diff

$ diff -u communicate.py-original communicate.py
--- communicate.py-original     2024-05-12 16:28:58.031623420 +0800
+++ communicate.py      2024-05-13 00:20:53.785773846 +0800
@@ -331,7 +331,7 @@
                 "Content-Type:application/json; charset=utf-8\r\n"
                 "Path:speech.config\r\n\r\n"
                 '{"context":{"synthesis":{"audio":{"metadataoptions":{'
-                '"sentenceBoundaryEnabled":false,"wordBoundaryEnabled":true},'
+                '"sentenceBoundaryEnabled":true,"wordBoundaryEnabled":false},'
                 '"outputFormat":"audio-24khz-48kbitrate-mono-mp3"'
                 "}}}}\r\n"
             )
@@ -359,7 +359,7 @@
         def parse_metadata() -> Dict[str, Any]:
             for meta_obj in json.loads(data)["Metadata"]:
                 meta_type = meta_obj["Type"]
-                if meta_type == "WordBoundary":
+                if meta_type in ("WordBoundary", "SentenceBoundary"):
                     current_offset = meta_obj["Data"]["Offset"] + offset_compensation
                     current_duration = meta_obj["Data"]["Duration"]
                     return {

新的字幕效果

2. 同樣的代碼，用 WSL + Ubuntu，就會碰到以下問題。換成 Ubuntu 24.04 也一樣。最後換成 Rocky 9.3 才行。

Traceback (most recent call last):
  File "/home/wsl/sandbox/text2video/text2video.py", line 116, in <module>
    main(url, font_path)
  File "/home/wsl/sandbox/text2video/text2video.py", line 107, in main
    generate_video(images_dir, audio_path, vtt_path, font_path, output_path, front_txt, title_txt)
  File "/home/wsl/sandbox/text2video/text2video.py", line 36, in generate_video
    front_clip = mp.TextClip(front_txt, color='black', bg_color='white', font=font_path, align='West', kerning=5, fontsize=18)
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/wsl/sandbox/text2video/my_venv/lib/python3.12/site-packages/moviepy/video/VideoClip.py", line 1146, in __init__
    raise IOError(error)
OSError: MoviePy Error: creation of None failed because of the following error:

convert-im6.q16: attempt to perform an operation not allowed by the security policy `@/tmp/tmpn53ke08b.txt' @ error/property.c/InterpretImageProperties/3771.
convert-im6.q16: label expected `@/tmp/tmpn53ke08b.txt' @ error/annotate.c/GetMultilineTypeMetrics/782.
convert-im6.q16: no images defined `PNG32:/tmp/tmp3vlxrrq6.png' @ error/convert.c/ConvertImageCommand/3234.
.

.This error can be due to the fact that ImageMagick is not installed on your computer, or (for Windows users) that you didn't specify the path to the ImageMagick binary in file conf.py, or that the path you specified is incorrect

解決方案

這可能是 Windows + Python + ImageMagick 特有的問題，好像這個 link 碰到類似問題，并成功解決了。

作者：Simford

連結：https://juejin.cn/post/7368637177428426752

來源：稀土掘金

基于 Python 建構的文章轉視訊神器

需求

具體步驟

項目特點

開發環境

References

其它問題

解決方案

解決方案

繼續閱讀

沒法寫了！什麼破首發維權！漏成了篩子！我前腳寫，他們後腳複制圖自拍，他們不改文章，他們直接複制。不是網易，就是百家号！抄

好友月光照的一篇文章《父母兜底有多重要？》爆款了，觀點鮮明，引起了很多家長的共鳴（圖一紅線）其中有一位網友在評論區說，她

波士頓動力這個黃色機器狗，開發了7年，各種宣傳視訊賣萌擺拍照，數量遠超實際遛狗視訊。結果最近剛上展會，爬個樓梯就突然半身

張泉靈來回應自己的白發了她說她的丈夫不喜歡她的白發，但是又不敢明确反對，隻是給她轉發了一篇有關她白發顯老的文章，間接的勸

真是一個鬧心的端午節。直到現在，我的手都還在抖。就因為孩子沒去他家吃飯，就上來砸門鬧事[流淚]。離婚多年，孩子我教我養，

#蔡磊#蔡磊早前采訪時，說到自己“低頭擡頭都非常艱難”，可是，剛剛6月8、9号發的視訊，頭也能自主擡頭扭動，圖四揭牌也能

最近在抖音上遇到一個遊戲小程式，瞬間上了點小瘾，不知不覺地玩了一個月了。這個小遊戲經常以給額外小加成的名義讓看廣告30秒

趙雲曾有個結義兄弟三國演義中，趙雲曾有一個結義兄弟，想必大家都不知道吧，他就是桂陽郡太守趙範。本來趙雲是率軍去攻打桂陽郡

劇推薦:今日定檔開播的幾部影視劇《海天雄鷹》6/11央視訊道/優酷/騰訊視訊《鳳落江湖》6/11芒果TV《聘貓記》6/1

蔡斌的言外之意。中國女排澳門站的表現，讓國不滿，其中焦點就是主教練蔡斌對朱婷的使用。在各種因素的交織下，蔡斌到達香港站之

【榮耀MagicVFlip小折疊手機預熱：多款應用可直接在外屏打開】榮耀MagicVFlip外屏尺寸為4英寸，屏占比85

這個爸爸連犯兩錯，都被孩子看見！網友還拍了視訊

印軍拒不撤離，被解放軍拽過來暴打日前社交媒體上再次出現了一段中印邊境沖突視訊，拍攝時間很可能是在2020年加勒萬河谷沖

#頭條創作挑戰賽#閱讀《紅樓夢》必須要知道它的原名叫《石頭記》。必須要閱讀《脂硯齋重評石頭記》這個版本。《紅樓夢》裡作者

這是發表在《無線電》雜志1959年第8期P21上的一篇，介紹在電子管擴音機裡的電位器雜音的檢修方法的文章。

應急科普丨遇到自然災害怎麼辦？一篇文章教會你如何緊急避險！