python 爬取天堂圖檔網腳本

2023-08-07 05:23:16

import requests
from lxml import etree
url = 'https://www.ivsky.com/bizhi/'
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/86.0.4240.198 Safari/537.36'
}
res = requests.get(url=url, headers=headers).text
tree = etree.HTML(res)
li_list = tree.xpath('//div[@class="left"]/ul[@class="ali"]/li')
for li in li_list:
    a = li.xpath('.//a/@href')[0]
    print(a)
    a_url = 'https://www.ivsky.com' + a
    print(a_url)
    res2 = requests.get(url=a_url, headers=headers).text
    atree = etree.HTML(res2)
    li_list2 = atree.xpath('//div[@class="left"]/ul[@class="pli"]/li')
    a = 0
    for li2 in li_list2:
        img = li2.xpath('.//img/@src')[0]
        img_name = li2.xpath('.//img/@alt')[0]
        res3 = requests.get(url='https:' + img, headers=headers).content
        a += 1
        print('正在下載下傳：%s' % str(a) + '_' + img_name)
        with open('./img_list/' + str(a) + '_' + img_name + '.jpg', 'wb') as f:
            f.write(res3)

python 爬取天堂圖檔網腳本

繼續閱讀

無法解析的外部符号 wmain，該符号在函數 "void cdecl mainCRTStartupHelper(struct HINSTANCE *,unsigned short con......

TestLink導出用例轉換工具(XML2Excel)

YAML簡介和PyYAML安全操作YAML支援的類型YAML的優點：yaml的基本文法python操作

Small tricks

libsvm for python 安裝

學習軟體測試基礎測試第七天

Zeppelin 配置通路 REST APIApache Zeppelin Configuration REST API

【Torch】最簡潔logging使用指南

27. Remove Element(清單)題目代碼

sort()函數到底是怎樣進行數字排序的

Cloud Studio初體驗

使用 ctypes 進行 Python 和 C 的混合程式設計

【python】【資料處理】畫多元資料分布圖

【python】netconf協定對接管理裝置

「Python 網絡自動化」NETCONF —— Python 使用 NETCONF 管理配置 H3C 網絡裝置

在python中建立excel并寫入