python網頁爬蟲通用代碼示例

2022-11-18 13:46:12

import requests
#import time

def getHTMLText(url):
    try:
        r = requests.get(url,timeout = 30)
        r.raise_for_status()#如果狀态不是200，産生HTTPError異常
        #print(r.status_code)
        r.encoding = r.apparent_encoding
        return r.text
    except:
        return "産生異常"

if __name__ == '__main__':
    url = "http://www.baidu.com"
    print(getHTMLText(url))

html Python爬蟲通用模闆網頁爬蟲

上一篇: android 與伺服器之間的推送方式

下一篇: python單線程爬蟲安裝與調試

繼續閱讀

HTML addEventListener() 方法和attachEvent()差別分析
html javascript
08-07
Boss直聘Python爬蟲實戰
Python python程式設計 Python爬蟲網絡爬蟲程式設計語言
08-07
web前端布局練手項目
工程師的素養--前端 css html web
08-07
Django之驗證碼（十七）驗證碼
html django
08-07
Vue項目 - 單檔案元件和Vue中的路由
vue.js 前端 webpack html javascript
11-09
龍珠訓練營task04
html sql html5
08-07
趕工心得（一）
胡謅八扯&想法工作 html 程式設計 css web
08-07
一個小小的移動web版音樂播放器
小嘗試 html
08-07
Docker - Dockerfile之ADD、COPY、WORKDIR、USER、EXPOSE指令詳解
Docker Linux centos 目标路徑 html
11-09
Compile workrave under windows &ndash; My exprience 在Windows上編譯Workrave
C/C++ Linux Windows download reference include macros html
08-07
門戶通專訪草根站長九天狼：做站貴在堅持
一滴水的站長訪談資料庫工作 html 搜尋引擎百度工具
08-07
tabpanel 使用問題
javascript EXT fp css html ViewUI
08-07
為什麼把CSS放頭部，script放下面
# 面試筆記（一）【html和浏覽器】篇 html
08-07
CSS之折疊菜單
前端 css html
08-07
web開發之前後端渲染
前端 react 模闆引擎渲染 html
08-07
403 Forbidden，You don't have permission to access / on this server.Forbidden
html
08-07