Crawler: Scraping Douyin account data with requests, urllib3, and a spoofed browser

2021-10-27 23:50:00

Output

To be updated...

Code design

from contextlib import closing

import requests, json, time, re, os, sys
import urllib3

# Suppress the warning urllib3 emits for unverified HTTPS requests.
urllib3.disable_warnings(urllib3.exceptions.InsecureRequestWarning)

# Request headers that make the crawler look like a mobile browser (MIUI Browser on Android).
headers = {
    'accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8',
    'accept-encoding': 'gzip, deflate, br',
    'accept-language': 'zh-CN,zh;q=0.9',
    'cache-control': 'max-age=0',
    'upgrade-insecure-requests': '1',
    'user-agent': 'Mozilla/5.0 (Linux; U; Android 5.1.1; zh-cn; MI 4S Build/LMY47V) AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 Chrome/53.0.2785.146 Mobile Safari/537.36 XiaoMi/MiuiBrowser/9.1.3',
}

class DouYin(object):

    def __init__(self, width=500, height=300):
        """
        Douyin app video downloader
        """
        # Headless browser
        chrome_options = Options()
        chrome_options.add_argument(
        # ...

    def get_video_urls(self, user_id):
        """
        Get the video playback URLs.

        Parameters:
            user_id: ID of the user to look up
        Returns:
            video_names: list of video names
            video_urls: list of video links
            nickname: the user's nickname
        """
        # ...

    def video_downloader(self, video_urls, video_names, watermark_flag=False):
        for i in range(len(video_urls)):
            try:
                video_url = video_urls[i]
                # ...

    def run(self):
        user_id = input('Enter a user ID (e.g. 108561773): ')
        # ...

if __name__ == '__main__':
    douyin = DouYin()
    douyin.run()
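
The body of video_downloader is cut off in the listing above, so the following is only a minimal sketch of how a streamed download could be written with the same building blocks (requests, contextlib.closing, and urllib3.disable_warnings). It is not the author's implementation. The names safe_filename, download_video, and MOBILE_HEADERS, the optional session parameter, and the shortened header set are all assumptions made for illustration.

```python
import os
import re
from contextlib import closing

# Hypothetical shortened header set; the full `headers` dict from the
# listing could be passed instead.
MOBILE_HEADERS = {
    "accept-language": "zh-CN,zh;q=0.9",
    "user-agent": (
        "Mozilla/5.0 (Linux; U; Android 5.1.1; zh-cn; MI 4S Build/LMY47V) "
        "AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 "
        "Chrome/53.0.2785.146 Mobile Safari/537.36"
    ),
}


def safe_filename(name, default="video"):
    """Hypothetical helper: turn a video title into a usable .mp4 file name."""
    # Collapse whitespace and characters that are invalid on common file systems.
    cleaned = re.sub(r'[\\/:*?"<>|\s]+', "_", name).strip("_")
    return (cleaned or default) + ".mp4"


def download_video(url, name, folder=".", session=None, chunk_size=1024):
    """Stream one video to disk and return the path that was written."""
    if session is None:
        # Imported lazily so the helper can be exercised with a stand-in session.
        import requests
        import urllib3

        # verify=False below would otherwise trigger InsecureRequestWarning.
        urllib3.disable_warnings(urllib3.exceptions.InsecureRequestWarning)
        session = requests.Session()

    path = os.path.join(folder, safe_filename(name))
    # stream=True avoids loading the whole file into memory; closing()
    # releases the connection even if writing to disk fails.
    with closing(
        session.get(url, headers=MOBILE_HEADERS, stream=True, verify=False, timeout=30)
    ) as response:
        response.raise_for_status()
        with open(path, "wb") as handle:
            for chunk in response.iter_content(chunk_size=chunk_size):
                if chunk:  # skip keep-alive chunks
                    handle.write(chunk)
    return path
```

Under these assumptions, a loop such as `for url, name in zip(video_urls, video_names): download_video(url, name)` would play the role of the truncated video_downloader method. Passing a session explicitly keeps the function easy to test without network access.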
