基于python解析網易雲歌單.

2023-03-08 02:49:21

在寫到解析歌單的時候發現幾個問題

擷取歌曲名字和id的xpath寫法明明和爬取熱榜的寫法一樣

基于python解析網易雲歌單.

id_list=html.xpath('//a[contains(@href,"song?")]')
id_list=id_list[0:-11]
for id in id_list:
    href=id.xpath('./@href')[0]
    song_id=href.split('=')[1]
    songid.append(song_id)
    song_name=id.xpath('./text()')[0]
    songname.append(song_name)

但無法擷取明文資訊

基于python解析網易雲歌單.

研究了下發現,歌單中的資訊被放在了iframe裡.

基于python解析網易雲歌單.

通過查詢後發現可以通過安裝selenium實作.如果僅僅作為爬蟲工具來使用則很簡單,沒什麼入門成本.

from selenium import webdriver
wd = webdriver.Chrome(r'd:\chromedriver.exe')
wd.get('https://music.163.com/#/playlist?id=6702371651')
wd.switch_to.frame(0)
elements =wd.find_elements_by_xpath('//a[contains(@href,"/song?id")]')
name=[]
for i in elements:
    print(i.get_attribute('href'))
elements2 =wd.find_elements_by_xpath('//b[@title]')
for i in elements2:
    print(i.get_attribute('title'))

擷取成功

基于python解析網易雲歌單.

受限于未登入狀态隻能看到部分資訊,隻能看到十條

基于python解析網易雲歌單.

繼續閱讀

TestLink導出用例轉換工具(XML2Excel)

利用Selenium內建TestLink做自動化測試

YAML簡介和PyYAML安全操作YAML支援的類型YAML的優點：yaml的基本文法python操作

Small tricks

libsvm for python 安裝

學習軟體測試基礎測試第七天

Zeppelin 配置通路 REST APIApache Zeppelin Configuration REST API

【Torch】最簡潔logging使用指南

27. Remove Element(清單)題目代碼

sort()函數到底是怎樣進行數字排序的

Cloud Studio初體驗

使用 ctypes 進行 Python 和 C 的混合程式設計

【python】【資料處理】畫多元資料分布圖

【python】netconf協定對接管理裝置

「Python 網絡自動化」NETCONF —— Python 使用 NETCONF 管理配置 H3C 網絡裝置

在python中建立excel并寫入