Python爬虫实战：网易云音乐爬取！

程序员文章站 2022-07-02 09:10:17

本次目标爬取网易云音乐https://music.163.com/PS：如有需要Python学习资料的小伙伴可以加点击下方链接自行获取python免费学习资料以及群交流解答点击即可加入环境python 3.6pycharm爬虫代码导入工具import requestsimport re请求网站、解析网站数据def get_music_url(music_id, music_title): url = 'https://api.zhuol.....

本次目标

爬取网易云音乐

https://music.163.com/

Python爬虫实战：网易云音乐爬取！

PS：如有需要Python学习资料的小伙伴可以加点击下方链接自行获取

python免费学习资料以及群交流解答点击即可加入

环境

python 3.6
pycharm

爬虫代码

导入工具

import requestsimport re

请求网站、解析网站数据

def get_music_url(music_id, music_title):
    url = 'https://api.zhuolin.wang/api.php'
    headers = {
        'Accept': '*/*',
        'Accept-Encoding': 'gzip, deflate, br',
        'Accept-Language': 'zh-CN,zh;q=0.9',
        'Cache-Control': 'no-cache',
        'Connection': 'keep-alive',
        'Cookie': 'UM_distinctid=175aca5b31d39e-06d658eceb014a-3962420d-1fa400-175aca5b31e92e',
        'Host': 'api.zhuolin.wang',
        'Pragma': 'no-cache',
        'Referer': 'https://music.zhuolin.wang/',
        'Sec-Fetch-Dest': 'script',
        'Sec-Fetch-Mode': 'no-cors',
        'Sec-Fetch-Site': 'same-site',
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.138 Safari/537.36',
    }
    params = {
        'callback': 'jQuery111305698848623906863_1604919341715',
        'types': 'url',
        'id': '{}'.format(music_id),
        'source': 'netease',
        '_': '1604919341751',
    }
    response = requests.get(url=url, params=params, headers=headers)
    html_data = response.text
    if music_url == '':
        print('无音频下载链接')
 
def music_id():
    url = 'https://music.163.com/discover/toplist'
    headers = {
        'user-agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.138 Safari/537.36'
    }
    response = requests.get(url=url, headers=headers)
    lis = re.findall('<li><a href="(.*?)">(.*?)</a></li>', response.text, re.S)[0:100]
    for i in lis:
        music_id = i[0].split('id=')[-1]
        title = i[1]
        pattern = re.compile(r"[\/\\\:\*\?\"\<\>\|]")  # '/ \ : * ? " < > |'
        music_title = re.sub(pattern, "_", title)  # 替换为下划线
        get_music_url(music_id, music_title)

保存数据

    else:
        path = '保存地址\\' + music_title + '.mp3'
        response = requests.get(url=music_url)
        with open(path, mode='wb') as f:
            f.write(response.content)
            print(music_title, music_url)

运行代码，结果如下图

Python爬虫实战：网易云音乐爬取！

本文地址：https://blog.csdn.net/weixin_43881394/article/details/109643806

相关标签： Python 人工智能数据挖掘大数据数据分析

上一篇：生活中与山药相宜相克的食物有哪些

下一篇： numpy入门：大作业

Python爬虫实战：网易云音乐爬取！

本次目标

环境

爬虫代码

运行代码，结果如下图

xpath+多进程爬取网易云音乐热歌榜。

网易云歌单信息爬取及数据分析（python爬虫）

Python爬虫实战教程：爬取网易新闻

Python爬虫实战用 BeautifulSoup 爬取电影网站信息

python爬虫教程：爬取酷狗音乐

Python学习-使用Python爬取陈奕迅新歌《我们》网易云热门评论

python爬虫实战爬取B站柯南弹幕+梳理主线剧情

Python3爬虫系列：理论+实验+爬取妹子图实战

Python爬虫实战之Requests+正则表达式爬取猫眼电影Top100

Python爬虫，爬取腾讯漫画实战

Python爬虫实战：网易云音乐爬取！

本次目标

环境

爬虫代码

运行代码，结果如下图

xpath+多进程爬取网易云音乐热歌榜。

网易云歌单信息爬取及数据分析（python爬虫）

Python爬虫实战教程：爬取网易新闻

Python爬虫实战用 BeautifulSoup 爬取电影网站信息

python爬虫教程：爬取酷狗音乐

Python学习-使用Python爬取陈奕迅新歌《我们》网易云热门评论

python爬虫实战 爬取B站柯南弹幕+梳理主线剧情

Python3爬虫系列：理论+实验+爬取妹子图实战

Python爬虫实战之Requests+正则表达式爬取猫眼电影Top100

Python爬虫，爬取腾讯漫画实战

python爬虫实战爬取B站柯南弹幕+梳理主线剧情