【Python】抓取电影海报并下载
生活随笔
收集整理的這篇文章主要介紹了
【Python】抓取电影海报并下载
小編覺得挺不錯的,現(xiàn)在分享給大家,幫大家做個參考.
關于BeautifulSoup :
soup.find_all(class_='v_picConBox mt15')
tag.find("div",{"class": "pic"})
pic.img['data-src']
title =text.span.em.a['title']
#coding=utf-8
import re,os
from urllib.request import urlretrieve
import requests
from bs4 import BeautifulSoupdef geturl(url):html=requests.get(url).contentsoup=BeautifulSoup(html,'lxml')return soup#抓取電影海報
soup=geturl('http://dianying.2345.com/list/kehuan------.html')
bookAlbum=soup.title.string.split('_')[0]
# print(bookAlbum)
tags=soup.find_all(class_='v_picConBox mt15')
# print(tags)
movies=[]
for tag in tags[0].find_all('li'): pic= tag.find("div",{"class": "pic"}) if pic is None:continuetext = tag.find("div",{"class": "txtPadding"}) img_url = pic.img['data-src']title =text.span.em.a['title']# print(title,img_url)movies.append([title,img_url])print(movies)
if not os.path.exists(bookAlbum):os.makedirs(bookAlbum)for movie in movies:filename=os.path.join(bookAlbum,movie[0]+'.png')print(filename)with open(filename,'w') as f:urlretrieve('http:'+movie[1],filename)
總結(jié)
以上是生活随笔為你收集整理的【Python】抓取电影海报并下载的全部內(nèi)容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: TCP/IP 协议——知识集锦
- 下一篇: mysqldump 备份数据库