<thead id="yt92j"></thead>

溫馨提示×

溫馨提示×

您好，登錄后才能下訂單哦！

密碼登錄×

忘記密碼？

登錄注冊×

獲取短信驗證碼

其他方式登錄

點(diǎn)擊登錄注冊即表示同意《億速云用戶服務(wù)條款》

用戶登錄×

賬戶密碼登錄

請使用微信掃描上方二維碼

使用幫助

請求超時！

請點(diǎn)擊重新獲取二維碼

python爬蟲多次請求超時怎么辦

發(fā)布時間：2021-06-25 13:52:11 來源：億速云閱讀：238 作者：chen 欄目：編程語言

這篇文章主要介紹“python爬蟲多次請求超時怎么辦”，在日常操作中，相信很多人在python爬蟲多次請求超時怎么辦問題上存在疑惑，小編查閱了各式資料，整理出簡單好用的操作方法，希望對大家解答”python爬蟲多次請求超時怎么辦”的疑惑有所幫助！接下來，請跟著小編一起來學(xué)習(xí)吧！

第一種方法

headers = Dict()
url = 'https://www.baidu.com'try:
    proxies = Noneresponse = requests.get(url, headers=headers, verify=False, proxies=None, timeout=3)except:# logdebug('requests failed one time')try:
        proxies = Noneresponse = requests.get(url, headers=headers, verify=False, proxies=None, timeout=3)except:# logdebug('requests failed two time')print('requests failed two time')

總結(jié) ：代碼比較冗余，重試try的次數(shù)越多，代碼行數(shù)越多，但是打印日志比較方便

第二種方法

def requestDemo(url，):
	headers = Dict()
	trytimes = 3  #  重試的次數(shù)for i in range(trytimes):		try:
		    proxies = Noneresponse = requests.get(url, headers=headers, verify=False, proxies=None, timeout=3)#	注意此處也可能是302等狀態(tài)碼if response.status_code == 200:		    	breakexcept:	    	# logdebug(f'requests failed {i}time')	print(f'requests failed {i} time')

總結(jié) ：遍歷代碼明顯比第一個簡化了很多，打印日志也方便

第三種方法

def requestDemo(url， times=1):
	headers = Dict()	try:
	    proxies = Noneresponse = requests.get(url, headers=headers, verify=False, proxies=None, timeout=3)
	    html = response.text()#	todo  此處處理代碼正常邏輯passreturn html	except:    	# logdebug(f'requests failed {i}time')	trytimes = 3  #  重試的次數(shù)if times < trytimes:
    		times += 1   		return requestDemo(url， times)       	return 'out of maxtimes'

總結(jié) ：迭代顯得比較高大上，中間處理代碼時有其它錯誤照樣可以進(jìn)行重試；缺點(diǎn) 不太好理解，容易出錯，另外try包含的內(nèi)容過多時，對代碼運(yùn)行速度不利。

第四種方法

@retry(3)	#	重試的次數(shù) 3def requestDemo(url):
	headers = Dict()
    proxies = Noneresponse = requests.get(url, headers=headers, verify=False, proxies=None, timeout=3)
    html = response.text()#	todo  此處處理代碼正常邏輯passreturn html   
def retry(times):def wrapper(func):def inner_wrapper(*args, **kwargs):i = 0while i < times:try:
                    print(i)return func(*args, **kwargs)except:                	#	此處打印日志  func.__name__ 為say函數(shù)print("logdebug: {}()".format(func.__name__))
                    i += 1return inner_wrapperreturn wrapper

總結(jié) ：裝飾器優(yōu)點(diǎn) 多種函數(shù)復(fù)用，使用十分方便

第五種方法

#!/usr/bin/python# -*-coding='utf-8' -*-import requestsimport timeimport jsonfrom lxml import etreeimport warnings
warnings.filterwarnings("ignore")def get_xiaomi():try:# for n in range(5):  # 重試5次#     print("第"+str(n)+"次")for a in range(5): # 重試5次print(a)
            url = "https://www.mi.com/"headers = {"Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3","Accept-Encoding": "gzip, deflate, br","Accept-Language": "zh-CN,zh;q=0.9,en;q=0.8","Connection": "keep-alive",# "Cookie": "xmuuid=XMGUEST-D80D9CE0-910B-11EA-8EE0-3131E8FF9940; Hm_lvt_c3e3e8b3ea48955284516b186acf0f4e=1588929065; XM_agreement=0; pageid=81190ccc4d52f577; lastsource=www.baidu.com; mstuid=1588929065187_5718; log_code=81190ccc4d52f577-e0f893c4337cbe4d|https%3A%2F%2Fwww.mi.com%2F; Hm_lpvt_c3e3e8b3ea48955284516b186acf0f4e=1588929099; mstz=||1156285732.7|||; xm_vistor=1588929065187_5718_1588929065187-1588929100964","Host": "www.mi.com","Upgrade-Insecure-Requests": "1","User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/75.0.3770.90 Safari/537.36"}
            response = requests.get(url,headers=headers,timeout=10,verify=False)
            html = etree.HTML(response.text)# print(html)result = etree.tostring(html)# print(result)print(result.decode("utf-8"))
            title = html.xpath('//head/title/text()')[0]
            print("title==",title)if "左左" in title:# print(response.status_code)# if response.status_code ==200:breakreturn titleexcept:
        result = "異常"return resultif __name__ == '__main__':
    print(get_xiaomi())

第六種方法

Python重試模塊retrying

# 設(shè)置最大重試次數(shù)@retry(stop_max_attempt_number=5)def get_proxies(self):r = requests.get('代理地址')
    print('正在獲取')raise Exception("異常")
    print('獲取到最新代理 = %s' % r.text)
    params = dict()if r and r.status_code == 200:
        proxy = str(r.content, encoding='utf-8')
        params['http'] = 'http://' + proxy
        params['https'] = 'https://' + proxy

# 設(shè)置方法的最大延遲時間，默認(rèn)為100毫秒(是執(zhí)行這個方法重試的總時間)@retry(stop_max_attempt_number=5,stop_max_delay=50)# 通過設(shè)置為50，我們會發(fā)現(xiàn)，任務(wù)并沒有執(zhí)行5次才結(jié)束！# 添加每次方法執(zhí)行之間的等待時間@retry(stop_max_attempt_number=5,wait_fixed=2000)# 隨機(jī)的等待時間@retry(stop_max_attempt_number=5,wait_random_min=100,wait_random_max=2000)# 每調(diào)用一次增加固定時長@retry(stop_max_attempt_number=5,wait_incrementing_increment=1000)# 根據(jù)異常重試，先看個簡單的例子def retry_if_io_error(exception):return isinstance(exception, IOError)@retry(retry_on_exception=retry_if_io_error)def read_a_file():with open("file", "r") as f:return f.read()

read_a_file函數(shù)如果拋出了異常，會去retry_on_exception指向的函數(shù)去判斷返回的是True還是False，如果是True則運(yùn)行指定的重試次數(shù)后，拋出異常，F(xiàn)alse的話直接拋出異常。
當(dāng)時自己測試的時候網(wǎng)上一大堆抄來抄去的，意思是retry_on_exception指定一個函數(shù)，函數(shù)返回指定異常，會重試，不是異常會退出。真坑人啊！
來看看獲取代理的應(yīng)用(僅僅是為了測試retrying模塊)

到此，關(guān)于“python爬蟲多次請求超時怎么辦”的學(xué)習(xí)就結(jié)束了，希望能夠解決大家的疑惑。理論與實踐的搭配能更好的幫助大家學(xué)習(xí)，快去試試吧！若想繼續(xù)學(xué)習(xí)更多相關(guān)知識，請繼續(xù)關(guān)注億速云網(wǎng)站，小編會繼續(xù)努力為大家?guī)砀鄬嵱玫奈恼拢?/p>

向AI問一下細(xì)節(jié)

推薦閱讀：

免責(zé)聲明：本站發(fā)布的內(nèi)容（圖片、視頻和文字）以原創(chuàng)、轉(zhuǎn)載和分享為主，文章觀點(diǎn)不代表本網(wǎng)站立場，如果涉及侵權(quán)請聯(lián)系站長郵箱：is@yisu.com進(jìn)行舉報，并提供相關(guān)證據(jù)，一經(jīng)查實，將立刻刪除涉嫌侵權(quán)內(nèi)容。

上一篇新聞：
hbuilderx怎么調(diào)用儀表的provider
下一篇新聞：
Linux內(nèi)核中斷初始化的介紹

猜你喜歡

AI
助
手

產(chǎn)品服務(wù)

地區(qū)劃分

專題活動

幫助支持

關(guān)于我們

售后咨詢

7*24小時在線電話：400-100-2938

7*24小時在線 QQ：800811969

關(guān)注億速云

億速云公眾號

手機(jī)網(wǎng)站二維碼

<samp id="z2b4u"></samp>

<td id="z2b4u"><small id="z2b4u"><pre id="z2b4u"></pre></small></td>