Python中urllib库读取网站相关信息的方法

时间：2024-10-13 10:04:59

1、用 import 代码导入 urllib库，具体如下：import urllib.request

2、建立一个变量名称为 url 赋值为我们需要获取的网站地址，具体代码如下：url = "http://m.sinovision.net/newpneumonia.php"

3、建立一个变量名称为 res，用urllib.request.urlopen() 发送请求具体代码如下：res = urllib.request.urlopen(url)

4、接下来我们获取相关信息，代码如下：print猾诮沓靥(res.status)print(res.geturl())print(res.getcode())print(res.info()像粜杵泳)res.status 为获取状态码res.geturl() 为获取urlres.info() 为获取响应头res.getcode() 为获取状态码

5、运行此程序，查看打印结果，效果如图所示。我们已经打印出了状态码为200url地址，以及状态头。

6、接下来我们建立变量名称为web ，用read方法读取网页源码web = res.read().decode("utf8")

7、用文件写入方法，把读取出来的源码保存到文件当中，具体代码如下：f = open(&孥恶膈茯quot;html",mode="w",encoding="utf8")f.write(web)f.flush()f.close()

8、运行整个代码，出现了名称为html 的文档，里面内容为网站的源码，效果如图

9、整体代码总窑钕仇焱结：import urllib.requesturl = "http://m.sinovision.net/newp荏鱿胫协neumonia.php"res = urllib.request.urlopen(url)print(res.status)print(res.geturl())print(res.getcode())print(res.info())web = res.read().decode("utf8")f = open("html",mode="w",encoding="utf8")f.write(web)f.flush()f.close()

EXCEL表格怎么快速去除小数点？

PS如何给单个图层调整颜色？

PS(Photoshop2022)图层边缘怎么打开

Photoshop通道配合画笔工具合成图像

PS中给照片重命名的方法。