BeautifulSoup
安装
安装
# terminal
pip3 install bs4 lxml模块导入
from bs4 import BeautifulSoup使用
创建对象及加载
f = open('test.html', 'r')
soup = BeautifulSoup(f, 'lxml')标签定位
语法
说明
提取标签中的内容
语法
说明
Last updated
# terminal
pip3 install bs4 lxmlfrom bs4 import BeautifulSoupf = open('test.html', 'r')
soup = BeautifulSoup(f, 'lxml')Last updated
# 示例
tag = soup.div
tag = soup.find('div')
tag = soup.find(class_='p-3')
tag = soup.find('div', class_='p-3')
tag = soup.find_all('div')
tag = soup.find_all(class_='p-2')
tag = soup.find_all('div', class_='p-2')
tag = soup.select('div')
tag = soup.select('.p-2')
tag = soup.select('div > ul > li')
tag = soup.select('.p-2 a')# 示例
tag = soup.find('div', class_='p-2')
print(tag)
print(tag.string)
print(tag.text)
tag = soup.select('.p-2 a')
print(tag[0]['href'])