
Python Web Scraper Examples (10 Python Scraping Examples)

  • python
  • 2024-03-26 14:22:22
Python web scraper example
Key steps:
Import the required libraries:
python
import requests
from bs4 import BeautifulSoup
Send an HTTP request:
python
url = "http://www.example.com"
response = requests.get(url)
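In practice a request can hang or come back with an error status, so a slightly more defensive variant is sketched here. The helper name `fetch_html`, the 10-second timeout, the User-Agent string, and the optional `session` parameter are illustrative assumptions, not part of the original example.

```python
import requests


def fetch_html(url, timeout=10, session=None):
    """Return the page body as text, or None if the request fails.

    fetch_html is a hypothetical helper; the timeout and User-Agent
    values are assumptions chosen for illustration.
    """
    # Use the caller's session if one is given, else the requests module itself.
    client = session if session is not None else requests
    headers = {"User-Agent": "example-scraper/0.1"}
    try:
        response = client.get(url, headers=headers, timeout=timeout)
        # Raises requests.HTTPError for 4xx/5xx status codes.
        response.raise_for_status()
    except requests.RequestException as exc:
        print(f"Request failed: {exc}")
        return None
    return response.text
```

Calling `fetch_html("http://www.example.com")` would return the HTML text on success and `None` on a network or HTTP error, which lets the caller skip the parsing step instead of crashing.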
Parse the HTML response:
python
soup = BeautifulSoup(response.text, "html.parser")
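To see what the parsed `soup` object offers without touching the network, the sketch below parses a small hard-coded HTML string. The markup and the variable names are invented for illustration; a real page would come from `response.text`.

```python
from bs4 import BeautifulSoup

# Sample markup made up for this sketch.
html = """
<html>
  <head><title>Example Domain</title></head>
  <body>
    <h1>Hello</h1>
    <p class="intro">First paragraph.</p>
    <p>Second paragraph.</p>
  </body>
</html>
"""

soup = BeautifulSoup(html, "html.parser")

page_title = soup.title.string                           # text inside <title>
first_intro = soup.find("p", class_="intro").get_text()  # first <p class="intro">
paragraph_count = len(soup.find_all("p"))                # number of <p> elements

print(page_title, first_intro, paragraph_count)
```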
Extract the data:
python
# Find every <article> element whose class is "article"
articles = soup.find_all("article", class_="article")
# Extract the title and content of each article
for article in articles:
    title = article.find("h1").text
    content = article.find("div", class_="content").text
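On real pages some articles may lack an `<h1>` or a `content` div. In that case `find` returns `None`, and reading `.text` from it raises `AttributeError`. A minimal sketch of a more tolerant extraction follows; the sample HTML and the helper name `extract_articles` are assumptions made for illustration.

```python
from bs4 import BeautifulSoup

# Made-up sample: the second article has no <h1>.
SAMPLE_HTML = """
<article class="article">
  <h1> First post </h1>
  <div class="content">Body of the first post.</div>
</article>
<article class="article">
  <div class="content">A post without a heading.</div>
</article>
"""


def extract_articles(html):
    """Return a list of (title, content) tuples, skipping incomplete articles."""
    soup = BeautifulSoup(html, "html.parser")
    results = []
    for article in soup.find_all("article", class_="article"):
        title_tag = article.find("h1")
        content_tag = article.find("div", class_="content")
        # find() returns None when nothing matches, so check before reading text.
        if title_tag is None or content_tag is None:
            continue
        results.append((title_tag.get_text(strip=True),
                        content_tag.get_text(strip=True)))
    return results


print(extract_articles(SAMPLE_HTML))
```

Collecting the results in a list also keeps every article available after the loop, instead of only the last `title` and `content`.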
Store or process the data:
python
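# Store the data in a SQLite database
import sqlite3
conn = sqlite3.connect("data.db")
c = conn.cursor()
# Create the table on first run so the INSERT does not fail (column types are an assumption)
c.execute("CREATE TABLE IF NOT EXISTS articles (title TEXT, content TEXT)")
c.execute("INSERT INTO articles (title, content) VALUES (?, ?)", (title, content))
conn.commit()
conn.close()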
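When a page yields many articles, the rows can be collected first and written in one call. The sketch below is one way to do that; the helper name `save_articles`, the table schema, and the sample rows are assumptions, and an in-memory database stands in for `data.db` so the snippet leaves no file behind.

```python
import sqlite3


def save_articles(conn, rows):
    """Insert (title, content) tuples and return the total row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS articles (title TEXT, content TEXT)"
    )
    # executemany runs the parameterized INSERT once per tuple.
    conn.executemany(
        "INSERT INTO articles (title, content) VALUES (?, ?)", rows
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM articles").fetchone()[0]


# ":memory:" keeps the sketch self-contained; use "data.db" to persist to disk.
conn = sqlite3.connect(":memory:")
sample_rows = [("Title A", "Content A"), ("Title B", "Content B")]
total = save_articles(conn, sample_rows)
print(total)
conn.close()
```

The `?` placeholders let SQLite handle quoting, so titles containing quotes or other special characters are stored safely.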
Example:
Scrape article titles and content:
python
import requests
from bs4 import BeautifulSoup
url = "http://www.example.com/articles"
# Send the HTTP request
response = requests.get(url)
# Parse the HTML response
soup = BeautifulSoup(response.text, "html.parser")
# Find all articles
articles = soup.find_all("article")
for article in articles:
    # Extract the title and content
    title = article.find("h1").text
    content = article.find("div", class_="content").text
    # Print the title and content
    print(f"Title: {title}")
    print(f"Content: {content}")
Result:
console
Title: Article Title 1
Content: Article content 1...
Title: Article Title 2
Content: Article content 2...
...