Skip to main content
Open on GitHub

ScrapeGraph AI

ScrapeGraph AI is a service that provides AI-powered web scraping capabilities. It offers tools for extracting structured data, converting webpages to markdown, and processing local HTML content using natural language prompts.

安装与设置

安装所需的包:

pip install langchain-scrapegraph

设置您的API密钥:

export SGAI_API_KEY="your-scrapegraph-api-key"

工具

查看一个 使用示例

有四种工具可供选择:

from langchain_scrapegraph.tools import (
SmartScraperTool, # Extract structured data from websites
MarkdownifyTool, # Convert webpages to markdown
LocalScraperTool, # Process local HTML content
GetCreditsTool, # Check remaining API credits
)

每个工具都有特定的用途:

  • SmartScraperTool: 给定URL、提示和可选的输出模式,从网站提取结构化数据
  • MarkdownifyTool: 将任何网页转换为简洁的Markdown格式
  • LocalScraperTool: 根据提示和可选的输出模式,从本地HTML文件中提取结构化数据
  • GetCreditsTool: 检查您剩余的ScrapeGraph AI积分