AI-powered web scraping CLI. Describe what you want, get a production-ready Scrapy spider. Write once, reuse forever.
# Add to your Claude Code skills
```bash
git clone https://github.com/discourselab/scrapai-cli
```
You: "Add https://bbc.co.uk to my news project"
Minutes later you have a tested, production-ready scraper stored in a database. No Python, no CSS selectors, no Scrapy knowledge. The AI agent analyzes the site, writes extraction rules, verifies quality, and saves a reusable config. Run it tomorrow or next year. Same command, no AI costs.
Built by DiscourseLab. Used in production across 500+ websites.
Good fit:
Not a good fit:
See COMPARISON.md for a detailed comparison with Scrapling and crawl4ai.
We needed data for our work. Hundreds of websites, scraped regularly, structured consistently. We got sick of building and maintaining fleets of scrapers.
There are great crawling frameworks out there. Scrapy, crawl4ai, and Scrapling are our favourites, and ScrapAI is built on top of Scrapy. But even with great frameworks, you hit a wall at scale. You still need to write code for every site, monitor for breakage, and fix things when layouts change. 10 scrapers is fine. 100 is a full-time job. 500 is a team.
We looked at three options:
Option 1: Web scraping services. They charge per page, per request, or per API call. Fine for small volumes, but at scale the bills get serious. Stop paying, lose access.
Option 2: AI-powered scraping with LLMs at runtime. Call an LLM on every page to extract data. Clever, but the cost scales linearly with volume. 10,000 pages means 10,000 inference calls. That's wasteful for what is ultimately a pattern-matching problem.
Option 3: AI once, deterministic forever. Use AI at build time to analyze the site and write extraction rules. Then run those rules with Scrapy: no AI in the loop, no per-page costs. The cost is per website, not per page. After that, you own the scraper and run it as many times as you want.
We chose option 3. That's ScrapAI.
Self-hosted, no vendor lock-in. You clone the repo, you own everything. No SaaS, no subscription, no per-page billing. Your scrapers are JSON configs in a database. Export them, share them, move them between projects.
ScrapAI is an orchestration layer on top of Scrapy. Instead of writing a Python spider file per website, an AI agent generates a JSON config and stores it in a database. A single generic spider (DatabaseSpider) loads any config at runtime.
```
You (plain English) → AI Agent → JSON config → Database → Scrapy crawl
                       (once)                              (forever)
```
Why JSON configs instead of AI-generated Python? An agent that writes and executes Python has the same power as an unsupervised developer. If it hallucinates, gets prompt-injected by a malicious page, or loses context, it can do real damage. An agent that writes JSON configs produces data, not code. That data goes through strict validation (Pydantic schemas, SSRF checks, reserved name blocking) before it reaches the database. The worst case is a bad config that extracts wrong fields, caught in the test crawl and trivially fixable. See Security for the full picture.
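To make that validation step concrete, here is a minimal sketch of the kind of schema gate described above, written against Pydantic v2. The field names mirror the example config below, but the reserved-name list and the SSRF check are illustrative assumptions, not ScrapAI's exact rules.

```python
# Minimal sketch of config validation before anything reaches the database.
# Field names follow the example config below; the reserved names and the SSRF
# check are illustrative assumptions, not ScrapAI's exact implementation.
from ipaddress import ip_address
from typing import List, Optional
from urllib.parse import urlparse

from pydantic import BaseModel, field_validator

RESERVED_NAMES = {"default", "all", "settings"}  # hypothetical reserved spider names


class RuleConfig(BaseModel):
    allow: List[str]
    callback: Optional[str] = None
    follow: bool = False


class SpiderConfig(BaseModel):
    name: str
    allowed_domains: List[str]
    start_urls: List[str]
    rules: List[RuleConfig]
    settings: dict = {}

    @field_validator("name")
    @classmethod
    def name_not_reserved(cls, v: str) -> str:
        if v in RESERVED_NAMES:
            raise ValueError(f"{v!r} is a reserved spider name")
        return v

    @field_validator("start_urls")
    @classmethod
    def reject_private_targets(cls, urls: List[str]) -> List[str]:
        # Crude SSRF guard: refuse literal private/loopback IPs. A real check
        # would also resolve hostnames before fetching.
        for url in urls:
            host = urlparse(url).hostname or ""
            try:
                addr = ip_address(host)
            except ValueError:
                continue  # plain hostname, not an IP literal
            if addr.is_private or addr.is_loopback:
                raise ValueError(f"private address not allowed: {url}")
        return urls
```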
Here's what an AI-generated spider config looks like:
```json
{
  "name": "bbc_co_uk",
  "allowed_domains": ["bbc.co.uk"],
  "start_urls": ["https://www.bbc.co.uk/news"],
  "rules": [
    {
      "allow": ["/news/articles/[^/]+$"],
      "callback": "parse_article",
      "follow": false
    },
    {
      "allow": ["/news/?$"],
      "follow": true
    }
  ],
  "settings": {
    "EXTRACTOR_ORDER": ["newspaper", "trafilatura"],
    "DOWNLOAD_DELAY": 2
  }
}
```
Adding a new website means adding a new row. See templates/ for complete working examples — news sites, e-commerce, forums, and Cloudflare-protected sites with full analysis and exported data.
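To show how a single generic spider can consume such a row, here is a stripped-down, DatabaseSpider-style sketch. The SQLite table layout, the argument name, and the bare-bones parse_article are assumptions for illustration, not ScrapAI's actual code.

```python
# Illustrative config-driven spider; the table schema and callback body are
# simplified assumptions, not ScrapAI's real DatabaseSpider.
import json
import sqlite3

from scrapy.linkextractors import LinkExtractor
from scrapy.spiders import CrawlSpider, Rule


def load_config(name, db_path="spiders.db"):
    """Fetch a JSON config from an assumed spiders(name, config) table."""
    with sqlite3.connect(db_path) as conn:
        row = conn.execute(
            "SELECT config FROM spiders WHERE name = ?", (name,)
        ).fetchone()
    if row is None:
        raise KeyError(f"no spider config named {name!r}")
    return json.loads(row[0])


class DatabaseSpider(CrawlSpider):
    name = "database_spider"

    def __init__(self, spider_name, *args, **kwargs):
        cfg = load_config(spider_name)
        self.allowed_domains = cfg["allowed_domains"]
        self.start_urls = cfg["start_urls"]
        # Each JSON rule becomes a standard CrawlSpider Rule; Scrapy resolves the
        # string callback name to a spider method when it compiles the rules.
        self.rules = tuple(
            Rule(
                LinkExtractor(allow=r.get("allow", [])),
                callback=r.get("callback"),
                follow=r.get("follow", False),
            )
            for r in cfg["rules"]
        )
        super().__init__(*args, **kwargs)

    def parse_article(self, response):
        # Real extraction (newspaper/trafilatura, field selectors) is omitted here.
        yield {"url": response.url, "title": response.css("title::text").get()}
```

In a plain Scrapy project this would be invoked as `scrapy crawl database_spider -a spider_name=bbc_co_uk`; ScrapAI wraps the equivalent behind its own CLI.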
ScrapAI is glue. Projects like Scrapy, newspaper, trafilatura, DeltaFetch, and CloakBrowser do the heavy lifting.
Our contribution is the orchestration: the CLI, the database-first spider management, the AI agent workflow, Cloudflare cookie caching, smart proxy escalation, and the glue that holds it together.
Advanced stealth with CloakBrowser. Source-level C++ patches (not JS injection or config flags) achieve 0.9 reCAPTCHA v3 scores and pass 30/30 detection tests including Cloudflare Turnstile (non-interactive auto-pass, managed single-click), FingerprintJS, BrowserScan, DataDome, and ShieldSquare. Fingerprints are compiled into the Chromium binary — detection sites see a real browser because it is a real browser with stealth baked in. Works in headless mode on Linux servers.
Cookie-cached Cloudflare bypass. CloakBrowser solves the challenge once, extracts session cookies, then shuts down. Subsequent requests use Scrapy's fast HTTP engine with cached cookies. Browser reopens every ~10 minutes to refresh. 20-100x faster than tools that keep the browser open for every request (~0.1-0.5s per page vs 5-10s). On a 1,000-page Cloudflare crawl: ~8 minutes vs 2+ hours.
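The caching idea itself fits in a few lines. The sketch below assumes a hypothetical solve_challenge() helper that drives the stealth browser; the TTL and names are illustrative, not ScrapAI's internals.

```python
# Sketch of cookie-cached challenge solving. solve_challenge() is a placeholder
# for driving CloakBrowser; the ~10-minute TTL mirrors the refresh interval above.
import time
from typing import Dict, Tuple

COOKIE_TTL = 600  # seconds
_cookie_cache: Dict[str, Tuple[float, dict]] = {}


def solve_challenge(domain: str) -> dict:
    """Launch the stealth browser, pass the challenge, return clearance cookies."""
    raise NotImplementedError("placeholder for the browser-driven solve step")


def get_cookies(domain: str) -> dict:
    """Fast path: reuse cached cookies; slow path: re-solve when they expire."""
    now = time.time()
    cached = _cookie_cache.get(domain)
    if cached and now - cached[0] < COOKIE_TTL:
        return cached[1]                      # ~0.1-0.5 s requests from here on
    cookies = solve_challenge(domain)         # ~5-10 s with a real browser
    _cookie_cache[domain] = (now, cookies)
    return cookies
```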
Smart proxy escalation. Starts with direct connections. If a site blocks you (403/429), retries through a datacenter proxy and remembers that domain for next time. Residential proxies require explicit opt-in.
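In outline, the escalation decision is a small piece of per-domain state. This is a hedged sketch with a placeholder proxy address, not ScrapAI's actual middleware, which wires the same idea into Scrapy's retry machinery.

```python
# Sketch of per-domain proxy escalation; the proxy URL is a placeholder.
from typing import Optional, Set

BLOCK_CODES = {403, 429}
DATACENTER_PROXY = "http://dc-proxy.example:8080"  # placeholder address
_escalated_domains: Set[str] = set()


def proxy_for(domain: str) -> Optional[str]:
    """Direct connection by default; datacenter proxy once a domain has blocked us."""
    return DATACENTER_PROXY if domain in _escalated_domains else None


def record_response(domain: str, status: int) -> bool:
    """Remember a block and report whether the request should be retried via proxy."""
    if status in BLOCK_CODES and domain not in _escalated_domains:
        _escalated_domains.add(domain)
        return True
    return False
```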
Checkpoint pause/resume. Press Ctrl+C to pause a long crawl, run the same command to resume. Built on Scrapy's native JOBDIR. No progress lost.
Incremental crawling. DeltaFetch skips already-scraped URLs, reducing bandwidth by 80-90% on routine re-crawls.
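Both of the features above lean on standard Scrapy machinery (the JOBDIR setting and the scrapy-deltafetch plugin). A plausible settings snippet looks like the following; the path and middleware priority are illustrative, not ScrapAI's exact configuration.

```python
# Plausible Scrapy settings for checkpointing and incremental crawls.
JOBDIR = "crawls/bbc_co_uk"      # Scrapy persists its request queue here, so a
                                 # Ctrl+C'd crawl resumes when rerun with the same dir

SPIDER_MIDDLEWARES = {
    "scrapy_deltafetch.DeltaFetch": 100,   # from the scrapy-deltafetch plugin
}
DELTAFETCH_ENABLED = True        # skip requests whose items were already scraped
```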
Targeted extraction. Articles get clean structured fields (title, content, author, date) via newspaper and trafilatura. Non-article content (products, jobs, listings) gets custom callbacks with field-level selectors and data processors. The output is structured data, not a page dump.
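As a rough illustration of how an EXTRACTOR_ORDER like ["newspaper", "trafilatura"] can be honoured, the sketch below tries each extractor in turn and keeps the first non-empty result; the field names and fallback rule are simplified assumptions.

```python
# Sketch of an extractor chain honouring EXTRACTOR_ORDER; error handling and the
# full field set are trimmed compared to a real pipeline.
from typing import Optional

import trafilatura
from newspaper import Article


def extract_with_newspaper(url: str, html: str) -> Optional[dict]:
    article = Article(url)
    article.download(input_html=html)   # reuse the HTML Scrapy already fetched
    article.parse()
    if not article.text:
        return None
    return {
        "title": article.title,
        "content": article.text,
        "authors": article.authors,
        "date": article.publish_date,
    }


def extract_with_trafilatura(url: str, html: str) -> Optional[dict]:
    text = trafilatura.extract(html)
    return {"content": text} if text else None


EXTRACTORS = {
    "newspaper": extract_with_newspaper,
    "trafilatura": extract_with_trafilatura,
}


def extract(url: str, html: str, order=("newspaper", "trafilatura")) -> Optional[dict]:
    """Try each configured extractor in order, keeping the first non-empty result."""
    for name in order:
        result = EXTRACTORS[name](url, html)
        if result:
            return result
    return None
```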
Database-first management. Spiders are rows in a database, not Python files on disk. Need to change DOWNLOAD_DELAY across your whole fleet? One SQL query instead of editing 100 files. Export a spider config as JSON, import it into another project. No code drift, no style inconsistencies.
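For instance, with a SQLite backend and a hypothetical spiders(name, config) table holding configs like the one shown earlier, a fleet-wide delay change could be a single JSON1 update; the table and column names here are assumptions.

```python
# Hypothetical one-query fleet update; assumes SQLite with the JSON1 extension
# and a spiders(name, config) table whose config column holds the JSON configs.
import sqlite3

with sqlite3.connect("spiders.db") as conn:
    conn.execute(
        "UPDATE spiders SET config = json_set(config, '$.settings.DOWNLOAD_DELAY', 2)"
    )
```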
Queue and batch processing. Bulk-add hundreds of URLs into a database-backed queue with priorities, status tracking, and retry on failure. The agent processes them in parallel batches of 5, each through the full build-test-deploy workflow.
AI-assisted health checks. ./scrapai health --project news tests all spiders with 5 sample items, detects extraction vs crawling failures, and generates a markdown report for the agent to fix. Run monthly via cron to catch breakage early. When a site redesigns, the agent re-analyzes, updates selectors, and verifies the fix in 5-10 minutes vs 45 minutes manual.
Requirements: Python 3.9+, Git
Supported platforms: Linux, macOS, Windows (WSL or Docker for Cloudflare bypass)
```bash
git clone https://github.com/discourselab/scrapai-cli.git
cd scrapai-cli
./scrapai setup
./scrapai verify
```
./scrapai setup creates the virtual environment, installs