data2story-skill

Name: data2story-skill
Author: QinghongLin

Pending

Data Journalist Agent: Transforming Data into Verifiable Multimodal Story

106stars

9forks

Python

Installation

# Add to your Claude Code skills
git clone https://github.com/QinghongLin/data2story-skill

Getting Started

Guides for using ai agents skills like data2story-skill.

Caveman: Cut Claude Token Use by 65%
How agent-side prompt compression works, when to use it, and when not to.
What is an AI Skills Marketplace?
Definitions, how marketplaces work, and how to choose between them in 2026.
Getting Started with AI Skills

README.md

Frequently Asked Questions

What is data2story-skill?

data2story-skill is an open-source ai agents skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by QinghongLin. Data Journalist Agent: Transforming Data into Verifiable Multimodal Story. It has 106 GitHub stars.

Is data2story-skill safe to use?

data2story-skill's catalog security scan is still queued. You can run an instant dependency and prompt-injection check now with the "Scan for vulnerabilities" button above.

How do I install data2story-skill?

Clone the repository with "git clone https://github.com/QinghongLin/data2story-skill" and add it to your Claude Code skills directory (see the Installation section above).

What programming language is data2story-skill written in?

data2story-skill is primarily written in Python. It is open-source under QinghongLin on GitHub, so you can review or fork the full source.

Are there alternatives to data2story-skill?

Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh data2story-skill against similar tools.

Agentic AI for Beginners

Build your first AI agent from scratch - tool use, ReAct pattern, memory, deployment

41 minBeginner

Comments (0)

to leave a comment.

No comments yet. Be the first to share your thoughts!

Related Skills

superpowers

by obra

An agentic skills framework & software development methodology that works.

234,966

OnionClaw os-moda

Data2Story

A data-journalist agent skill that turns any dataset into a verifiable, evidence-grounded multimodal story — a self-contained HTML article where every sentence traces back to the data or source that justifies it.

What it does

Dataset → finished article. Point it at a CSV, a folder of data, or a paper, and a fixed pipeline of seven roles produces a publishable HTML story end to end.
Evidence-grounded. Every sentence and visual links back to its source. The pipeline emits a viewer.html inspector where you can click any claim to see the data, code, or citation behind it.
Multimodal by default. Charts, images, video, audio, maps, and interactive elements — chosen from the data's actual properties, not a fixed checklist. Media generation routes through OpenRouter.
Verifiable. Each run writes to its own versioned project folder and snapshots the exact skill versions used, so every result can be traced and re-checked.
Progressive disclosure. Each role's SKILL.md holds only its instructions; bulky reference material (output schemas, field rules, lookup tables) lives in that role's references/ folder as JSON and is loaded only when needed.

Installation & usage

Data2Story is an agent skill. The orchestrator lives in skills/data2story/SKILL.md — it works first-class with Claude Code, and equally with Codex, Cursor, Gemini CLI, and other agents.

Set your API key. Media generation routes through OpenRouter by default:
```
export OPENROUTER_API_KEY=sk-or-...
```
Run the skill on a dataset:
- Claude Code — make the skill available (place skills/data2story/ under ~/.claude/skills/, or run from inside this repo), then:
```
/data2story data/pick_a_card
```
- Codex / other agents — open the repo and ask the agent to follow the orchestrator:
```
Read skills/data2story/SKILL.md and run the Data2Story pipeline on data/pick_a_card
```
Open the output: index.html (the finished article) and viewer.html (the evidence inspector).

Project structure

data2story-skill/
├── skills/
│   ├── data2story/        the agent: SKILL.md (orchestrator) + one folder per role
│   │                      each role = SKILL.md + references/ (JSON) + scripts/ (the tools it runs)
│   │                        · designer/scripts/  — OpenRouter media tools (text→image/video/music, embeddings)
│   │                        · inspector/scripts/ — verify.py + generate_viewer.py
│   │                        · detective/scripts/ — Wikimedia/Commons fetch helpers
│   └── frontend-design/   shared UI/visual design system the Designer & Programmer borrow from
├── data/     example datasets
└── assets/   shared images

The virtual newsroom

Think of it as a small newsroom in a box. Each role reads what the previous one produced, then adds its own artifact — a fixed pipeline that runs once, end to end.

#	Role	What it does	Produces
1	Detective	Researches external context — domain background, history, why the data matters	`detective.json`
2	Analyst	Exhaustively profiles the data — distributions, correlations, trends, anomalies	`analyst.json`, `code/*.py`
3	Editor	Decides the narrative — what the article argues and which findings matter	`editor.md`, `editor.json`
4	Designer	Chooses how to show each point — charts, images, video, audio, interactives	`designer.json`, `assets/`
5	Programmer	Builds the final HTML, tagging every element with its source IDs	`index.html`
6	Auditor	Fixes layout issues — overlap, spacing, alignment — without changing content	`index.html` (fixed), `auditor.json`
7	Inspector	Verifies every sentence traces to its evidence; builds an interactive viewer	`inspector.json`, `viewer.html`

flowchart TB
    DATA[(dataset)]

    subgraph plan [Plan and design]
        direction LR
        DET[Detective] -->|detective.json| ANA[Analyst]
        ANA -->|analyst.json + code| EDT[Editor]
        EDT -->|editor.md + editor.json| DES[Designer]
    end

    subgraph build [Build and verify]
        direction LR
        PRG[Programmer] -->|index.html| AUD[Auditor]
        AUD -->|index.html| INS[Inspector]
    end

    DATA --> DET
    DES -->|designer.json + assets| PRG
    INS -->|inspector.json + viewer.html| OUT([article + evidence viewer])

License

Released under the MIT License.

Acknowledgement

If you use Data2Story in your research, please kindly cite:

@article{data2story,
  title   = {Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories},
  author  = {Lin, Kevin Qinghong and EI, Batu and Shi, Yuhong and Lu, Pan and Torr, Philip and Zou, James},
  journal = {arXiv preprint arXiv:2606.11176},
  year    = {2026}
}