research-units-pipeline-skills

Name: research-units-pipeline-skills
Author: WILLOSCAR

Verified

Research pipelines as semantic execution units: each skill declares inputs/outputs, acceptance criteria, and guardrails. Evidence-first methodology prevents hollow writing through structured intermediate artifacts.

486stars

38forks

Python

Installation

# Add to your Claude Code skills
git clone https://github.com/WILLOSCAR/research-units-pipeline-skills

Getting Started

Guides for using data processing skills like research-units-pipeline-skills.

Getting Started with AI Skills
First-time install walkthrough for Claude Code, Codex CLI, and ChatGPT.
What is an AI Skills Marketplace?
Definitions, how marketplaces work, and how to choose between them in 2026.

Security ReportVerified

Last scanned: 5/20/2026

{
  "issues": [],
  "status": "PASSED",
  "scannedAt": "2026-05-20T07:43:38.067Z",
  "semgrepRan": false,
  "npmAuditRan": true,
  "pipAuditRan": true
}

README.md

Frequently Asked Questions

What is research-units-pipeline-skills?

research-units-pipeline-skills is an open-source data processing skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by WILLOSCAR. Research pipelines as semantic execution units: each skill declares inputs/outputs, acceptance criteria, and guardrails. Evidence-first methodology prevents hollow writing through structured intermediate artifacts. It has 486 GitHub stars.

Is research-units-pipeline-skills safe to use?

Yes. research-units-pipeline-skills passed SkillsLLM's automated security scan — a dependency vulnerability audit plus prompt-injection heuristics — with no high-severity issues. You can read the full report in the Security Report section on this page.

How do I install research-units-pipeline-skills?

Clone the repository with "git clone https://github.com/WILLOSCAR/research-units-pipeline-skills" and add it to your Claude Code skills directory (see the Installation section above).

What programming language is research-units-pipeline-skills written in?

research-units-pipeline-skills is primarily written in Python. It is open-source under WILLOSCAR on GitHub, so you can review or fork the full source.

Are there alternatives to research-units-pipeline-skills?

Yes. SkillsLLM lists many other Data Processing skills you can browse and compare side by side. Open the Data Processing category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh research-units-pipeline-skills against similar tools.

LLM Engineer for Beginners

Ship LLM features to production - prompts, RAG, structured outputs, evaluation

39 minBeginner

Comments (0)

to leave a comment.

No comments yet. Be the first to share your thoughts!

Related Skills

superpowers

by obra

An agentic skills framework & software development methodology that works.

234,966

Popular in Data Processing

Top skills in this category by stars

academic-research-skills

by Imbad0202

Academic Research Skills for Claude Code: research → write → review → revise → finalize

37,268

Developers Also Liked

Based on votes and bookmarks from developers who liked this skill

academic-research-skills

by Imbad0202

Academic Research Skills for Claude Code: research → write → review → revise → finalize

37,268

awesome-ai-tools sprite-gen

research-units-pipeline-skills

Languages: English | 简体中文

An Auto Research Design System for agent-assisted research work.

This repository combines semantic research skills with a file-first harness. It is meant for people using coding agents such as Codex to run research workflows without losing the intermediate evidence. A run becomes a durable workspace: planned units, intermediate artifacts, checkpoints, audits, and improvement records.

The short version:

intent -> workflow -> workspace -> unit -> skill -> artifact -> audit -> improvement

It is not a generic workflow engine, a prompt collection, or a claim that research can be fully automated. The repo is narrower and more practical: it keeps research work inspectable, resumable, and improvable while the model handles the semantic reading and writing.

What It Produces

Use this repo when the output matters enough that you want files, checkpoints, and reviewable evidence rather than a one-off chat answer.

Goal	Path to use	Main deliverable
Evidence-first literature survey	`arxiv-survey`	`output/DRAFT.md`
Survey with LaTeX/PDF delivery	`arxiv-survey-latex`	`output/DRAFT.md`, `latex/main.pdf`
Course paper or end-of-term report from a topic	Use `arxiv-survey` for Markdown or `arxiv-survey-latex` for PDF; this is a use-case overlay, not a new workflow	report draft, optional PDF
Fast topic briefing and reading path	`research-brief`	`output/SNAPSHOT.md`
Single-paper critique or referee-style review	`paper-review`	`output/REVIEW.md`
Protocol-driven evidence synthesis	`evidence-review`	`output/SYNTHESIS.md`
Literature-grounded research ideas	`idea-brainstorm`	`output/REPORT.md`, `output/REPORT.json`
Tutorial from webpages, PDFs, notes, or repo docs	`source-tutorial`	`output/TUTORIAL.md`, PDF, slides
Guided Chinese thesis organization	`graduate-paper`	thesis project artifacts

Most users choose one of these paths and inspect the workspace outputs. Maintainers work on the pipeline contracts, project skills, harness scripts, and validation rules behind those paths.

How A Run Works

flowchart TD
    A["User intent"] --> B["Executable workflow contract or research-stage design"]
    B --> C["Workspace ledger"]
    C --> D["Units"]
    D --> E["Project skills"]
    E --> F["Artifacts"]
    F --> G["Harness audit"]
    G --> H["Deliverable"]
    G --> I["Improvement record"]
    I --> B

A workflow is the user-facing product path, such as paper-review.
A workspace is one run directory under workspaces/<name>/.
A unit is a small, checkable step in UNITS.csv.
A skill is a reusable research or writing capability under .codex/skills/.
An artifact is an intermediate or final file, usually Markdown, CSV, YAML, JSON, TeX, or PDF.
An audit is a bounded check of workspace state, run state, or output quality.
An improvement record maps weak output back to a concrete repair surface: a skill, pipeline, artifact, validator, or decision.

The design choice is artifact-first execution. The model should not rely on conversation memory to carry a complex research workflow. It should write state, evidence, and decisions to files that can be inspected by humans and reused by later units.

Quick Start

Start an agent session in this repository and ask for a concrete outcome:

Use paper-review to critique this manuscript and give me a lab-style review.

Use research-brief to explain test-time adaptation for robotics and produce a reading path.

Use source-tutorial to turn these webpages and repo docs into a tutorial with PDF and slides.

Write an arxiv-survey-latex survey about embodied agents and show me the outline first.

Use arxiv-survey-latex to write a compact course paper on robot learning. Keep the outline reviewable before drafting and target a final PDF.

For tighter control, name the executable pipeline contract directly:

Research-stage design document:

pipelines/graduate-paper-pipeline.md

Feature guides:

Path	English	中文
`arxiv-survey` / `arxiv-survey-latex`	Guide	说明
`research-brief`	Guide	说明
`paper-review`	Guide	说明
`evidence-review`	Guide	说明
`idea-brainstorm`	Guide	说明
`source-tutorial`	Guide	说明
`graduate-paper`	Guide	说明

Architecture

The repo has two cooperating layers.

Skills hold semantic research behavior:

what sources or inputs to read;
what artifact to write;
what acceptance criteria apply;
what guardrails the model must respect.

Harness holds deterministic execution support:

workspace initialization and recovery;
pipeline contract validation;
unit execution;
doctor, audit, improve, and pack commands;
output manifests and report schemas;
repo-level tests and readiness checks.

Keep this split when extending the project. Put research judgment in skills. Put repeatable checks and recovery logic in the harness.

For the full architecture map and current function map, see docs/AUTO_RESEARCH_DESIGN_SYSTEM.md.

Current Status

The active workflow families are:

Survey: arxiv-survey, arxiv-survey-latex
Orientation: research-brief
Review: paper-review, evidence-review
Ideation: idea-brainstorm
Tutorial: source-tutorial
Thesis: graduate-paper

Seven workflows currently have pipeline contracts, unit templates, and harness validation. graduate-paper is guided thesis design material with thesis-oriented skills; it is not yet a strict executable pipeline.

Course papers and end-of-term reports are treated as a survey use case, not a separate workflow family. Use arxiv-survey when Markdown is enough and arxiv-survey-latex when the class deliverable needs a PDF.

The maintainer roadmap is focused on paper-review: a completed Auto Review workspace, semantic rubric, scorecard, final review, audit, improvement report, and artifact pack. Some of those proof artifacts are not yet required by the current paper-review pipeline contract; the next proof should produce them around the current contract before the contract is promoted. Here, an artifact pack means a manifest of the files that make a run inspectable and portable. Do not add a new workflow family before that proof exists.

For the current catalog and maturity map, see docs/PIPELINE_TAXONOMY.md.

Developer Surface

This section is for maintainers. Use these checks when changing pipeline contracts, skill IO, workspace artifacts, schemas, or validation rules.

uv run python scripts/validate_repo.py --no-check-quality --strict
uv run python scripts/readiness_audit.py --progress workspaces/harness-upgrade/GOAL_STATUS.md --strict
uv run python scripts/audit_skills.py --fail-on WARN
uv run --extra test python -m pytest -q
uv run python scripts/audit_skills.py --review-category template_placeholder --limit 20
uv run python scripts/audit_skills.py --summary-only
uv run python scripts/generate_skill_graph.py

Workspace diagnostics:

uv run python scripts/pipeline.py doctor --workspace workspaces/<name> --write
uv run python scripts/pipeline.py audit --workspace workspaces/<name> --write
uv run python scripts/pipeline.py improve --workspace workspaces/<name> --write
uv run python scripts/pipeline.py pack --workspace workspaces/<name> --write

doctor diagnoses workspace state. audit summarizes the run. improve maps defects to repair surfaces. pack creates a deliverable manifest.

Reading Map

docs/AUTO_RESEARCH_DESIGN_SYSTEM.md: system model and architecture diagram.
docs/PIPELINE_TAXONOMY.md: workflow catalog, maturity, and next proof.
docs/PROJECT_LANGUAGE.md: canonical language for workflow, workspace, unit, artifact, audit, and improvement.
docs/HARNESS_ROADMAP.md: current product and engineering direction.
docs/HARNESS_READINESS.md: local checks and readiness criteria.
docs/SCHEMAS.md: generated report schema names.
docs/adr/: architecture decisions.
SKILL_INDEX.md: skill index.
SKILLS_STANDARD.md: skill authoring standard.

Multi-language feature documentation hubs live under readme/README.*.md.

research-units-pipeline-skills

Frequently Asked Questions

What is research-units-pipeline-skills?

Is research-units-pipeline-skills safe to use?

How do I install research-units-pipeline-skills?

What programming language is research-units-pipeline-skills written in?

Are there alternatives to research-units-pipeline-skills?

Related Skills

Popular in Data Processing

Developers Also Liked

research-units-pipeline-skills

What It Produces

How A Run Works

Quick Start

Architecture

Current Status

Developer Surface

Reading Map

Star History