AI4Science · Scientific Workflow Data Collection

ShortComputationalSimulation

SciHarbor

Lead: Young-Jun Lee · 1–2 hr per task

If your lab works with simulators or computational tools, contribute well-specified tasks that become a public benchmark for AI agents on real scientific software.

What we collectA task description, its expected outcome, and the metrics that decide success — across chemistry, biology, aerospace, civil, and materials science.

Step 1

Scope

A short interview to scope the tasks your lab cares about.

Step 2

Submit

Specify each task — description, expected outcome, evaluation metrics — as a YAML manifest.

Step 3

Benchmark

We run agents against your tasks and post results to a public leaderboard.

Co-authorshipAll participants are invited as co-authors

ProducesSciHarbor — a benchmark evaluating AI agents on real simulators (OpenFOAM, GROMACS, AlphaFold, Ansys, SAP2000, OpenRocket…). 300+ tasks in Phase 1, MIT-licensed, with an open leaderboard.

Phase 2 extends to physical workflows once robotic arms are installed.

Contact: Young-Jun Lee ↗

Project site ↗ Team ↗

ShortHands-onVideo

Lab Workflows

Lead: Karin de Langis · 1–2 hr per session

If your lab is running physical experiments, record short sessions so we can test how well vision-language models understand the goals behind each action — the User-Aware AI effort.

What we collectHead-mounted video with think-aloud narration; the participant's intent is annotated as the prediction target.

Step 1 · 30 min

Interview

Orientation, instructions, and scheduling.

Step 2 · 2 hr

Recording

Record two ~1-hour sessions in the lab.

Step 3 · 30 min

Reflection

Interview about actions, intentions, and goals.

Compensation$150 per participantup to $50 / hr · co-authorship for contributing labs

ProducesA dataset of hands-on lab workflows for benchmarking how well AI connects actions to scientific goals and intentions.

Example

Contact: Karin de Langis ↗

Project site ↗ Participate ↗

Long-horizonLongitudinalDigital

ResearchWorkflow

Lead: Khanh Chi Le · 6–12 months

If you are running a long-term project in digital tools, let us record how it unfolds from inception to publication.

What we collectKeystrokes, screen video, meeting notes, and documents — gathered passively as you work.

Step 1 · 1 hr

Onboarding

Set up recording and connect your tools.

Step 2 · 12 mo

Record & collect

Auto-records sessions and collects research files.

Step 3 · 3 × 1 hr

Reflection

Interviews about your research reasoning and decisions.

Compensation$150 per team / 4 monthsup to $600 total

ProducesA longitudinal dataset of how research unfolds — to study how early actions shape later ones, and to build better AI research assistants.

Example

Contact: Khanh Chi Le ↗

How does real science actually get done?
Help us record it.

One question, three scales of data

The three projects

SciHarbor

Lab Workflows

ResearchWorkflow

How to get involved

Participate

Collaborate

The team & contributors

Project leads

Young-Jun Lee

Karin de Langis

Khanh Chi Le

Dongyeop Kang

Contributor PIs — collaborating faculty across UMN departments

Jihye Park

Chris Bartel

Seung Hwan (Allen) Lee

Seongjin Choi