LawFlow: Collecting and Simulating Lawyers' Thought Processes

Debarati Das Debarati Das
Computer Science
Khanh Chi Le Khanh Chi Le*
Computer Science
Ritik Parkar Ritik Parkar*
Computer Science
Karin De Langis Karin De Langis
Computer Science
Brendan Madson Brendan Madson
Law School
Chad Berryman Chad Berryman
Law School
Robin Willis Robin Willis
Law School
Daniel Moses Daniel Moses
Law School
Dongyeop Kang Dongyeop Kang+
Computer Science
*project leads,    +senior advisors


                 

Research Overview

Data Collection Tool

Our database captures real-world business entity formation scenarios, starting with a seed scenario provided by a local business. Through conversations with the business owner, a lawyer initiates the legal process, meticulously documenting and taking notes of each step—from initial discussions to formal filings. The process culminates in a finalized agreement document, serving as the definitive legal outcome.

Key Features:

This data collection tool enables us to study not just what decisions are made while drafting by law students, but also how they adapt, iterate, and navigate ambiguity across the full drafting workflow.

Entity Formation Workflow

Our data collection tool is rooted in a visual, expert-informed task plan that maps out the full legal drafting workflow—from initial client intake to final agreement. 🎯 This task plan supports flexible navigation across subtasks in the data collection tool, while maintaining a clear structure rooted in real-world practice.

The workflow is organized across three levels of detail:

This multi-layered structure allows us to capture the dynamic, non-linear nature of legal work—supporting both detailed analysis and AI tools that align with how real practitioners think and operate.

For example, the entity formation workflow includes five key stages:

Together, these levels form a flexible yet robust blueprint for capturing and analyzing legal workflows in real-world scenarios.

Generated Operating Agreements

Each scenario in our dataset is grounded in real-world business needs, with law students simulating the full legal workflow from client intake to final agreement drafting. Human workflows are flexible and iterative, often looping back to revise earlier steps. In contrast, LLMs follow a more rigid, linear path—highlighting key differences in how humans and AI approach complex legal reasoning.

Human Execution Graph
Human Execution Graph
O1 Execution Graph
OpenAI O1 Execution Graph

Insights from LawFlow

Dataset Card

We gathered data from law students working through real-world business formation scenarios—an example is shown below under the 'Human' tab. Using the same task plan, we also had LLMs like DeepSeek R1 and OpenAI O1 generate workflows for these scenarios. Switch to the LLM tab to see a sample output from the O1 model.

Call for Participation

We plan to scale up our data collection and develop innovative legal reasoning benchmarks for evaluating long-range, multi-turn, deep legal reasoning tasks. We welcome collaborators, funders, and researchers interested in exploring AI for knowledge intensive tasks, such as legal processing. If you’d like to try our tool or contribute to our dataset, please get in touch! Contact us at dongyeop@umn.edu