name: Summarize testdata changes
description:
Instructions for summarizing changes to Carbon testdata files
(`toolchain/*/testdata`).
Summarize testdata changes
This skill provides instructions for creating a comprehensive report summarizing
changes to Carbon testdata files (toolchain/*/testdata) and associating them
with related code changes.
Goals
Produce a report that:
- Summarizes code changes outside of testdata.
- Groups similar testdata changes together, listing all affected files for
each group. Every change to testdata must be represented by at least one
group. This includes changes to CHECK lines.
- Provides detailed breakdowns of test input changes and diagnostic output
changes in the corresponding group. Every single change to inputs or to
STDERR checks must be explicitly mentioned in the group, with either an
inline diff or a link to the file.
Process
1. Identify Changes
Use your VCS (Git or Jujutsu) or query Github to identify changes. For large
changes, it is recommended to use the included helper script to extract test
input changes.
For Git Users:
- Summarize code changes:
git diff --stat -- ':!toolchain/*/testdata'
- To see content of non-testdata changes:
git diff -- ':!toolchain/*/testdata'
- Identify testdata changes:
git diff --name-only 'toolchain/*/testdata'
For Jujutsu (jj) Users:
- Summarize code changes:
jj --no-pager diff --stat '~toolchain/*/testdata'
- Note: Quoting the fileset
'~toolchain/*/testdata' is critical if it
contains wildcards.
- To see content of non-testdata changes, use
--git to get standard
unified diff format: jj --no-pager diff --git '~toolchain/*/testdata'
- Identify testdata changes:
jj --no-pager diff --name-only 'toolchain/*/testdata'
For Github Pull Requests:
- Summarize code changes:
gh pr diff
- Identify testdata changes:
gh pr diff --name-only | grep '^toolchain/.*/testdata'
Handling Specific Revisions:
If you are summarizing changes in a specific revision (for example, @-) or
pull request (for example, #1234), add -r <rev> or <pr_number> to the
commands:
git diff <rev>^ <rev> ... (or use git show <rev>)
jj --no-pager diff -r <rev> ...
gh pr diff <pr_number>
2. Extract Test Input Changes (Recommended)
To easily identify changes, use the included Python helper script to extract all
text additions and removals from the diff, categorized by Input, STDERR, and
STDOUT changes. This script reads a unified diff from stdin.
# For Git:
git diff -- 'toolchain/*/testdata' | python3 .agents/skills/summarize_testdata_changes/scripts/parse_diff.py
# For Jujutsu (jj):
jj diff --git 'toolchain/*/testdata' | python3 .agents/skills/summarize_testdata_changes/scripts/parse_diff.py
# For a specific revision with jj:
jj diff -r @- --git 'toolchain/*/testdata' | python3 .agents/skills/summarize_testdata_changes/scripts/parse_diff.py
# For a specific PR with Github:
gh pr diff 1234 | python3 .agents/skills/summarize_testdata_changes/scripts/parse_diff.py
3. Identify Patterns and Produce a List of Groups
- Read the diff and produce a list of groups of changes that share a common
theme or cause (for example, "Updated expected output for integer literals",
"Added tests for new keyword").
- CRITICAL: Every single change in the testdata diff must be represented
by at least one group. Do not ignore changes to
CHECK lines.
- If it's not clear what group a change belongs to, create a new group for
it.
- For each group:
- Provide a brief description of the group.
- (Optional) Briefly note if the group appears to be an intended or
unintended consequence of the code changes.
- Divide the groups into sections:
- Test Changes: Changes to test inputs (lines not prefixed with
// CHECK), along with diagnostic output changes where relevant
- Diagnostic Changes: Changes to diagnostic output (lines prefixed with
// CHECK:STDERR) with no corresponding changes to test inputs
- [Output Type] Changes: Changes to STDOUT (lines prefixed with
// CHECK:STDOUT)
- Create one section for each relevant kind of test. For example,
parser tests should typically be in a "Parse Tree Changes" section,
check tests should typically be in a "SemIR Changes" section, and
lower tests should typically be in an "LLVM IR Changes" section.
4. Improve Grouping
- Read the list of groups and check to see if any of them should be combined
or split apart. If needed, do so.
5. Assign Changes to Groups
- Read the diff again, and then for each change in the diff:
- Add the change to the appropriate group (or, rarely, groups).
- CRITICAL: Every single change in the testdata diff must be
represented by at least one group. Do not ignore changes to
CHECK
lines.
- If the change affects test inputs (lines not prefixed with
// CHECK)
or diagnostic output (lines prefixed with // CHECK:STDERR):
- List the file within the group. Don't just give one or a few
examples. Include every file.
- Provide an inline diff if the change is small.
- Provide a link to the file if the change is large.
- Otherwise, if the change only affects STDOUT (lines prefixed with
// CHECK:STDOUT):
- Ensure the group contains a representative example that matches the
current change.
- The representative example should be an inline diff of the change.
- CRITICAL: Every single change to test inputs and diagnostic
outputs in the files being summarized must be explicitly listed in at
least one group. Do not skip changes, even if they are similar to
changes you've already seen, and do not just give examples.
6. Validation
As a final validation step:
- Read through the testdata diff again.
- Ensure that every change in the diff is reflected by at least one group in
the report.
Report Template
Use the following template for the generated report:
# `testdata` Change Summary
## Code Changes
[One paragraph summarizing changes outside of testdata.]
## Test Changes
### [Group Name]
[Description of the group.]
[Change 1: diff context OR link]
[Change 2: diff context OR link]
...
## Diagnostic Changes
### [Group Name]
[Description of the group.]
[Change 1: diff context OR link]
[Change 2: diff context OR link]
...
## [Output Type] Changes
### [File Path]
[Description of the group.]
[Example diff context]
Changes of this kind were found in [Number] files. Examples: [List of files]
...
Skip sections that would be empty.