Strom: bf6f21b8e9

Gemini & AI Assistant Guide for Carbon

This document provides high-density technical context for AI assistants (and humans!) contributing to the Carbon Language project. If you are an AI assistant, read this first to avoid common pitfalls.

Project Structure
Building and Testing
Debugging and Diagnostics
C++ Coding Patterns
Common Pitfalls

Project Structure

toolchain/: The C++ implementation of the compiler (Toolchain).
- toolchain/check/: Semantic analysis (SemIR generation).
- toolchain/parse/: Parsing (Token -> Parse Tree).
- toolchain/lex/: Lexing (Source -> Tokens).
- toolchain/sem_ir/: Semantic Intermediate Representation (SemIR) definitions.
- toolchain/lower/: Lowering to LLVM IR.
proposals/: Evolution proposals.

Note: The explorer codebase (a prototype interpreter) has been moved to its own repository. You may see references to it in old proposals or documentation, but it is not part of the active toolchain development in this repository.

Building and Testing

We use Bazel.

Essential Commands

Test everything: bazel test //...
Test specific target: bazel test //toolchain/check:check_test
Build toolchain: bazel build //toolchain/...

Updating Test Data

Carbon tests often use file_test (for example, //toolchain/testing/file_test). If you change compiler behavior, you likely need to update expected test outputs. Do not manually edit thousands of lines of expected output. Use the script:

./toolchain/autoupdate_testdata.py
# Or for a specific file:
./toolchain/autoupdate_testdata.py toolchain/check/testdata/my_test.carbon

Pre-commit

Running pre-commit is mandatory.

pre-commit run -a

Debugging and Diagnostics

Printing to stderr: Use llvm::errs() << "debug info\n"; or std::cerr.
- Avoid std::cout (it may interfere with tool output).
SemIR Stringification:
- SemIR objects often have a Print method or operator<<.
- inst.Print(llvm::errs())
Debugging Crashes:
- Bazel sandboxing can hide artifacts. Use --sandbox_debug if needed, but often running the binary directly from bazel-bin/ is easier for debugging.

C++ Coding Patterns

Carbon's toolchain uses LLVM-style C++ with some specific conventions.

Error Handling

No Exceptions: We do not use C++ exceptions.
ErrorOr<T>: Return ErrorOr<T> for fallible operations.
- Check with if (auto result = Function(); result) { Use(*result); }
llvm::Expected<T>: Similar to ErrorOr, used when interfacing with LLVM.

Casting (LLVM Style)

Use llvm::cast<T>(obj) (checked, asserts on failure).
Use llvm::dyn_cast<T>(obj) (returns null on failure).
Use llvm::isa<T>(obj) (boolean check).
Avoid dynamic_cast and standard RTTI.

Data Structures

Prefer LLVM ADTs: llvm::SmallVector, llvm::StringRef, llvm::DenseMap.
StringRef is a view; be careful with lifetimes.

Common Pitfalls

Legacy explorer references: The explorer prototype has been moved. Ignore references to it in proposals or old docs; focus on toolchain.
Manually updating test files: Always check if autoupdate_testdata.py can do it for you.
Using std::string unnecessarily: Prefer llvm::StringRef for arguments.
Header Includes: We use specific include orders (often enforced by clang-format).

GEMINI.md 3.9 KB

História Raw

Gemini & AI Assistant Guide for Carbon

Table of Contents

Project Structure

Building and Testing

Essential Commands

Updating Test Data

Pre-commit

Debugging and Diagnostics

C++ Coding Patterns

Error Handling

Casting (LLVM Style)

Data Structures

Common Pitfalls

GEMINI.md 3.9 KB História Raw

Gemini & AI Assistant Guide for Carbon

Table of Contents

Project Structure

Building and Testing

Essential Commands

Updating Test Data

Pre-commit

Debugging and Diagnostics

C++ Coding Patterns

Error Handling

Casting (LLVM Style)

Data Structures

Common Pitfalls

GEMINI.md 3.9 KB

História Raw